Functions
| bool | is_cjk_enabled () | Should we use the CJK n-gram code? |
| bool | codepoint_is_cjk (unsigned codepoint) | |
| std::string | get_cjk (Xapian::Utf8Iterator &it) | |
bool CJK::codepoint_is_cjk (unsigned codepoint)
Definition at line 73 of file cjk-tokenizer.cc.
Referenced by get_cjk(), Xapian::TermGenerator::Internal::index_text(), and Xapian::QueryParser::Internal::parse_term().
string CJK::get_cjk (Xapian::Utf8Iterator & it)
Definition at line 88 of file cjk-tokenizer.cc.
References Xapian::Unicode::append_utf8(), codepoint_is_cjk(), and Xapian::Internal::str().
Referenced by Xapian::TermGenerator::Internal::index_text() and Xapian::QueryParser::Internal::parse_term().
bool CJK::is_cjk_enabled ()
Should we use the CJK n-gram code?
The first time this is called, it reads the environment variable XAPIAN_CJK_NGRAM and returns true if it is set to a non-empty value. The result is cached, so subsequent calls return the same value.
Definition at line 43 of file cjk-tokenizer.cc.
Referenced by Xapian::TermGenerator::Internal::index_text().
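The read-once-then-cache behaviour described for is_cjk_enabled() can be illustrated with a minimal sketch. This is not the code in cjk-tokenizer.cc; the function name cjk_ngram_enabled_sketch is hypothetical, and the use of a function-local static for the cache is an assumption chosen for brevity. Only the environment variable name XAPIAN_CJK_NGRAM and the "set to a non-empty value" rule come from the description above.

```cpp
#include <cstdlib>

// Hypothetical stand-in for the documented behaviour; not Xapian's implementation.
bool cjk_ngram_enabled_sketch()
{
    // A function-local static is initialised exactly once, on the first call,
    // so the environment is consulted only that one time.
    static const bool enabled = [] {
        const char* value = std::getenv("XAPIAN_CJK_NGRAM");
        // Enabled only if the variable exists and is not the empty string.
        return value != nullptr && *value != '\0';
    }();
    return enabled;
}
```

One consequence of this kind of caching is that changing XAPIAN_CJK_NGRAM after the first call has no effect for the rest of the process, so the variable would need to be set before any indexing starts.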