#include <cjk-tokenizer.h>

Public Member Functions | |
| CJKTokenIterator (const std::string &s) | |
| CJKTokenIterator (const Xapian::Utf8Iterator &it_) | |
| CJKTokenIterator () | |
| const std::string & | operator* () const |
| CJKTokenIterator & | operator++ () |
| unsigned | get_length () const |
| Get the length of the current token in Unicode characters. | |
Private Attributes | |
| Xapian::Utf8Iterator | it |
| Xapian::Utf8Iterator | p |
| unsigned | len |
| std::string | current_token |
Friends | |
| bool | operator== (const CJKTokenIterator &, const CJKTokenIterator &) |
Definition at line 51 of file cjk-tokenizer.h.
| CJKTokenIterator::CJKTokenIterator | ( | const std::string & | s | ) | [inline] |
Definition at line 61 of file cjk-tokenizer.h.
| CJKTokenIterator::CJKTokenIterator | ( | const Xapian::Utf8Iterator & | it_ | ) | [inline] |
Definition at line 64 of file cjk-tokenizer.h.
| CJKTokenIterator::CJKTokenIterator | ( | ) | [inline] |
Definition at line 67 of file cjk-tokenizer.h.
| unsigned CJKTokenIterator::get_length | ( | ) | const [inline] |
Get the length of the current token in Unicode characters.
Definition at line 75 of file cjk-tokenizer.h.
References len.
| const string & CJKTokenIterator::operator* | ( | ) | const |
Definition at line 99 of file cjk-tokenizer.cc.
References Xapian::Unicode::append_utf8(), Assert, current_token, it, len, and p.
| CJKTokenIterator & CJKTokenIterator::operator++ | ( | ) |
Definition at line 112 of file cjk-tokenizer.cc.
References Xapian::Unicode::append_utf8(), Assert, current_token, it, len, NGRAM_SIZE, and p.
| bool operator== | ( | const CJKTokenIterator & | a, | |
| const CJKTokenIterator & | b | |||
| ) | [friend] |
Definition at line 81 of file cjk-tokenizer.h.
std::string CJKTokenIterator::current_token [mutable, private] |
Xapian::Utf8Iterator CJKTokenIterator::it [private] |
Definition at line 52 of file cjk-tokenizer.h.
Referenced by operator*(), operator++(), and operator==().
unsigned CJKTokenIterator::len [mutable, private] |
Definition at line 56 of file cjk-tokenizer.h.
Referenced by get_length(), operator*(), and operator++().
Xapian::Utf8Iterator CJKTokenIterator::p [mutable, private] |