An iterator which returns Unicode character values from a UTF-8 encoded string.
#include <unicode.h>
typedef std::input_iterator_tag | iterator_category | We implement the semantics of an STL input_iterator.
typedef unsigned | value_type |
typedef size_t | difference_type |
typedef const unsigned * | pointer |
typedef const unsigned & | reference |
const char * | raw () const | Return the raw const char* pointer for the current position.
size_t | left () const | Return the number of bytes left in the iterator's buffer.
void | assign (const char *p_, size_t len) | Assign a new string to the iterator.
void | assign (const std::string &s) | Assign a new string to the iterator.
| Utf8Iterator (const char *p_) | Create an iterator given a pointer to a null-terminated string.
| Utf8Iterator (const char *p_, size_t len) | Create an iterator given a pointer and a length.
| Utf8Iterator (const std::string &s) | Create an iterator given a string.
| Utf8Iterator () | Create an iterator which is at the end of its iteration.
unsigned | operator* () const | Get the current Unicode character value pointed to by the iterator.
Utf8Iterator | operator++ (int) | Move forward to the next Unicode character (postfix form).
Utf8Iterator & | operator++ () | Move forward to the next Unicode character (prefix form).
bool | operator== (const Utf8Iterator &other) const | Test two Utf8Iterators for equality.
bool | operator!= (const Utf8Iterator &other) const | Test two Utf8Iterators for inequality.
◆ Utf8Iterator() [1/4]
Xapian::Utf8Iterator::Utf8Iterator (const char *p_) [explicit]
Create an iterator given a pointer to a null-terminated string.
The iterator will return characters from the start of the string when next called. The string is not copied into the iterator, so it must remain valid while the iteration is in progress.
- Parameters
-
p_ | A pointer to the start of the null-terminated string to read. |
◆ Utf8Iterator() [2/4]
Xapian::Utf8Iterator::Utf8Iterator (const char *p_, size_t len) [inline]
Create an iterator given a pointer and a length.
The iterator will return characters from the start of the string when next called. The string is not copied into the iterator, so it must remain valid while the iteration is in progress.
- Parameters
-
p_ | A pointer to the start of the string to read. |
len | The length of the string to read. |
◆ Utf8Iterator() [3/4]
Xapian::Utf8Iterator::Utf8Iterator (const std::string &s) [inline]
Create an iterator given a string.
The iterator will return characters from the start of the string when next called. The string is not copied into the iterator, so it must remain valid while the iteration is in progress.
- Parameters
-
s | The string to read. Must not be modified while the iteration is in progress. |
◆ Utf8Iterator() [4/4]
Xapian::Utf8Iterator::Utf8Iterator () [inline]
Create an iterator which is at the end of its iteration.
This can be compared to another iterator to check if the other iterator has reached its end.
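The standard library's std::istream_iterator uses the same convention: a default-constructed iterator is the end sentinel. The sketch below shows the resulting loop shape in a generic helper. The names collect and demo_collect are hypothetical, and the demonstration uses std::istream_iterator as a stand-in so that it compiles without Xapian. Because Utf8Iterator declares the usual iterator typedefs, a call such as collect(Xapian::Utf8Iterator(s)) would be expected to work too, but that is an assumption rather than something checked here.

```cpp
#include <cassert>
#include <iterator>
#include <sstream>
#include <vector>

// Hypothetical helper: gather every value from an input iterator whose
// default-constructed state means "end of iteration".
template <typename It>
std::vector<typename std::iterator_traits<It>::value_type> collect(It it) {
    std::vector<typename std::iterator_traits<It>::value_type> out;
    const It end = It();                 // the end sentinel
    for (; it != end; ++it) {            // stop once the iterator is exhausted
        out.push_back(*it);
    }
    return out;
}

// Stand-in demonstration: std::istream_iterator follows the same convention.
void demo_collect() {
    std::istringstream in("abc");
    const std::vector<char> chars = collect(std::istream_iterator<char>(in));
    assert(chars.size() == 3);
    assert(chars.front() == 'a' && chars.back() == 'c');
}
```

Comparing against a default-constructed iterator keeps the loop independent of how the buffer length was supplied (null-terminated pointer, pointer plus length, or std::string).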
◆ assign() [1/2]
void Xapian::Utf8Iterator::assign (const char *p_, size_t len) [inline]
Assign a new string to the iterator.
The iterator will forget the string it was iterating through, and return characters from the start of the new string when next called. The string is not copied into the iterator, so it must remain valid while the iteration is in progress.
- Parameters
-
p_ | A pointer to the start of the string to read. |
len | The length of the string to read. |
◆ assign() [2/2]
void Xapian::Utf8Iterator::assign (const std::string &s) [inline]
Assign a new string to the iterator.
The iterator will forget the string it was iterating through, and return characters from the start of the new string when next called. The string is not copied into the iterator, so it must remain valid while the iteration is in progress.
- Parameters
-
s | The string to read. Must not be modified while the iteration is in progress. |
◆ operator!=()
bool Xapian::Utf8Iterator::operator!= (const Utf8Iterator &other) const [inline]
Test two Utf8Iterators for inequality.
- Parameters
-
other | The Utf8Iterator to compare this one with. |
- Returns
- true iff the iterators do not point to the same position.
◆ operator*()
unsigned Xapian::Utf8Iterator::operator* () const
Get the current Unicode character value pointed to by the iterator.
If an invalid UTF-8 sequence is encountered, then the byte values comprising it are returned until valid UTF-8 or the end of the input is reached.
Returns unsigned(-1) if the iterator has reached the end of its buffer.
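The behaviour described above can be modelled with a small standalone decoder. This is a simplified sketch, not Xapian's implementation: decode_one, decode_all and demo_decode are hypothetical names, and the validity checks are assumptions (the lead byte and continuation bytes are checked, but overlong three- and four-byte forms and surrogates are not rejected, which may differ from what the library does).

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper (not part of Xapian): decode the value starting at
// s[pos] and store the number of bytes it occupies in `used`.
unsigned decode_one(const std::string& s, std::size_t pos, std::size_t& used) {
    if (pos >= s.size()) {             // end of buffer: mirror unsigned(-1)
        used = 0;
        return unsigned(-1);
    }
    const unsigned char lead = static_cast<unsigned char>(s[pos]);
    used = 1;                          // default: consume a single byte
    if (lead < 0x80) return lead;      // plain ASCII

    std::size_t extra;                 // continuation bytes the lead byte promises
    unsigned value;
    if (lead >= 0xC2 && lead <= 0xDF)      { extra = 1; value = lead & 0x1Fu; }
    else if (lead >= 0xE0 && lead <= 0xEF) { extra = 2; value = lead & 0x0Fu; }
    else if (lead >= 0xF0 && lead <= 0xF4) { extra = 3; value = lead & 0x07u; }
    else return lead;                  // stray continuation or invalid lead byte

    if (s.size() - pos - 1 < extra) return lead;    // sequence is truncated
    for (std::size_t i = 1; i <= extra; ++i) {
        const unsigned char c = static_cast<unsigned char>(s[pos + i]);
        if ((c & 0xC0) != 0x80) return lead;        // not a continuation byte
        value = (value << 6) | (c & 0x3Fu);
    }
    used = extra + 1;
    return value;
}

// Decode a whole buffer, stopping at the unsigned(-1) end marker.
std::vector<unsigned> decode_all(const std::string& s) {
    std::vector<unsigned> out;
    std::size_t pos = 0, used = 0;
    for (;;) {
        const unsigned ch = decode_one(s, pos, used);
        if (ch == unsigned(-1)) break;
        out.push_back(ch);
        pos += used;
    }
    return out;
}

void demo_decode() {
    // U+00E9 is encoded as the two bytes C3 A9.
    const std::vector<unsigned> ok = decode_all("a\xC3\xA9");
    assert(ok.size() == 2 && ok[0] == 0x61u && ok[1] == 0xE9u);
    // A lead byte with no continuation comes back as its raw byte value.
    const std::vector<unsigned> bad = decode_all("\xC3z");
    assert(bad.size() == 2 && bad[0] == 0xC3u && bad[1] == 0x7Au);
}
```

Returning the offending byte and advancing by one position means malformed input never stalls the loop: every call either consumes at least one byte or reports the end of the buffer.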
◆ operator++() [1/2]
Utf8Iterator & Xapian::Utf8Iterator::operator++ ()
Move forward to the next Unicode character (prefix form).
- Returns
- A reference to this object.
◆ operator++() [2/2]
Utf8Iterator Xapian::Utf8Iterator::operator++ (int)
Move forward to the next Unicode character (postfix form).
- Returns
- An iterator pointing to the position before the move.
◆ operator==()
bool Xapian::Utf8Iterator::operator== (const Utf8Iterator &other) const [inline]
Test two Utf8Iterators for equality.
- Parameters
-
other | The Utf8Iterator to compare this one with. |
- Returns
- true iff the iterators point to the same position.