The word and sentence BreakIterators fail for Thai. The DBBI code is not prepared to handle a UText whose native indexes do not map 1:1 to UTF-16 indexes.
Milestone 4.8RC deleted
#8550 is for the real fix
The dictionary break code will crash if the input text is UTF-8 because native indexes are different from UTF-16 indexes.
This ticket covers the temporary fix: skip dictionary lookup for UTF-8 encoded text. It won't give the right breaks, but it's better than a crash.
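To illustrate the index mismatch described above, here is a rough standalone sketch (not ICU code; `index_pairs` is a hypothetical helper made up for this example). It lists, for a short Thai string, the UTF-16 index and the UTF-8 byte index at each code point boundary. Code that treats one kind of index as the other would read from the wrong position.

```python
def index_pairs(text):
    """Return (utf16_index, utf8_index) at each code point boundary."""
    pairs = []
    u16 = 0  # offset in UTF-16 code units
    u8 = 0   # offset in UTF-8 bytes (the native index of a UTF-8 text)
    for ch in text:
        pairs.append((u16, u8))
        u16 += len(ch.encode("utf-16-le")) // 2
        u8 += len(ch.encode("utf-8"))
    pairs.append((u16, u8))  # end-of-text boundary
    return pairs


# Six Thai code points, each 1 UTF-16 code unit but 3 UTF-8 bytes.
thai = "\u0e2a\u0e27\u0e31\u0e2a\u0e14\u0e35"
pairs = index_pairs(thai)
for u16, u8 in pairs:
    print(u16, u8)
```

For ASCII-only text the two columns are identical, which is presumably why the problem only shows up with text such as Thai.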