From some previous doc:
Consider applying grapheme cluster iteration for breaking when dictionary for words is not present.
This is important to handle when data slicing may have removed the appropriate dictionary file.
This will have a big impact if dictionary files can be omitted.
Need to figure out the framework of how to write unit tests to test such behavior