RFE: wordbreaking rule for Chinese Ideograph

Description

Word breaking rule for Chinese ideograph in ICU4J has below,

+ "$kanji=[\u3005\u4e00-\u9fa5\uf900-\ufa2d];"
// keep together runs of Kanji
+ "$kanji*;"

Above rule will effect to the countries to use Chinese character, CJKT.
One line could be selected as a word in Chinese and Taiwanese because usually
they don't use space or symbol.

Assignee

Andy Heninger

Reporter

TracBot

Components

Labels

None

Reviewer

None

Priority

major

Time Needed

Hours

Fix versions

Configure