Support legacy grapheme break

Description

In the 2010-08-18 CLDR meeting we discussed extending CLDR to support both extended grapheme break (new in Unicode 5.1) and legacy grapheme break (which is fixed in Unicode 6.0 to work as specified, and is preferred for Thai/Lao). We discussed putting two variants for grapheme break in root, with the root-level default being extended grapheme break (which is the current grapheme break in CLDR), allowing locales to specify a different default. This bug is to implement something like that.

However, segmentation choices do not fit the model described above. Right now root just has the following types defined:

• <segmentation type="GraphemeClusterBreak">

• <segmentation type="LineBreak">

• <segmentation type="WordBreak">

• <segmentation type="SentenceBreak">

There is no real structure to support multiple instances of each of those types, or to select a default among those instances.
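To make the missing piece concrete, here is a minimal sketch of the selection behavior discussed in the meeting: root names a default variant, and a locale may override it. Everything in it is hypothetical and for illustration only. The variant names "extended" and "legacy", the DEFAULT_VARIANT table, and both function names are assumptions, not existing CLDR structure or API, and the fallback is simplified to truncating the locale ID.

```python
from typing import Optional

ROOT = "root"

# Hypothetical per-locale defaults (not real CLDR data). Root picks the
# extended variant; th and lo override it, since the description says the
# legacy variant is preferred for Thai/Lao.
DEFAULT_VARIANT = {
    "root": "extended",
    "th": "legacy",
    "lo": "legacy",
}


def parent_locale(locale: str) -> Optional[str]:
    """Simplified truncation fallback: th_TH -> th -> root -> None."""
    if locale == ROOT:
        return None
    head, sep, _ = locale.rpartition("_")
    return head if sep else ROOT


def default_grapheme_break(locale: str) -> str:
    """Return the first default found while walking from locale up to root."""
    current: Optional[str] = locale
    while current is not None:
        if current in DEFAULT_VARIANT:
            return DEFAULT_VARIANT[current]
        current = parent_locale(current)
    raise LookupError("no default grapheme break variant for " + locale)
```

Under these assumptions, default_grapheme_break("th_TH") resolves through th to the legacy variant, while a locale with no override, such as en, inherits the extended variant from root.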

xpath

None

locale

None

Status

Priority

major

Assignee

Peter Edberg

Reporter

Peter Edberg

tracReporter

pedberg

Reviewer

None

Labels

None

Components

Fix versions

None

phase

dsub