We're updating the issue view to help you get more done. 

Sinhala collation

Description

Deleted Component: other

1) The rule:
<reset>෴</reset>
<p>අ</p>
gives අ wrong sort weight.

2) The rule:
<reset>ෳ</reset>
<p>්</p>
gives ් wrong sort weight, since ෳ has been moved to the correct position in UCA 5.2.
් already has correct sort weight in UCA 5.2.

3) The rule:
<reset>ෘ</reset>
<p>ෲ</p>
<p>ෟ</p>
<p>ෳ</p>
is no longer needed, because it is already the default in UCA 5.2.

The Sinhala collation will be correct if the three above mentioned rules are deleted.

The proposed revised CLDR Sinhala collation rules are the following:
<collation type="standard" references="Sri Lanka standard 1134 Part 1 - 2007 edition - Sri Lanka Standards Institution">
<settings normalization="on"/>
<rules>
<reset>ඖ</reset>
<p>ං</p>
<p>ඃ</p>
<reset>ඥ</reset>
<p>ඤ</p>
</rules>
</collation>
<collation type="dictionary" references="Sri Lanka standard 1134 Part 1 - 2007 edition - Sri Lanka Standards Institution">
<settings normalization="on"/>
<rules>
<reset>ඖ</reset>
<p>ං</p>
<p>ඃ</p>
<reset>ජ්ඤ</reset>
<s>ඥ</s>
</rules>
</collation>

References:
http://www.unicode.org/Public/UCA/latest/allkeys.txt
http://www.icta.lk/attachments/658_658_SLS%201134-%20Part%201.pdf

Environment

xpath

None

locale

None

Status

Assignee

TracBot

Reporter

TracBot

tracReporter

Åke Persson <ake.persson@ad497e7b57efe531

tracOwner

dima

tracResolution

fixed

tracStatus

closed

Reviewer

John Emmons

tracCreated

Jun 14, 2010, 2:25 PM

Priority

medium