We're updating the issue view to help you get more done. 

Updated Smaller Khmer Word Breaker Dictionary

Description

We have completely revised the current Khmer dictionary used with the ICU break iterator in order to cut down on the size significantly (it was about 2 MB now it is 372KB).

The dictionary change also requires a small change in the code of dictbe.cpp in order to work with greater accuracy. I've attached the new dictionary as well as a patch for dictbe.cpp

The previous dictionary was too large for some programs that use ICU (such as Chrome), so this dictionary will allow them to add Khmer line-breaking etc.

Environment

Status

Assignee

Jungshik Shin (신정식)

Reporter

TracBot

Time Needed

Hours

tracCc

andy,claireho,markus,pedberg

tracCreated

Jul 27, 2012, 9:45 AM

tracOwner

jungshik

tracProject

all

tracReporter

sungkhum@f74d39fa044aa309

tracStatus

accepted

tracWeeks

0.2

Components

Fix versions

Priority

medium