RFE locale-specific RBBI rules for French

Description

BreakIterator.getWordInstance() does not split French contractions.
"l'homme" is treated as one word, whereas it should be tokenized as "le" +
"homme", or as "l" + "homme". For the complete set of rules, see
http://french.about.com/library/pronunciation/bl-contractions.htm
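
For illustration only, the requested "l" + "homme" tokenization could be approximated by post-processing word tokens, as in the sketch below. It is not the locale-specific RBBI rule set this ticket asks for. It uses the JDK's java.text.BreakIterator as a stand-in for the ICU class, and the class name ElisionSplitter, the helpers splitElision and words, and the ELIDED prefix list are all hypothetical and deliberately incomplete.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class ElisionSplitter {
    // Hypothetical, incomplete list of elided prefixes (assumption, not from the ticket).
    private static final Set<String> ELIDED =
            Set.of("l", "d", "j", "m", "t", "s", "n", "c", "qu");

    // Splits "l'homme" into ["l", "homme"]; returns other tokens unchanged.
    static List<String> splitElision(String token) {
        int apos = token.indexOf('\'');
        if (apos < 0) {
            apos = token.indexOf('\u2019'); // typographic apostrophe
        }
        if (apos > 0 && apos < token.length() - 1
                && ELIDED.contains(token.substring(0, apos).toLowerCase(Locale.FRENCH))) {
            return List.of(token.substring(0, apos), token.substring(apos + 1));
        }
        return List.of(token);
    }

    // Walks word boundaries, keeps tokens that start with a letter or digit,
    // and applies the elision split to each of them.
    static List<String> words(String text) {
        BreakIterator it = BreakIterator.getWordInstance(Locale.FRENCH);
        it.setText(text);
        List<String> out = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String token = text.substring(start, end);
            if (Character.isLetterOrDigit(token.charAt(0))) {
                out.addAll(splitElision(token));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(words("l'homme"));
    }
}
```

A prefix list is used rather than splitting at every apostrophe so that words such as "aujourd'hui" stay intact. Expanding "l" to "le" or "la" would need more context than this sketch has.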

Environment

Status

Assignee

TracBot

Reporter

TracBot

Labels

Time Needed

Days

tracCc

andy

tracCreated

Dec 03, 2001, 2:08 AM

tracOwner

deborah

tracProject

all

tracReporter

shef31@50cd1a9a18375803

tracResolution

wontfix

tracStatus

closed

tracWeeks

0.4

Components

Fix versions

Priority

major