Support UREGEX_CANON_EQ

Description

UREGEX_CANON_EQ (Forces normalization of pattern and strings.) is not yet implemented in ICU4C regular expression engine. Support the option.

Activity

Show:
TracBot
June 30, 2018, 11:31 PM
Trac Comment 6.7 by —2017-01-05T15:48:56.628Z

Replying to (Comment 6 Gergely Nagy <ngg@…>):

Replying to (Comment 5 srl):

Is there any progress regarding this issue?

I didn't accept it, I just fixed the status. Andy is the owner.

TracBot
June 30, 2018, 11:31 PM
Trac Comment 5.6 by Gergely Nagy <ngg@1a5ca236a280a9a6—2015-05-18T11:14:23.643Z

Replying to (Comment 5 srl):
Is there any progress regarding this issue?

TracBot
June 30, 2018, 11:31 PM
Trac Comment 2 by —2012-04-24T22:42:54.706Z

Note: I would like to templatize the engine first (ticket 9285), which would simplify the changes.

I think canonical equivalent matching is best done by making the fundamental unit of matching be a combining sequence. Match results are independent of the normalization form of both the pattern and the text being matched. Differs from what Java does.

Assignee

Andy Heninger

Reporter

Yoshito Umaoka

Components

Priority

minor

Time Needed

Weeks