Regexp set expressions, conform to standard usage

Description

There is common syntax for regular expression [expressions|bracket] that ICU
does not fully follow. This is a consequence of using the UnicodeSet parser to
handle bracket expressions.

The differences are confusing to users, who typically expect standard Perl/Posix
behavior.

Do something to fix up ICU regexp.
There are questions of compatibility, because ICU exists now with the its own
escaping behaviors.

UnicodeSet's syntax can not be changed, for compatibility reasons. A new
regexp-specific set parser is probably needed.

Activity

UnicodeBot 
June 30, 2018 at 11:46 PM

Trac Comment 4 by —2007-12-11T22:00:56.000Z

The scope of this ticket was expanded to include both pattern features and API functions that were introduced in Java with JDK 1.5.

UnicodeBot 
June 30, 2018 at 11:46 PM

Trac Comment by auditor—1970-01-01T01:45:14.000Z
  • Mon Oct 2 18:42:58 2006 grhoten changed notes2: assign: "" to "andy", priority: "" to "high", weeks: "" to "2",

  • Mon Oct 2 23:56:57 2006 andy changed notes2: target: "UNSCH" to "3.8",

Fixed

Details

Assignee

Reporter

Components

Labels

Priority

Time Needed

Weeks

Fix versions

Created June 28, 2018 at 5:13 PM
Updated October 3, 2018 at 11:02 PM
Resolved July 1, 2018 at 8:54 PM