ICU doesn't consider these characters as punctuation (punct): $+<=>^`|~

Description

The java.util.regex.Pattern documentation claims that the POSIX character class "Punct"

contains the characters:

Empirically, the following characters are not considered in Punct on ICU.

Grep seems to treat them as part of unct:. So perhaps this is a bug in ICU? We need to figure out whether that is the case and how to fix it.

GoogleIssue: 111497078

Status

Assignee

Andy Heninger

Reporter

Victor Chang

Labels

Reviewer

Victor Chang

Time Needed

None

Start date

None

Components

Fix versions

Priority

major
Configure