The java.util.regex.Pattern documentation claims that the POSIX character class "Punct"
contains the characters:
Empirically, the following characters are not treated as part of Punct by ICU.
grep seems to treat them as part of [:punct:], so perhaps this is a bug in ICU? We need to determine whether that is the case and, if so, how to fix it.
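
One way to start narrowing this down is to compare the two definitions of Punct that java.util.regex.Pattern itself offers. The sketch below is only a probe, not a fix: it lists the printable ASCII characters matched by \p{Punct} in the default (POSIX/ASCII) mode and again under Pattern.UNICODE_CHARACTER_CLASS, where Punct is defined through the Unicode punctuation categories. The class name PunctProbe and its helper methods are hypothetical names made up for this example. That ICU follows the Unicode-category definition is an assumption that still has to be checked against ICU itself.

```java
import java.util.regex.Pattern;

// Hypothetical probe class; not part of any library.
final class PunctProbe {

    // Collects every printable ASCII character (0x21..0x7E) that the
    // pattern matches as a complete one-character input.
    static String asciiMatches(Pattern p) {
        StringBuilder sb = new StringBuilder();
        for (char c = 0x21; c <= 0x7E; c++) {
            if (p.matcher(String.valueOf(c)).matches()) {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    // Returns the characters of `all` that do not occur in `subset`.
    static String difference(String all, String subset) {
        StringBuilder sb = new StringBuilder();
        for (char c : all.toCharArray()) {
            if (subset.indexOf(c) < 0) {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Default mode: the POSIX (US-ASCII) definition of Punct.
        String posix = asciiMatches(Pattern.compile("\\p{Punct}"));
        // Unicode mode: Punct is defined via Unicode punctuation categories.
        String unicode = asciiMatches(
            Pattern.compile("\\p{Punct}", Pattern.UNICODE_CHARACTER_CLASS));

        System.out.println("POSIX Punct:   " + posix);
        System.out.println("Unicode Punct: " + unicode);
        System.out.println("Only in POSIX: " + difference(posix, unicode));
    }
}
```

If the characters reported as "Only in POSIX" coincide with the ones ICU rejects, the discrepancy is more likely a difference in definition (Unicode symbols versus POSIX punctuation) than a bug, and the fix would belong on our side rather than in ICU.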