UnicodeSet::closeOver(simple case folding)

Description

Add a new option to UnicodeSet::closeOver() to do the operation with simple (1:1, no strings) case folding.

This cannot be done in any obvious way today: there is no function that removes all strings from a set, and even if there were, generating the strings with a full closeOver(USET_CASE) only to remove them afterwards would be inefficient.

Activity

Markus Scherer February 15, 2023 at 1:00 AM

What I am implementing does not actually remove strings. Instead, it works like USET_CASE_INSENSITIVE, that is, it replaces strings with their per-code-point Simple_Case_Folding versions.
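To illustrate the behavior described in this comment, here is a minimal, self-contained sketch. It does not use ICU: `ToySet`, `kSimpleFold`, `simpleFold`, and `closeOverSimple` are hypothetical names, and the table is a tiny hand-copied excerpt of Simple_Case_Folding data, so treat it as a model of the idea rather than the actual implementation.

```cpp
#include <map>
#include <set>
#include <string>

// Hypothetical excerpt of Simple_Case_Folding (1:1 mappings only).
// This is NOT ICU's data; it covers just enough code points for the example.
static const std::map<char32_t, char32_t> kSimpleFold = {
    {U'A', U'a'},
    {U'B', U'b'},
    {U'K', U'k'},
    {U'\u212A', U'k'},       // KELVIN SIGN folds to k
    {U'S', U's'},
    {U'\u017F', U's'},       // LATIN SMALL LETTER LONG S folds to s
    {U'\u1E9E', U'\u00DF'},  // capital sharp s folds to sharp s (simple folding)
};

// Toy stand-in for a UnicodeSet: single code points plus multi-character strings.
struct ToySet {
    std::set<char32_t> chars;
    std::set<std::u32string> strings;
};

// Simple fold of one code point; unmapped code points fold to themselves.
char32_t simpleFold(char32_t c) {
    auto it = kSimpleFold.find(c);
    return it == kSimpleFold.end() ? c : it->second;
}

// Close the set over simple case folding:
//  - every code point gains all code points that share its simple fold;
//  - every string is replaced by its per-code-point simple-folded version;
//  - no new strings are ever introduced.
ToySet closeOverSimple(const ToySet& input) {
    ToySet result;
    for (char32_t c : input.chars) {
        char32_t folded = simpleFold(c);
        result.chars.insert(c);
        result.chars.insert(folded);
        for (const auto& entry : kSimpleFold) {
            if (entry.second == folded) {
                result.chars.insert(entry.first);
            }
        }
    }
    for (const std::u32string& s : input.strings) {
        std::u32string folded;
        for (char32_t c : s) {
            folded.push_back(simpleFold(c));
        }
        result.strings.insert(folded);
    }
    return result;
}
```

In this model, closing a set that contains only U+00DF (sharp s) adds U+1E9E and no strings. A full case-insensitive closure would, as far as I understand it, also add the string "ss", which is exactly what the simple variant is meant to avoid.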

Markus Scherer February 15, 2023 at 12:59 AM

Use case: implementations of ECMAScript (JavaScript) regular expressions, whose case-insensitive matching is based on Simple_Case_Folding.

Fixed

Details

Time Needed

Days

Created June 28, 2018 at 5:14 PM
Updated March 2, 2023 at 7:44 PM
Resolved March 2, 2023 at 7:44 PM