Add UnicodeSet.split

Description

It is much faster to use a UnicodeSet for splitting than a Regex split. This is a proposal that we add an API for that.

CharSequence[] split(CharSequence source);

Activity

Show:
TracBot
July 1, 2018, 12:12 AM
Trac Comment 1 by —2011-09-21T19:26:54.384Z

Need API proposal, review by Markus.

TracBot
July 1, 2018, 12:12 AM
Trac Comment 2 by —2011-09-23T18:05:37.509Z

Here are some figures:

TracBot
July 1, 2018, 12:12 AM
Trac Comment 3 by —2012-08-10T16:54:33.643Z

I think we might want some kind of a UnicodeSetSplitter class rather than adding more auxiliary methods to UnicodeSet itself. (Ideally, we should have put span() etc. on a different class too.)

Assignee

Mark Davis

Reporter

Mark Davis

Components

Labels

None

Reviewer

None

Priority

minor

Time Needed

Hours

Fix versions

None