String search matching problem

Description

I am on FreeBSD 4.

Using the samples/strsrch/strsrch.cpp program, I ran the
following tests:

My question is: why is "uss" found in "fußball", but "fuss" is not?

Activity

Show:

UnicodeBot 
June 30, 2018 at 11:47 PM

Trac Comment 10 by hbrands@8db35b36450ba7ca—2009-01-11T10:28:32.000Z

The issue is still present in ICU4J 4.0, right?
Any estimate, when it will be fixed in ICU4J?
Thanks.

UnicodeBot 
June 30, 2018 at 11:47 PM

Trac Comment 9 by eric—2008-05-28T18:26:40.000Z

This is a fundamental problem with the Boyer-Moore implementation. The value returned by getMaxExpansion is not sufficient in all cases for computing skip distances.

As an interim fix, the Boyer-Moore implementation was replaced by a linear search, which is slower but accurate.

The problems reported in tickets 3315, 5382 and 5959 are also caused by this problem.

UnicodeBot 
June 30, 2018 at 11:47 PM

Trac Comment 6 by —2007-08-31T17:58:07.000Z

The problem is confirmed, the fix will not be easy.
5382 is probably from the same cause.

UnicodeBot 
June 30, 2018 at 11:47 PM

Trac Comment by jplemieux@f74d39fa044aa309—2006-10-07T16:43:37.000Z

(Guest Reply)

I am also using the ICU4J library and my users are experiencing this bug. +1 from me for its priority...

UnicodeBot 
June 30, 2018 at 11:47 PM

Trac Comment by jplemieux@f74d39fa044aa309—2006-10-07T16:36:53.000Z

(Guest Reply)

I am also using the ICU4J library and my users are experiencing this bug. +1 from me for its priority...

Fixed

Details

Assignee

Reporter

Priority

Time Needed

Weeks

Fix versions

Created June 28, 2018 at 5:13 PM
Updated August 18, 2021 at 8:30 PM
Resolved July 1, 2018 at 8:55 PM