Improve the English keywords for non-Emoji symbols

Description

(This is the emoji component just because that's where the annotations are)

I noticed that the keywords for the symbols added in v38 are poor: the keyword annotation is just the character name, not split into separate terms. For example:

<annotation cp="↸">north west arrow long bar</annotation>
<annotation cp="↸" type="tts">north west arrow long bar</annotation>

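The keyword annotation contains no | separators, so the whole string counts as a single keyword. This means a search for that character will only match if the entire name (north west arrow long bar) is matched. It should be split into separate keywords, something like: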

<annotation cp="↹">leftwards | arrow | bar | rightwards</annotation>
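To illustrate the difference, here is a minimal sketch in Python. It assumes a consumer that splits the annotation text on | and matches a query against whole keywords, which is the behavior described above; actual search implementations may differ, and the function names `keywords` and `matches` are made up for this example.

```python
def keywords(annotation_text):
    """Split annotation text on '|' into trimmed, non-empty keywords."""
    return [k.strip() for k in annotation_text.split("|") if k.strip()]


def matches(query, annotation_text):
    """Assumed matching rule: the query must equal one whole keyword."""
    wanted = query.strip().lower()
    return wanted in (k.lower() for k in keywords(annotation_text))


# Unsplit annotation: the only keyword is the entire name.
print(matches("arrow", "north west arrow long bar"))             # False
# Split annotation: each word is its own keyword.
print(matches("arrow", "leftwards | arrow | bar | rightwards"))  # True
```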

The names could also be improved, e.g. "north west" => "northwest". Remember also that names and keywords are not restricted to ASCII letters.

The file is:

https://github.com/unicode-org/cldr/blob/master/common/annotations/en.xml

The relevant items all appear to be before line 721; they are the non-tts lines with no | in them.
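One way to list the affected items is sketched below in Python, using only the standard library. It is a rough sketch, not CLDR tooling: the function name `find_unsplit_annotations` and the inline sample are illustrative, and the line-721 cutoff is not applied because ElementTree does not retain line numbers. Note that a legitimate single-word keyword would also be listed, since the only test is the absence of |.

```python
import xml.etree.ElementTree as ET

# Small inline sample in the shape of the annotations file (illustrative only).
SAMPLE = """<ldml>
  <annotations>
    <annotation cp="↸">north west arrow long bar</annotation>
    <annotation cp="↸" type="tts">north west arrow long bar</annotation>
    <annotation cp="↹">leftwards | arrow | bar | rightwards</annotation>
  </annotations>
</ldml>"""


def find_unsplit_annotations(root):
    """Return (cp, text) for non-tts annotations whose text has no '|'."""
    hits = []
    for ann in root.iter("annotation"):
        if ann.get("type") == "tts":
            continue  # tts lines hold the name, not the keyword list
        text = (ann.text or "").strip()
        if text and "|" not in text:
            hits.append((ann.get("cp"), text))
    return hits


root = ET.fromstring(SAMPLE)
for cp, text in find_unsplit_annotations(root):
    print(cp, text)
```

To run this against a local copy of the file, replace `ET.fromstring(SAMPLE)` with `ET.parse("en.xml").getroot()` (the path is an assumption).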

Activity

Michelle Perham 
February 21, 2025 at 2:21 AM

This issue has been bulk punted to 48 and will be re-triaged. If you feel strongly it should be fixed, please advocate for it.

Mark Davis 
May 8, 2024 at 4:56 PM

CLDR TC accepted, 2024-05-08

Mark Davis 
May 7, 2024 at 8:27 PM

Moving to esub for now; needs to be assessed for version and phase

Mark Davis 
April 22, 2024 at 5:48 PM

Conclusion from design group:

  1. To make things simple, we could change each space to “ | ” in all non-emoji search keywords. That would at least be better than the current behavior.

  2. Then fix up odd cases like north | west => northwest.

  3. Split out a ticket for Andrew Glass to improve readability of tts for non-emoji.
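Steps 1 and 2 could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the function name `split_keywords` and the `FIXUPS` table are hypothetical, only the north/west entry comes from this ticket, and the sketch merges the odd word pairs before joining rather than as a separate second pass.

```python
# Hypothetical fix-up table for step 2; only this entry comes from the ticket.
FIXUPS = {("north", "west"): "northwest"}


def split_keywords(text, fixups=FIXUPS):
    """Turn a space-separated name into ' | '-separated keywords."""
    if "|" in text:
        return text  # already split; leave untouched
    words = text.split()
    out = []
    i = 0
    while i < len(words):
        pair = tuple(words[i:i + 2])
        if pair in fixups:
            out.append(fixups[pair])  # step 2: merge an odd pair
            i += 2
        else:
            out.append(words[i])
            i += 1
    return " | ".join(out)  # step 1: space becomes " | "


print(split_keywords("north west arrow long bar"))
# northwest | arrow | long | bar
```

Annotations that already contain | are returned unchanged, so the function could be applied to every non-tts line without disturbing the emoji keywords.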

Annemarie Apple 🍎 
April 4, 2024 at 5:04 AM

Bulk moving to the next version all issues that aren't in component type brs, charts, docs, or docs-spec.

Details

Created June 25, 2021 at 6:05 PM
Updated February 21, 2025 at 2:21 AM