Symbols as new data in CLDR (set #2)

Description

Symbol pickers include the following broad categories:
o Punctuation
o Currency symbols
o Diacritic forms for Latin script
o Arrows and block symbols
o Math symbols
o Supplemental symbols
o Script-specific symbols and letters


More background: The goal would be to add names for use by symbols pickers and TTS. For the latter, the presumption is that the client would already use normalization on the text, and many symbols would have pronunciation by context (eg, 1:3 ==> “one to three ratio”), or affect the prosody of the sentence instead of being spoken. So what we are looking at is the “verbatim” name of a character when the above do not apply.

Note that we already support names for emoji characters and sequences, such as: https://www.unicode.org/cldr/charts/35/annotations/romance.html

For emoji we also support search keywords. That wouldn’t be needed, at least at the start, for other characters.

xpath

None

locale

None

Activity

Show:
Kristi Lee
May 13, 2020, 7:12 PM
Edited

Changing to Accepted and assignee to Mark per discussion on 5/13.

  • labeled with 38 in Column H

    • Add to modern coverage

    • Include the names that can be derived into the process that will generate the derived names

  • labeled with 38 options in Column H

    • Add them to Comprehensive coverage.

    • Added ticket 13738 to track moving from comprehensive back to modern per agreemnt.

Mark Davis
May 21, 2020, 4:16 PM

Progress is underway. Some problems arose that we might be able to fix after shakedown.

  • The “ “ (space) causes problems in the tooling, and had to be commented out.

  • The new categories couldn’t be used, since the set of categories are determined by the emoji file. So all of these became just “punctuation”, making that page quite long.

  • // {"dashes", "‐ ― _ - – — "},

  • // {"dots", "• · . … 。 ‧ ・"},

  • // {"quotation", "‘ ’ ‚ ' “ ” „ » « "},

  • // {"brackets", ") [ ] { } 〔 〕 〈 〉 《 》 「 」 『 』 〖 〗 【 】"},

  • I just noticed that ( is missing from the source set.

Mo Legend69
July 29, 2020, 7:29 PM

ຄ່າເອັນເຕີພາຍ enterprise 2017-2018-2019 3ປີ 90.000.000 ໂດລາສະຫາລັດ

Mo Legend69
July 29, 2020, 7:31 PM

ຄ່າໂທລະຄົມ ຫຼາຍປີ

Mo Legend69
July 29, 2020, 7:32 PM

ນັກແຂງຂັນພາກັນດັງແລ້ວ Iran ມາແຮງ MOCK

Priority

major

Assignee

Mark Davis

Reporter

Kristi Lee

Reviewer

Kristi Lee

Fix versions

phase

dsub

Components

Labels

Configure