At survey 34, our attention has been drawn on this guideline:
>> Don’t add emoji names (these will be added automatically)
>> Don’t add repeats of words starting with the same starting word in the emoji name.
The Unicode documentation uses "emoji short names" and "emoji names" interchangeably:
Therefore, vetters mustn’t echo the emoji (short) name in the keywords field. English has started applying this rule, as in:
That example stays also for extending the guideline about not having "face" needlessly, referenced in:
But despite changing `heavy heart exclamation` to `heart exclamation` in the following example, English still does not remove the emoji name from the keywords:
My question is //why editing the English source is not completed,// given a skillful programmer may do it by running a script, and //whether other locales should attempt to stand out// over English by doing this job (by hand if no programmer in vetting community), //or not.//
If/when CLDR has such a script, many communities might be interested in running it on their data.
Alternatively, CLDR could implement a routine check in postprocessing to remove repeats of emoji names from the keywords, making for cleaner data. That may also make room for additional “mindful” keywords (a number of emoji may have a potential).
(The word “mindful” stands in the guideline:
>> be mindful of the number of keywords and try to limit the number of keywords below 5. (You will start to see a warning on keywords beyond 6.)
>> Here are some tips on how to be mindful of the number of keywords:
Removing useless keywords is therefore an interesting issue both for other locales and for English.