Confusable data: remove non-NFKD items

Description

The Unicode confusables.txt data file includes many mappings whose source characters are not in NFKD form. But the algorithm for obtaining a confusables skeleton for a name first normalizes the name to NFKD, which means that we will never look up the mapping for a non-NFKD character.

So, we could save some space in the mapping tables by eliminating all entries with non-NFKD sources.
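
A minimal sketch of the proposed pruning, in Python rather than in the actual table-building code. It assumes the mappings have already been parsed into a dict from source character to target string. The names `is_nfkd_stable` and `prune_non_nfkd` and the three sample entries are hypothetical and chosen only for illustration; the sample is not copied from confusables.txt. Results depend on the Unicode version bundled with the Python interpreter.

```python
import unicodedata
from typing import Dict


def is_nfkd_stable(ch: str) -> bool:
    """Return True if NFKD normalization leaves the character unchanged."""
    return unicodedata.normalize("NFKD", ch) == ch


def prune_non_nfkd(mappings: Dict[str, str]) -> Dict[str, str]:
    """Drop entries whose source character would be rewritten by NFKD.

    Such sources can never appear in NFKD-normalized input, so their
    mappings would never be looked up.
    """
    return {src: tgt for src, tgt in mappings.items() if is_nfkd_stable(src)}


# Illustrative entries only (an assumption, not real table contents).
sample = {
    "\u0430": "a",  # CYRILLIC SMALL LETTER A: unchanged by NFKD, kept
    "\uFF21": "A",  # FULLWIDTH LATIN CAPITAL LETTER A: NFKD gives "A", dropped
    "\u2160": "I",  # ROMAN NUMERAL ONE: NFKD gives "I", dropped
}

pruned = prune_non_nfkd(sample)
```

With this sample, only the Cyrillic entry survives, since the other two sources are replaced by their compatibility decompositions before any lookup could happen.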

Status

Assignee

Andy Heninger

Reporter

Andy Heninger

Labels

Time Needed

Hours

Components

Priority

medium