We're updating the issue view to help you get more done. 

sjis (and other) tables have explicit subchar mappings

Description

ibm-943 (shiftjis) and other encodings have ranges where a unicode value is
mapped explicitly to a single byte substitute character. In sjis, this is true
of the range U+0080 through U+00FF, including U+00A0 (nonbreaking space) and
U+00C9 (e with acute). Since the mapping is explicit, conversion to the
sub char is considered a successful conversion, and so no callback routine
or error value can be used. Furthermore, all of these .ucm's have

  1. (SUB) in the comment line which seems to indicate that they are known to be
    a substitute.

Status

Assignee

TracBot

Reporter

TracBot

Labels

Reviewer

None

Time Needed

None

Start date

None

Components

Priority

assess