Hey folks.
How was the table of width in [format] derived?
We have 2 issues here: Lack of explanation in the standard makes it hard to evolve that table,
and it does require maintenance as the Unicode standard evolves.
We do want:
- To treat 0-width codepoint as 1
- To treat emojis as 2
- To treat full width east asian as 2.
I think a better specification would be given that we have a floating reference to UAX44,
to say that codepoints that have the Unicode property "Emoji_Presentation" or
East_Asian_Width="W" have a width of 2.
This ensures implementation remains coherent as Unicode evolves.
Thanks,
Corentin