> Grapheme breaking is simple, and requires no locale info.
The encoding that goes with char* is part of the locale. Where the breaks go in a Shift-JIS stream is probably different from where they go in a UTF-8 stream or a Latin-1 stream.
Billy3
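
To make the encoding point concrete, here is a minimal sketch (not from the original message) that counts character boundaries in the same bytes under three assumed encodings. The names `Encoding`, `char_length`, and `count_characters` are hypothetical, the Shift-JIS lead-byte ranges are a simplification, and no validation is attempted; this finds only character boundaries, which any grapheme breaking would have to build on.

```cpp
#include <cassert>
#include <cstddef>
#include <string_view>

// Hypothetical tag for the encoding a locale might associate with char*.
enum class Encoding { Latin1, Utf8, ShiftJis };

// Length in bytes of the character that starts at lead byte b.
// Simplified: invalid or stray bytes are treated as one-byte characters.
inline std::size_t char_length(unsigned char b, Encoding enc) {
    switch (enc) {
    case Encoding::Latin1:
        return 1;  // every byte is one character
    case Encoding::Utf8:
        if ((b & 0xE0) == 0xC0) return 2;
        if ((b & 0xF0) == 0xE0) return 3;
        if ((b & 0xF8) == 0xF0) return 4;
        return 1;  // ASCII, or a stray continuation byte
    case Encoding::ShiftJis:
        // Assumed lead-byte ranges for two-byte characters.
        if ((b >= 0x81 && b <= 0x9F) || (b >= 0xE0 && b <= 0xFC)) return 2;
        return 1;  // ASCII or half-width katakana
    }
    return 1;
}

// Number of characters in `bytes` when interpreted in encoding `enc`.
inline std::size_t count_characters(std::string_view bytes, Encoding enc) {
    std::size_t count = 0;
    for (std::size_t i = 0; i < bytes.size(); ++count) {
        i += char_length(static_cast<unsigned char>(bytes[i]), enc);
    }
    return count;
}

// Same bytes, different break positions depending on the encoding.
inline void demo_encoding_breaks() {
    const std::string_view a("\xC3\xA9", 2);  // one character in UTF-8
    assert(count_characters(a, Encoding::Utf8) == 1);
    assert(count_characters(a, Encoding::Latin1) == 2);
    assert(count_characters(a, Encoding::ShiftJis) == 2);

    const std::string_view b("\x83\x65", 2);  // one character in Shift-JIS
    assert(count_characters(b, Encoding::ShiftJis) == 1);
    assert(count_characters(b, Encoding::Latin1) == 2);
}
```

Calling `demo_encoding_breaks()` exercises both byte sequences; none of the counts can be computed without first knowing which encoding the bytes are in.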
On 9/7/19 8:27 PM, Tony V E wrote:
I agree that would be ideal, but... I think we would want it to be measured in glyphs.
I don't know how to achieve that. Field width doesn't really work for alignment unless one assumes a monospace font. We could measure in terms of extended grapheme clusters, but EGC width has changed over time (e.g., family emoji). That makes alignment dependent on both display properties and Unicode version. And, of course, this would drag in locale dependence as well.
Are you suggesting code points because glyphs are too hard?
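
As an illustration of why the unit of measure matters (a sketch, not something from the thread), the following counts UTF-8 code units and code points for one family emoji, assumed here to be the ZWJ sequence U+1F468 U+200D U+1F469 U+200D U+1F467. The names `kFamily` and `count_code_points` are hypothetical, and the input is assumed to be well-formed UTF-8.

```cpp
#include <cassert>
#include <cstddef>
#include <string_view>

// Assumed example: man, ZWJ, woman, ZWJ, girl, encoded as UTF-8.
inline constexpr std::string_view kFamily =
    "\xF0\x9F\x91\xA8"  // U+1F468 MAN
    "\xE2\x80\x8D"      // U+200D ZERO WIDTH JOINER
    "\xF0\x9F\x91\xA9"  // U+1F469 WOMAN
    "\xE2\x80\x8D"      // U+200D ZERO WIDTH JOINER
    "\xF0\x9F\x91\xA7"; // U+1F467 GIRL

// Counts code points by counting every byte that is not a UTF-8
// continuation byte (10xxxxxx). Assumes well-formed input.
inline std::size_t count_code_points(std::string_view utf8) {
    std::size_t count = 0;
    for (unsigned char b : utf8) {
        if ((b & 0xC0) != 0x80) ++count;
    }
    return count;
}

inline void demo_family_emoji() {
    assert(kFamily.size() == 18);             // code units (bytes)
    assert(count_code_points(kFamily) == 5);  // code points
}
```

A renderer with the right font support typically shows this sequence as a single glyph, so a field width of 18, 5, or 1 is defensible depending on the unit chosen, and the rendered result still depends on the font and on the Unicode version the segmentation rules come from.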