On 9/8/19 12:02 PM, Steve Downey wrote:
Character repertoire sounds good, and I will eventually learn to spell it. Character set is definitely terminology from the pre-unicode times, and unfortunately tends to merge the repertoire and encoding, https://www.iana.org/assignments/character-sets/character-sets.xhtml
I think I was a little over zealous earlier in stating that Unicode uses "character repertoire" as I described. I looked again and don't find that term formally defined in the standard. However, "repertoire" is used throughout the standard in ways that I believe are consistent with my description. I wasn't able to find an alternative formal term.
The way I've been thinking about it is that a "character repertoire" describes a set of abstract characters (a formal Unicode term) and a "character set" describes a set of encoded characters (a formal Unicode term) that associate each abstract character member of a "character repertoire" with a code point (a formal Unicode term) within a codespace (A formal Unicode term). See sections 2.4 and 3.4 of Unicode 12 and uses of the word "repertoire" within those chapters. The Unicode standard does use the term "character set", but I didn't find a formal definition.