Date: Fri, 3 Jul 2020 09:11:55 +0200
On 02/07/2020 18.43, Tom Honermann via SG16 wrote:
> On 7/2/20 3:15 AM, Corentin via Core wrote:
>> literal encoding is a less ambiguous term either way.
>> We need a terminology such that we can distinguish the encoding of literals from that of runtime strings, literal (associated) encoding achieves that.
>
> Ah, I think we may be crossing hairs here. I agree that we should have an abstract name that indicates the encoding used for literals. We lack a term for that today (which is why the paper uses the phrase "encoding of the execution character set"). But that is different from what is intended by "associated character encoding"; this is intended to name an encoding (possibly indirectly, hence "encoding of the ...") that might be registered with IANA <https://www.iana.org/assignments/character-sets/character-sets.xhtml> (where the term "character set" is used to mean "character encoding", but the former is used for legacy reasons).
You lost me there.
What does IANA's list of character sets/encodings have to do with
how a compiler chooses to encode C++ literals?
Maybe the compiler has invented an encoding of its own, just
for the fun of it.
If you intend some stronger relationship to IANA here, this
needs to be a lot of explicit.
Jens
> On 7/2/20 3:15 AM, Corentin via Core wrote:
>> literal encoding is a less ambiguous term either way.
>> We need a terminology such that we can distinguish the encoding of literals from that of runtime strings, literal (associated) encoding achieves that.
>
> Ah, I think we may be crossing hairs here. I agree that we should have an abstract name that indicates the encoding used for literals. We lack a term for that today (which is why the paper uses the phrase "encoding of the execution character set"). But that is different from what is intended by "associated character encoding"; this is intended to name an encoding (possibly indirectly, hence "encoding of the ...") that might be registered with IANA <https://www.iana.org/assignments/character-sets/character-sets.xhtml> (where the term "character set" is used to mean "character encoding", but the former is used for legacy reasons).
You lost me there.
What does IANA's list of character sets/encodings have to do with
how a compiler chooses to encode C++ literals?
Maybe the compiler has invented an encoding of its own, just
for the fun of it.
If you intend some stronger relationship to IANA here, this
needs to be a lot of explicit.
Jens
Received on 2020-07-03 02:15:46