sg16: Re: [SG16] P2194R0 The character set of C++ source code is Unicode

From: Alisdair Meredith <alisdairm_at_[hidden]>
Date: Mon, 24 Aug 2020 12:29:08 -0400

A minor suggestion on the wording:

You strike the mapping of non-basic source code characters to
universal-character-names, including the cross-reference to such
mappings being reverted in raw string literals (5.4). I suggest making
a matching edit to strike the reference in (5.4)p3 as well, so that
the only thing reverted is line splicing in phase 2.
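
To illustrate what "reverted" means inside a raw string literal, here is a
minimal sketch of the behavior under the current rules (it is not taken from
the paper's proposed wording). The helper `length` and the constant names
are made up for this example.

```cpp
#include <cstddef>

// Hypothetical helper: number of chars in a string literal,
// not counting the terminating null.
template <std::size_t N>
constexpr std::size_t length(const char (&)[N]) { return N - 1; }

// Ordinary literal: phase 2 splices the backslash-newline away,
// so the literal is just "ab".
constexpr std::size_t spliced_len = length("a\
b");

// Raw literal: the splice is reverted, so the backslash and the
// newline are both part of the contents: 'a', '\\', '\n', 'b'.
constexpr std::size_t raw_len = length(R"(a\
b)");

// Raw literal: the six source characters \ u 0 0 e 9 stay as written
// and do not denote the single character U+00E9.
constexpr std::size_t raw_ucn_len = length(R"(\u00e9)");

static_assert(spliced_len == 2, "line splice applies in ordinary literals");
static_assert(raw_len == 4, "line splice is reverted in raw literals");
static_assert(raw_ucn_len == 6, "\\u00e9 is kept verbatim in raw literals");
```

The first two assertions cover the line-splicing reversion that would remain;
the third covers the universal-character-name case that the cross-reference
in (5.4)p3 is about.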

That said, with these changes, I am curious what the difference
is between a u8 string literal and a plain ‘char’ string literal, as
the contents of that literal are now going to be Unicode source
text (rather than requesting a mapping from source to Unicode
of the literal’s contents)?
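
As a concrete handle on that question, here is a small sketch of how the two
kinds of literal differ under the current rules: a u8 literal is always
encoded as UTF-8, while the encoding of a plain literal is
implementation-defined. The helper `code_units` and the constant names are
hypothetical, and the character is spelled as \u00e9 only so that the example
does not depend on the encoding of the source file.

```cpp
#include <cstddef>

// Hypothetical helper: number of code units in a string literal,
// not counting the terminating null. Works for char and char8_t arrays.
template <typename T, std::size_t N>
constexpr std::size_t code_units(const T (&)[N]) { return N - 1; }

// u8 literal: always UTF-8, so U+00E9 becomes the two bytes 0xC3 0xA9
// (element type is char before C++20 and char8_t from C++20 on).
constexpr std::size_t utf8_units = code_units(u8"\u00e9");
constexpr unsigned char utf8_first = static_cast<unsigned char>(u8"\u00e9"[0]);
constexpr unsigned char utf8_second = static_cast<unsigned char>(u8"\u00e9"[1]);

// Plain literal: encoded in the implementation-defined ordinary literal
// encoding, so the count is typically 2 for UTF-8 and 1 for Latin-1.
// Deliberately not asserted, because it varies between implementations.
constexpr std::size_t ordinary_units = code_units("\u00e9");

static_assert(utf8_units == 2, "U+00E9 is two UTF-8 code units");
static_assert(utf8_first == 0xC3, "first UTF-8 byte of U+00E9");
static_assert(utf8_second == 0xA9, "second UTF-8 byte of U+00E9");
```

Even if the source text is defined to be Unicode, the plain literal would
presumably still be converted to whatever encoding the implementation uses
for ordinary literals, so the two forms only coincide when that encoding
happens to be UTF-8.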

AlisdairM

> On Aug 24, 2020, at 08:31, Peter Brett via SG16 <sg16_at_[hidden]> wrote:
>
> Hi all,
>
> In this week's meeting, we are going to discuss the remaining
> proposals from P2178R1 "Misc lexing and string handling improvements".
> In particular, we will discuss proposal 9:
>
> Proposal 9: Reaffirming Unicode as the character set of the
> internal representation
>
> In anticipation of a lively discussion, Corentin and I have written a
> short new paper which will be appearing in the September mailing.
>
> P2194R0 The character set of C++ source code is Unicode
> https://isocpp.org/files/papers/P2194R0.pdf
>
> We hope that the study group finds this contribution helpful and
> informative.
>
> Best regards,
>
> Peter
>
> --
> SG16 mailing list
> SG16_at_[hidden]
> https://lists.isocpp.org/mailman/listinfo.cgi/sg16

Received on 2020-08-24 11:32:36