Re: [SG16] format.string.std references to "Unicode encoding" unclear

From: Victor Zverovich <victor.zverovich_at_[hidden]>
Date: Wed, 26 Feb 2020 07:10:53 -0800
+1 to "Any encoding scheme capable of encoding any UCS scalar value". I
don't think we need to explain what scalar values are.

On Wed, Feb 26, 2020 at 1:57 AM Corentin Jabot via SG16 <
sg16_at_[hidden]> wrote:

>
>
> On Wed, 26 Feb 2020 at 10:34, Lyberta via SG16 <sg16_at_[hidden]>
> wrote:
>
>> Corentin Jabot via SG16:
>> >
>> That would exclude UTF-8, UTF-16 and UTF-32 because they can't encode
>> surrogates.
>>
>
> Right!
>
> Take 2:
> Any encoding scheme capable of encoding any UCS scalar value, where a UCS
> scalar value is any UCS code point except the surrogate code points, and a
> code point is tautologically any value in the UCS codespace (U+0000 to
> U+10FFFF). All three terms are defined in ISO/IEC 10646.
>
>
>
>>
>> --
>> SG16 mailing list
>> SG16_at_[hidden]
>> https://lists.isocpp.org/mailman/listinfo.cgi/sg16
>>
>
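
For illustration only, the three terms in the "Take 2" wording can be sketched as compile-time predicates. The function and constant names below are hypothetical (they come from neither the thread nor any library); the ranges are the ones stated above: the codespace is U+0000 to U+10FFFF, and the surrogate code points are U+D800 to U+DFFF.

```cpp
// Hypothetical helper names, chosen for this sketch only.
constexpr char32_t max_code_point = 0x10FFFF;

// A code point is any value in the UCS codespace (U+0000 to U+10FFFF).
constexpr bool is_code_point(char32_t c) {
    return c <= max_code_point;
}

// Surrogate code points occupy U+D800 to U+DFFF.
constexpr bool is_surrogate(char32_t c) {
    return c >= 0xD800 && c <= 0xDFFF;
}

// A scalar value is any code point that is not a surrogate.
constexpr bool is_scalar_value(char32_t c) {
    return is_code_point(c) && !is_surrogate(c);
}

static_assert(is_scalar_value(U'A'), "U+0041 is a scalar value");
static_assert(!is_scalar_value(0xD800), "surrogates are not scalar values");
static_assert(is_scalar_value(0xE000), "first code point after the surrogates");
static_assert(is_scalar_value(0x10FFFF), "last code point in the codespace");
static_assert(!is_scalar_value(0x110000), "outside the codespace");
```

Under this reading, UTF-8, UTF-16 and UTF-32 all qualify, since each can encode every value for which is_scalar_value returns true, while none is required to encode a lone surrogate.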

Received on 2020-02-26 09:13:46