C++ Logo


Advanced search

Re: Considerations for Unicode algorithms

From: Peter Brett <pbrett_at_[hidden]>
Date: Tue, 31 Jan 2023 08:25:08 +0000
Hi Zach,

Doesn't this add a lot of complexity? I really would like to understand the rationale/motivation for this level of generality, with some examples of code that is significantly improved by them.

For example, I am struggling to envisage a situation in which I'd find it useful to do sentence break iteration on a std::vector<byte> without any intermediate decoding step.

Best regards,


-----Original Message-----
From: SG16 <sg16-bounces_at_[hidden]> On Behalf Of Zach Laine via SG16
Sent: 30 January 2023 21:33
To: Corentin <corentin.jabot_at_gmail.com>
Cc: Zach Laine <whatwasthataddress_at_gmail.com>; SG16 <sg16_at_lists.isocpp.org>
Subject: Re: [SG16] Considerations for Unicode algorithms

Also, I think the algorithms should be generic. They should not work
only with char32_t, or only with int, etc. Users should be free to
use char8_t, char, unsigned char, etc., for UTF-8. std::byte if
you're nasty.

Received on 2023-01-31 08:25:16