C++ Logo


Advanced search

Re: [SG16-Unicode] Comments on D1629R1 Standard Text Encoding

From: Thiago Macieira <thiago_at_[hidden]>
Date: Sun, 18 Aug 2019 09:07:26 -0700
On Saturday, 17 August 2019 12:25:57 PDT Henri Sivonen wrote:
> To the extent other programming languages that have encoding
> conversion in their standard library, such as Java, focus on
> contiguous buffers rather than iteration, it's worthwhile to study if
> application developers really feel that something important is
> missing.

We were just discussing URLs in the cpplang Slack and that reminded me:
there's exactly one in 10 years case that I've needed to decode a non-
contiguous byte range and that's when parsing a URL.

So yes, these cases exist. On the other hand, they are rare enough that you
don't need the the regular API to support them. The URL parser has other needs
too and would be quite satisified to use a very flexible, stateful decoder

Moreover, decoding contiguous byte ranges have very interesting properties
since they can be parsed by SIMD.

Thiago Macieira - thiago (AT) macieira.info - thiago (AT) kde.org
   Software Architect - Intel System Software Products

Received on 2019-08-18 18:07:29