On Wed, Mar 17, 2021 at 9:22 AM Tom Honermann via SG16 <sg16@lists.isocpp.org> wrote:
On 3/17/21 5:23 AM, Corentin Jabot wrote:
That matches my expectations, thanks for confirming.
- P1628: Unicode character properties
As the author I do not expect to do further work on this in the 23 cycle
Not that we have a paper, or that I'm asking for anyone to do work, but without access to the UCD, all the other algorithms are off the table. In theory it's an implementation detail, in practice it would be an ABI break to change the APIs for property lookup, and having two or more mechanisms in a std library implementation seems like a bad idea.
Expectation setting is important. It's not clear to everyone how the Unicode layers work, even if the explanations are straight-forward.
We don't have proposals for any algorithms yet. I would feel some concern about trying to standardize access to the UCD without algorithms to go with it; at least reference implementations showing what is possible.
Perhaps something that might be helpful is an audit of the
algorithms and which UCD properties they each depend on. That
might help to prioritize which UCD properties to expose first,
which ones we can guarantee stability for, etc...
Tom.