On 3/7/19 11:44 AM, Steve Downey wrote:

Charsets are registered by IANA.
https://tools.ietf.org/html/rfc2978
https://www.iana.org/assignments/character-sets/character-sets.xhtml

Another good resource is ICU's converter explorer:

http://demo.icu-project.org/icu-bin/convexp

Tom.

Of course, it doesn't particularly help in this case because filesystems for the most part (*) just care about octets and don't interpret them. The filesystem isn't going to tell you an encoding scheme for a name.

(*) Some filesystems do for example case folding or unicode normalizations. However for opening a file pretty much the only guarantee is that the octets you got from a directory listing of filesystem can be used to open the file.

On Thu, Mar 7, 2019 at 11:32 AM Ben Boeckel via Modules <modules@lists.isocpp.org> wrote:

On Thu, Mar 07, 2019 at 11:16:15 -0500, Tom Honermann wrote:
> Strawman update to the JSON schema to support this:
>
> {
> ...
> "definitions": {
> +    "filename-encoding": {
> +      "$id": "#filename-encoding",
> +      "type": [
> +        "string",
> +      ],
> +      "description": "The name of the character encoding used to
> interpret filenames",

Which ISO/IETF standard are we referencing for encoding names?

- utf-8
- UTF-8
- utf8
- UTF8

Do we need codepage information as well? Is that "standard" anywhere?

What happens if an encoding that cannot be losslessly roundtripped is
specified (e.g., Shift-JIS)?

--Ben
_______________________________________________
Modules mailing list
Modules@lists.isocpp.org
Subscription: http://lists.isocpp.org/mailman/listinfo.cgi/modules
Link to this post: http://lists.isocpp.org/modules/2019/03/0208.php
_______________________________________________
Modules mailing list
Modules@lists.isocpp.org
Subscription: http://lists.isocpp.org/mailman/listinfo.cgi/modules
Link to this post: http://lists.isocpp.org/modules/2019/03/0209.php