Because I needed to circulate what I'm doing for Belfast, I've thrown together an abstract for the paper we've peripherally discussed about modernizing and tightening the specification around encodings of characters generally, and the source and execution character sets.

"
This document proposes new standard terms for the various encodings for character and string literals, and the encodings associated with some character types. It also proposes that the wording used for [lex.charset], [lex.ccon], [lex.string], and [basic.fundamental] 8 be modified to reflect the new terminology. This paper does not intend to propose any changes that would require changes in any currently conforming implementation.
"

I'm hoping to have some preliminary work by the next telecon. The direction I'm thinking is that both Source and Execution Character Set are descriptions of the abstract characters, selected from 10646, that must be present to support C++. Encodings, both source and execution, are implementation defined. I would like to introduce terminology to describe the encoding used when translating narrow and wide character and string literals. I'd also like to make it explicit somewhere up front that there are associated encodings for some, but not all, character types. This is mentioned now in filesystem, but should be moved to a section with wider scope. The encoding for `char` and `wchar_t` is controlled by `locale`. The encoding for the unicode character types is fixed. The encoding used for literals was chosen at compile time, and is implementation defined. If locale and that endcoding conflict, behavior is unspecified. Combining TU with different encodings is in general unspecified, unless it results in an ODR violation.