<div dir="ltr">I was unclear, unfortunately. The standard term &quot;basic source character set&quot; is the only source character set. Other things representing characters that will be translated are escape sequences and universal character names, which are sequences of basic source characters. There are a few places in the standard that mention &quot;source character set&quot; without the basic qualification, but it&#39;s pretty clear that the same set is meant. <br><br>The execution character set, on the other hand, has basic, corresponding to the basic source character set, and the extended superset, which is often unqualified. <br><br><div>Also, &quot;source character set&quot; is not the same as what&#39;s actually in a source file, leading to further confusion. That mapping is phase 1. If we were to switch to a code point basis, this would be a decode from source file encoding to code points. Plus some &#39;implementation defined&#39; handwaving to preserve wetware implementations. <br><br>And 26 because I think it will take more than a couple years to understand all the implications of switching from &quot;source characters&quot; and &quot;universal character names&quot; to &quot;code points&quot;. At least not without just starting over, which is equally scary. <br><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, May 27, 2020 at 1:36 AM Tom Honermann &lt;<a href="mailto:tom@honermann.net" target="_blank">tom@honermann.net</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
  
    
  
  <div>
    <div>On 5/27/20 12:08 AM, Steve Downey via
      SG16 wrote:<br>
    </div>
    <blockquote type="cite">
      
      <div dir="ltr">With respect to P1859:<br>
        -Basic source character set<br>
        -: The abstract characters that must be representable in the
        _character set_ used for source code<br>
        +Source character set<br>
        +: The abstract characters that must be representable in the
        internal _character set_ used after phase 1 of translation. All
        characters not in the source character set are converted to
        universal-character-names, which are made up of characters from
        the basic character set. The abstract parser only sees
        characters in the source character set.<br>
        <br>
        There is no &quot;Basic&quot; source character set. There is the character
        set the lexer and parser uses that is available after the
        implementation defined conversion from whatever was presented as
        source. <br>
        I don&#39;t think anyone understands that, outside CWG. <br>
      </div>
    </blockquote>
    <p>I&#39;m not sure I&#39;m following.  The standard does define <i>basic
        source character set</i> in <a href="http://eel.is/c++draft/lex.charset#1" target="_blank">[lex.charset]p1</a>. 
      Are you proposing that it be renamed to just <i>source character
        set</i>?</p>
    <p>I think we will need a term for the encoding of source files.  We
      could use <i>source file encoding</i> for that, but I&#39;m a little
      concerned about these two terms being confused.<br>
    </p>
    <blockquote type="cite">
      <div dir="ltr">Having more precision around the values
        emitted into narrow, wide, and uN literals from the execution
        character set, and what happens when that fails I still believe
        would be useful.<br>
      </div>
    </blockquote>
    I think P2029 may address that.<br>
    <blockquote type="cite">
      <div dir="ltr"><br>
        Perhaps for 26 we could rewrite entirely in terms of processing
        code points and occasionally &quot;orginal spelling&quot;. It would be
        nice if the logical model was closer to what the physical model
        is. <br>
      </div>
    </blockquote>
    <p>Why wait for 26? :)</p>
    <p>Tom.<br>
    </p>
    <blockquote type="cite">
      <div dir="ltr">
        <div><br>
        </div>
      </div>
      <br>
      <div class="gmail_quote">
        <div dir="ltr" class="gmail_attr">On Tue, May 26, 2020 at 1:37
          PM Tom Honermann via SG16 &lt;<a href="mailto:sg16@lists.isocpp.org" target="_blank">sg16@lists.isocpp.org</a>&gt;
          wrote:<br>
        </div>
        <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
          <div>
            <p>This is your friendly reminder that an SG16 telecon will
              be held tomorrow, Wednesday May 27th, at 19:30 UTC (<a href="https://www.timeanddate.com/worldclock/converter.html?iso=20200527T193000&amp;p1=1440" target="_blank">timezone
                conversion</a>).  To attend, visit <a href="https://bluejeans.com/140274541" target="_blank">https://bluejeans.com/140274541</a>
              at the start of the meeting.</p>
            <p>Steve will circulate a draft revision of P1949 on the
              SG16 mailing list today.<br>
            </p>
            <p>The agenda for the meeting is:<br>
            </p>
            <ul>
              <li>D1949R4: C++ Identifier Syntax using Unicode Standard
                Annex 31</li>
              <ul>
                <li>Review updates since the April 22nd review.</li>
              </ul>
              <li>Discuss terminology updates to strive for in C++23</li>
              <ul>
                <li><a href="https://wg21.link/p1859" target="_blank">P1859R0: Standard terminology
                    character sets and encodings</a></li>
                <li>Establish priorities for terms to address.</li>
                <li>Establish a methodology for drafting wording
                  updates.</li>
              </ul>
            </ul>
            <p> Anticipated decisions to be made at this meeting
              include: <br>
            </p>
            <ul>
              <li>Whether to forward the new draft revision of P1949 to
                EWG.</li>
            </ul>
            <p>Prior to tomorrow&#39;s meeting, please:<br>
            </p>
            <ul>
              <li>review Steve&#39;s draft revision.</li>
              <li>review P1859R0, particularly the proposed terminology.</li>
              <li>think of other terminology changes to be considered.</li>
              <li>think of how we can divide up the work for making
                terminology updates.<br>
              </li>
            </ul>
            <p>Tom.<br>
            </p>
          </div>
          -- <br>
          SG16 mailing list<br>
          <a href="mailto:SG16@lists.isocpp.org" target="_blank">SG16@lists.isocpp.org</a><br>
          <a href="https://lists.isocpp.org/mailman/listinfo.cgi/sg16" rel="noreferrer" target="_blank">https://lists.isocpp.org/mailman/listinfo.cgi/sg16</a><br>
        </blockquote>
      </div>
      <br>
      <fieldset></fieldset>
    </blockquote>
    <p><br>
    </p>
  </div>

</blockquote></div>

