<div dir="ltr"><div dir="ltr"><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, 1 Jun 2020 at 19:14, Tom Honermann &lt;<a href="mailto:tom@honermann.net">tom@honermann.net</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
  
    
  
  <div>
    <div>On 6/1/20 1:01 PM, Tom Honermann via
      SG16 wrote:<br>
    </div>
    <blockquote type="cite">
      
      <div>On 6/1/20 8:53 AM, Corentin via SG16
        wrote:<br>
      </div>
      <blockquote type="cite">
        
        <div dir="ltr">
          <div><br>
          </div>
          The standard doesn&#39;t specify what the new-line character is.
          <div>According to Unicode, the following codepoint sequences
            should be considered lines terminators</div>
        </div>
      </blockquote>
      Could you please include a reference?<br></blockquote></div></blockquote><div><br></div><div><a href="https://en.wikipedia.org/wiki/Newline#Unicode">https://en.wikipedia.org/wiki/Newline#Unicode</a> which is derived from <a href="https://www.unicode.org/reports/tr14/tr14-32.html">https://www.unicode.org/reports/tr14/tr14-32.html</a></div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><blockquote type="cite">
      <blockquote type="cite">
        <div dir="ltr">
          <div><br>
          </div>
          <div><font face="monospace"> LF:    Line Feed, U+000A<br>
               VT:    Vertical Tab, U+000B<br>
               FF:    Form Feed, U+000C<br>
               CR:    Carriage Return, U+000D<br>
               CR+LF: CR (U+000D) followed by LF (U+000A)<br>
               NEL:   Next Line, U+0085<br>
               LS:    Line Separator, U+2028<br>
               PS:    Paragraph Separator, U+2029</font><br>
          </div>
          <div><br>
          </div>
          <div>Similarly, the standard defines &quot;white spaces&quot; loosely as
            &quot;blanks, horizontal and vertical tabs&quot;, however there are
            more white space characters in unicode <a href="https://en.wikipedia.org/wiki/Whitespace_character" target="_blank">https://en.wikipedia.org/wiki/Whitespace_character</a></div>
          <div><br>
          </div>
          <div>What I would like to do:</div>
          <div><br>
          </div>
          <div>* Define new-line and white-spaces as grammar term, with
            an explicit list of codepoint sequences.   <br>
          </div>
        </div>
      </blockquote>
    </blockquote>
    <p>I know the following doesn&#39;t fit in with your wording direction,
      but for conceptual clarity, in today&#39;s wording, you would be
      suggesting something like the following, correct?<br>
    </p>
    <p>white-space:<br>
      - space, horizontal tab, vertical tab, form feed, new-line<br>
      - universal-character-name specifying U+000D (Carriage Return),
      U+0085 (Next Line), U+2028 (Line Separator), U+2029 (Paragraph
      Separator)</p></div></blockquote><div><br></div><div>Yep, but also  universal-character-name with the pattern_white_space property (which would be easier to list explicitly)</div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div>
    <p>Tom.<br>
    </p>
    <blockquote type="cite">
      <blockquote type="cite">
        <div dir="ltr">
          <div>* In phase 2, replace all characters which represent a
            line termination with Line Feed (which is reverted later for
            raw string literals). this would notably fix <a href="https://wg21.link/cwg1655" target="_blank">https://wg21.link/cwg1655</a><br>
          </div>
          <div>* It would also help to mandate that trailing whitespaces
            are removed in phase 2</div>
          <div><br>
          </div>
          <div>Does that make sense to anyone ? <br>
          </div>
        </div>
      </blockquote>
      <p>Without thinking too hard about it, this seems like a
        reasonable direction.</p>
      <p>I&#39;m not fond of adding an additional case of reversion for raw
        string literals though.<br>
      </p>
      <p>Tom.<br>
      </p>
      <br>
      <fieldset></fieldset>
    </blockquote>
    <p><br>
    </p>
  </div>

</blockquote></div></div>

