<div dir="auto"><div>Tom,<div dir="auto"><br></div><div dir="auto">That analysis has been performed, many times over. Literally tens if not hundreds of hours at this point.<div dir="auto">I do not know how to convey that in other ways than what I already tried.</div><div dir="auto"><br></div><div dir="auto">All the terminal tested match the behavior of what is proposed in the paper modulo tofu and other rendering bugs</div><div dir="auto"><br></div><div dir="auto">I looked at code of all these terminal too, it&#39;s also in the paper. The codepoints in the screenshot are the codepoints that sg-16 specifically asked about at the last meeting.</div><div dir="auto"><br></div><div dir="auto">I am glad to see that your observations concur.</div></div><br><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, Dec 14, 2022, 19:30 Tom Honermann &lt;<a href="mailto:tom@honermann.net">tom@honermann.net</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
  
    
  
  <div>
    <div>On 11/30/22 5:26 PM, Corentin via SG16
      wrote:<br>
    </div>
    <blockquote type="cite">
      
      <div dir="ltr">
        <div>Hello folks.</div>
        <div>Here is a list of all the codepoint that change</div>
        <div><a href="https://gist.githubusercontent.com/cor3ntin/b7f4f52893b0b54890e970f7bbec6118/raw/720a910585d78c9ceb4e0458dcef87af2a436121/width.md" target="_blank" rel="noreferrer">https://gist.githubusercontent.com/cor3ntin/b7f4f52893b0b54890e970f7bbec6118/raw/720a910585d78c9ceb4e0458dcef87af2a436121/width.md</a><br>
        </div>
      </div>
    </blockquote>
    <p>Just a note: the gist has 8570 characters and that count matches
      the ranges specified in the <a href="https://isocpp.org/files/papers/D2675R1.pdf" target="_blank" rel="noreferrer">D2675R1</a>
      annex.<br>
    </p>
    <blockquote type="cite">
      <div dir="ltr">
        <div><br>
        </div>
        <div>Simply cat that file in the terminal.</div>
        <div>The screenshot below is a render on ITerm2</div>
        <div>You will notice the tofu for reserved codepoints is
          considered narrow</div>
        <div>but doesn&#39;t quite fit so it overlaps with the next cell,
          same for the number in square.</div>
        <div><br>
        </div>
        <div><br>
        </div>
        <img src="cid:part1.h0wnrOZm.U6amTuFE@honermann.net" alt="Screenshot 2022-11-30 at 23.19.47.png" width="452" height="283"><br>
      </div>
    </blockquote>
    <p>As discussed previously, a single screen shot that only shows a
      small subset of the relevant characters is not sufficient to
      demonstrate that the conclusions of the paper are consistent with
      existing behavior. I continue to have reservations about the
      screen shots in the paper for this reason; I don&#39;t see how they
      provide useful information at all. I think they are actively
      misleading since they do not appear to show behavior that is
      consistent with the intent of the paper.</p>
    <p>I spent some time analyzing the behavior of all 8570 characters
      in the terminal I use (Konsole 12.12.3 with the Hack 10pt font).
      Here is what I found:</p>
    <ul>
      <li>For the characters that the paper changes from width 1 to
        width 2 (based on the listings in the annex), the following are
        displayed with a width other than 2:</li>
      <ul>
        <li>Width 0:</li>
        <ul>
          <li>U+016FE4 (KHITAN SMALL SCRIPT FILLER) <br>
          </li>
        </ul>
        <li>Width 1: (These were all displayed as tofu; some are
          probably unassigned characters, others are probably unknown by
          the font)<br>
        </li>
        <ul>
          <li>U+01AFF0 .. U+01AFFE<br>
          </li>
          <li>U+01B11F ..  </li>
          <li>U+01F6DC .. U+01F6DF<br>
          </li>
          <li>U+01F7F0<br>
          </li>
          <li>U+01FA75 .. U+01FA77<br>
          </li>
          <li>U+01FA7B .. U+01FA7C<br>
          </li>
          <li>U+01FA87 .. U+01FA88<br>
          </li>
          <li>U+01FAA9 .. U+01FAAF<br>
          </li>
          <li>U+01FAB7 .. U+01FABF<br>
          </li>
          <li>U+01FAC3 .. U+01FACF<br>
          </li>
          <li>U+01FAD7 .. U+01FAF8<br>
          </li>
        </ul>
      </ul>
      <li>For the characters that the paper changes from width 1 to
        width 2 (based on the listings in the annex), the following are
        displayed with a width other than 1:</li>
      <ul>
        <li>Width 2:</li>
        <ul>
          <li>U+003248 .. U+00324F (CIRCLED NUMBER TEN ON BLACK SQUARE
            .. CIRCLED NUMBER EIGHTY ON BLACK SQUARE)<br>
          </li>
        </ul>
      </ul>
    </ul>
    <p>These results strongly match the intent of the paper and that the
      open question regarding the last group of characters should be
      answered such that they do not change width.</p>
    <p>This is the kind of analysis I would like to see performed for
      other terminals so that we can qualitatively compare behavior
      between them. I attached C++ source code I used to display the
      characters.<br>
    </p>
    <p>Tom.<br>
    </p>
    <blockquote type="cite">
      <div dir="ltr">
        <div><br>
        </div>
        <div><br>
        </div>
        <div>FYI Iterm2 also uses Unicode UAX 44</div>
        <div><a href="https://github.com/gnachman/iTerm2/blob/master/sources/NSCharacterSet+iTerm.m#L464" target="_blank" rel="noreferrer">https://github.com/gnachman/iTerm2/blob/master/sources/NSCharacterSet+iTerm.m#L464</a><br>
        </div>
      </div>
      <br>
      <fieldset></fieldset>
    </blockquote>
  </div>
</blockquote></div></div></div>

