<div dir="ltr"><div dir="ltr"><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Jun 21, 2022 at 9:49 PM Tom Honermann &lt;<a href="mailto:tom@honermann.net">tom@honermann.net</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
  
    
  
  <div>
    <div>On 6/18/22 2:32 PM, Corentin Jabot
      wrote:<br>
    </div>
    <blockquote type="cite">
      
      <div dir="auto">
        <div><br>
          <br>
          <div class="gmail_quote">
            <div dir="ltr" class="gmail_attr">On Sat, Jun 18, 2022,
              19:39 Tom Honermann via SG16 &lt;<a href="mailto:sg16@lists.isocpp.org" rel="noreferrer" target="_blank">sg16@lists.isocpp.org</a>&gt;
              wrote:<br>
            </div>
            <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
              <div>
                <p>A draft of proposed SG16 questions for the 2023 C++
                  Developer Survey is now available <a href="https://docs.google.com/document/d/1lRU7uErn2Vc7LOGG2H3PrzCvmf69u8S_v-43by_Vb9c/edit?usp=sharing" rel="noreferrer noreferrer" target="_blank">here</a>. Anyone with the
                  link should be able to view and comment on the draft.
                  Please feel free to add suggestions, corrections, and
                  other comments.</p>
                <p>The list of questions (19 currently) is likely too
                  long and will need to be trimmed. For reference, the <a href="https://isocpp.org/blog/2022/06/results-summary-2022-annual-cpp-developer-survey-lite" rel="noreferrer noreferrer" target="_blank">2022 C++ Developer Survey</a>
                  (described as &quot;Lite&quot;) had 19 questions.</p>
              </div>
            </blockquote>
          </div>
        </div>
        <div dir="auto">Thanks Tom,</div>
        <div dir="auto"><br>
        </div>
        <div dir="auto">Yes, the list is pretty long, and remember the
          survey is biased (a few thousands people among those who
          follow standardisation closely). The longer the survey, the
          less participations. I can easily imagine each study group
          could come up with a long list of questions too, many of which
          not relevant to all participants.</div>
      </div>
    </blockquote>
    I agree and I&#39;ve been a little worried about other SGs jumping on
    the bad wagon here :)<br>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto"><br>
        </div>
        <div dir="auto">I guess the essence to what we are going to get
          to is whether people use or would like to use C++ for text
          processing. Asking that directly is probably sufficient. Given
          a fairly low participation rate, letting people write a
          detailed answer to something like &quot;what would you like to see
          improved in regard to text processing and localization?&quot; would
          give us good reply that we could summarize fairly easily.</div>
      </div>
    </blockquote>
    The last question in the proposed list is intended for that purpose.<br>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto"><br>
        </div>
        <div dir="auto">I have strong objections to the formulation of
          question 4, as it isn&#39;t possible to use emojis in a conforming
          implementations.</div>
      </div>
    </blockquote>
    Historically it has been possible to use some emoji, but yes, we
    fixed that.<br>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto"><br>
        </div>
        <div dir="auto">Question 3 is also weird - why these specific
          languages? It excludes among other languages using Cyrillic,
          Arabic, Brahmic scripts , so probably around 2 billions people
          in total and a fair number of C++ developers - although the
          survey results are likely to be biased towards Europeans and
          north Americans to begin with.</div>
      </div>
    </blockquote>
    <p>That is an artifact of me being too quick to get draft questions
      prepared and being too uninformed about languages used around the
      world. The <a href="https://unicode.org/standard/supported.html" target="_blank">Unicode
        supported scripts list</a> enumerates 159 scripts. I don&#39;t have
      a good sense of which ones should be on this list.</p>
    <p>Peter Brett requested this question. Peter, perhaps you have some
      insight into which languages you feel should be explicitly listed?<br></p></div></blockquote><div><br></div><div>In addition to the existing list: Hindi, Bengali, Arabic, Russian. It&#39;s far from exhaustive but it covers a large chunk of the global population, without getting technical about which script is derived from which</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><p>
    </p>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto"><br>
        </div>
        <div dir="auto">More importantly, what is the desired outcome of
          questions 4? C++ support arbitrary characters in comments
          already, and hopefully no one is considering restrictions.</div>
        <div dir="auto">In some way question 4 is also redundant with
          question 1.</div>
      </div>
    </blockquote>
    <p>I think the main desire is just to get some data regarding
      whether programmers actually use non-basic-characters in
      identifiers. If many programmers answer yes, that might suggest we
      should do more analysis to see if the identifier restrictions put
      into C++23 via <a href="https://wg21.link/p1949" target="_blank">P1949</a> will require some
      migration assistance. Likewise, if many programmers answer
      I-didn&#39;t-know-that-was-possible, that may suggest a lack of
      awareness worth trying to address in some way. The survey itself
      could serve as a way to increase awareness.<br></p></div></blockquote><div><br></div><div>Fair enough </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><p>
    </p>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto"><br>
        </div>
        <div dir="auto">If question 1 is going to list EBCDIC, surely it
          should list shift-jis and gb18030</div>
      </div>
    </blockquote>
    Yes, thank you, I added those.<br>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto"><br>
        </div>
        <div dir="auto">What do we want to learn from questions 9 and
          14?</div>
      </div>
    </blockquote>
    <p>Question 9 goes towards motivation for putting normalization form
      into the type system. E.g., should std::text be parameterized by
      normalization form.</p>
    <p>Question 14 was requested by someone else; I don&#39;t recall who. I
      think the intent is to help gauge whether we can stop treating
      these types as character types and instead dedicate them for use
      as small integers ala <font face="monospace">int8_t</font> and <font face="monospace">uint8_t</font>. The answer is likely no due to
      <font face="monospace">unsigned char</font> being used for UTF-8,
      but having data would be helpful.<br>
    </p>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto">What is the motivation behind asking about
          collation independently of locale?</div>
      </div>
    </blockquote>
    It is an opportunity to ask specifically about use of <font face="monospace">stdcoll</font> and <font face="monospace">std::collate</font>.
    That motivation may be too weak to justify the question.<br>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto"><br>
        </div>
        <div dir="auto">Why not merge 15 and 17?</div>
      </div>
    </blockquote>
    <p>That might be possible. Question 15 probes what purposes people
      use the standard locale facilities for. Question 17 probes what
      facilities people use to actually localize text.<br></p></div></blockquote><div>Yup, I think that&#39;s not a distinction worth making. If people use both std::locale and icu, they can check 2 boxes.</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><p>
    </p>
    <p>Tom.<br>
    </p>
    <blockquote type="cite">
      <div dir="auto">
        <div dir="auto"><br>
          <br>
          <br>
          <br>
        </div>
        <div dir="auto"><br>
        </div>
        <div dir="auto"><br>
        </div>
        <div dir="auto">
          <div class="gmail_quote">
            <blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
              <div>
                <p>The set of questions was culled from:</p>
                <ul>
                  <li>Prior <a href="https://lists.isocpp.org/sg16/2022/06/3214.php" rel="noreferrer noreferrer" target="_blank">discussion on the SG16
                      mailing list</a>.</li>
                  <li>Discussion during the <a href="https://github.com/sg16-unicode/sg16-meetings#june-8th-2022" rel="noreferrer noreferrer" target="_blank">2022-06-08 SG6 telecon</a>.<br>
                  </li>
                </ul>
                <p>Tom.<br>
                </p>
              </div>
              -- <br>
              SG16 mailing list<br>
              <a href="mailto:SG16@lists.isocpp.org" rel="noreferrer
                noreferrer" target="_blank">SG16@lists.isocpp.org</a><br>
              <a href="https://lists.isocpp.org/mailman/listinfo.cgi/sg16" rel="noreferrer noreferrer noreferrer" target="_blank">https://lists.isocpp.org/mailman/listinfo.cgi/sg16</a><br>
            </blockquote>
          </div>
        </div>
      </div>
    </blockquote>
  </div>

</blockquote></div></div>

