Thanks Ben; this is a valuable answer, and I appreciate it.

Cheers!


On Mon, Apr 25, 2022 at 21:19, Ben Boeckel <ben.boeckel@kitware.com> wrote:
[ What follows is a personal opinion, not one tied to my role on CMake.
  I am also not an implementer, but hopefully I can at least clear some
  things up from my experience as a build systems guy. ]

On Mon, Apr 25, 2022 at 18:41:46 -0400, Patrice Roy via Ext wrote:
> I think a piece from this discussion is missing : there seems to be strong
> resistance from some implementers as to supporting Tom's "Congrats Gaby"
> hello-world-style program that would only depend on a modularized standard
> library (let's leave Boost and other well-known but non-std libraries for
> the moment). This resistance would be hard to explain to users without
> knowing more about the reasons for this resistance.
>
> Would an implementer care to explain why this seems so unreasonable without
> a build system? Ideally, comparing the "old-style" (lexically included
> headers) approach to the modules-based approach.
>
> From this, it would at least be easier to explain to beginners why just
> compiling their simple, standard-library-only programs requires more
> tooling than it used to. Everyone would benefit from that knowledge, or so
> it seems to me. My users are game programmers; they are experienced, they
> use build systems, but they also compile small test code manually at the
> command line and if they cannot use modules for this, they will ask why and
> I would really like to have an answer. It's not a sand-castle vs skyscraper
> issue; it's something they will need to know to integrate it in their
> workflow.

Note that I go further than just "standard-library-only" here, but even
the standard library is not immune to flags passed on the command line:
it can transform itself based on `-ffast-math` and other ABI-affecting
flags that put it right back into "cannot treat it as built-in"
territory, and such flags are common enough that shipping prebuilts per
configuration is infeasible. Not to mention Linux setups where `clang`
uses `libstdc++`, or Apple setups where `gcc` uses `libc++`, where the
stdlib is suddenly *not* trivially "associated with the compiler" and is
much closer to "just another external dependency". Or that some
toolchains have historically reused platform standard libraries rather
than bringing their own (IIRC, pre-OneAPI `icc` and `pgi` have done
this, though their direct applicability to C++20 is likely low), or
projects such as STLport, which have been standalone standard library
implementations.

I find that the "requires more tooling" situation exists because the
standard refuses to talk about code in anything other than abstract "TU"
components. While this has merits, it also has costs. Because the
standard does not say what `import foo;` means other than through
verbiage like "makes names reachable", handling such code isn't grounded
in anything beyond "implementers will provide mechanisms to make such
imports have meaning". It has no relation to filesystems (be they
conventional or "archive as filesystem" FUSE-like interfaces to other
things that can be treated as filesystems in some way), so there needs
to be some mechanism to translate `import foo;` into "here's what that
means for this TU". Right now, we only have flags like `/reference`
(MSVC) or `-fmodule-mapper=` (GCC) to specify these things, but filling
them out is the hard part. Now, the compiler could certainly try to
answer this on its own with some to-be-decided-upon rules, but C++
projects historically end up throwing so much semantically meaningful
metadata (read: compiler flags) on top of what *their* module means that
any default guessing scheme would be unsuitable for some substantial
portion of the userbase (cf. `FOO_IS_SHARED` defines for library
visibility macro expansion, `BUILT_WITH_SOME_OPTIONAL_DEP` defines
altering available APIs, `-Ofast` for some performance-sensitive
component, `WITH_DEBUG_MEMBERS`, etc.).
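For concreteness, GCC's `-fmodule-mapper=` can be pointed at a flat
mapper file associating each module name with its compiled module
interface (CMI). A sketch in the two-column format from GCC's C++
Modules documentation; the module names and paths here are invented:

```
$root build/cmi
foo foo.gcm
third.party.bar deps/third.party.bar.gcm
```

Producing such a mapping, and keeping it consistent with the flags each
CMI was built under, is exactly the work that lands on the build system.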

I'll also note that backwards compatibility has a *lot* of value in
minimizing churn of known-working code, but it also ends up welding shut
doors that one might want to use. Just as an example, the list
representation in CMake makes the `;` an absolute landmine and
complicates safely passing CMake values around. Would it be nice if one
could just do `cmake -Dfoo=${foo}` to pass a value along to some build
command? Sure, but breaking every non-playground CMake project in the
process is not worth that price.

Could C++ have said things like "the source encoding must be compatible
with module lookup namespaces" or "filenames must correlate with module
names"? Sure. But then folks on non-UTF-8 platforms or in non-Unicode
locales get upset. Could C++ have then said "modules must be
self-contained" and allowed compilers to figure out what to do just
based on
the source? Sure, but then there'd be things like `#pragma flag`
ifdeffery preludes or `/** semantic comment */` to do what is possible
today without some other more structured mechanism also being available
(for prior art, see Rust's `#![feature()]` and `#![cfg()]` attributes,
Haskell's `{-# LANGUAGE #-}` syntax, Python's magic `from __future__`
mechanism, or CMake's policy scopes).

But it didn't. Did everyone understand that C++ chose a module system
isomorphic to Fortran's instead of something like Python or Rust?
Unlikely. But it's what we have. I can foresee projects being built
using tools that cobble together a Rust-like or Haskell-like "here's a
pile of sources and high-level dependency metadata, please build it"
experience, but the problem happens when a project wants to interface
with external code *not* using this pattern. All manner of digital ink
has been spilled about Cargo not "playing well" with
non-Rust-centered build systems (Cabal is largely the same, but,
rounding, "no one" is using Haskell in this way). Sure, Python and Rust
both have "here's some C or C++ code, please build it" helper tools, but
trying to use these to build existing projects that have long leveraged
the flexibility C and C++ builds have offered (say, HDF5) is like
bringing a squeaky toy toolbox to a construction site: it's just not
going to cut it for many widely-used existing projects. Consuming and
understanding extant external code is fraught under such a model and
that's where a lot of C++'s value is to large projects.

Now, what do I think it would take to make this stuff much more possible
within the limits we have? SG15 is discussing it. What has been proposed
(though I am not aware of a paper number as yet) is basically some
sidecar metadata to say something to the effect of "here is what C++
information is important to consume this project". Rust has this as
crate metadata (not typically distributed) and just needs to be told
"here is a compiled crate, please use it"; Python has some mechanism for
its packages as well (including `.pth` files and other things that have
accumulated over the years) that can supplement available packages.
These tools know how to handle such metadata and consume it natively.
Given that there's already an implementation out there, and the sidecar
metadata hasn't even been formally proposed, trying to mandate any such
metadata at this point is like starting to build a cart for a horse that
is already standing at the starting gate. So, C++ build systems are the
level at which this is dealt with for now (though that doesn't preclude
some basic support from compilers themselves, it is not trivial and I
don't foresee implementers champing at the bit to put even more
fractally detailed work onto their plates).
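To make that concrete, such a sidecar file might look something like the
following. This is purely illustrative; no format has been proposed, let
alone standardized, and every field name here is invented:

```
{
  "name": "boost-regex",
  "version": "1.0.0",
  "compile-definitions": ["BOOST_REGEX_DYN_LINK"],
  "module-interfaces": { "boost.regex": "cmi/boost.regex.bmi" },
  "link-libraries": ["lib/libboost_regex.so"]
}
```

A build system could translate such a description into whatever flags
the local compiler actually understands.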

In short, I would describe it as "with great power comes great
responsibility". The power of modules to consume APIs more precisely,
beyond being equivalent to a fancy:

    xargs -a included-files cat | $(CC) -x c++ /dev/stdin

and hoping everything seeing the same content gets the same idea of
what's going on now comes with the responsibility to tell the compiler
more about dependencies beyond "look here for API descriptions and this
file to the linker" and hoping that no one has:

  - specified flags that modify the headers in some meaningful way
  - given the wrong library to the linker
  - given different directories to different TUs for the same include
  - disagreed on what other dependencies used in the API mean
    (`_ITERATOR_DEBUG_LEVEL`, Boost's `NDEBUG`-optional members, etc.)
  - paired the wrong headers with the library (e.g., macOS's SDK Python
    headers with a Homebrew Python library)

Would it be ideal to just say something along the lines of:

    $(CC) -fdepend-on=/path/to/boost-regex.latest.json \
      -c -o uses-boost.o \
      uses-boost.cpp
    $(CC) -fdepend-on=/path/to/boost-regex.latest.json \
      -fdependency-metadata-output=uses-boost.1.0.0.json \
      -shared -o uses-boost.so \
      uses-boost.o

Yes, I'd love it. But we're not there yet and until then, we'll need
build systems to dig into any such `boost-regex.latest.json` and translate
that into flags to pass to the compilers that exist today.
Unfortunately, given the standard library's sensitivity to consumer
patterns, it is subject to all of the same problems. It could be
supported in
the absolute simplest situations, but the path for that is *very* narrow
and people will stray off the beaten path far too easily
(`clang`/`clang-tidy` on Linux and `gcc` on macOS being the most common
I can think of without even considering compiler flag interactions).

--Ben