ISOCPP std-proposals List: Re: [std-proposals] Dedicated website with AI that has processed all papers

From: Frederick Virchanza Gotham <cauldwell.thomas_at_[hidden]>
Date: Wed, 28 May 2025 00:06:48 +0100

On Mon, May 26, 2025 at 4:48 AM René Ferdinand Rivera Morell wrote:
>
> I do not give you permission to use my copyrighted works to train an AI.
> And I doubt that ISO would give you permission to use ISO/IEC copyrighted
> works to train an AI. If you do, or have, gone down that road.. Remove my
> copyrighted content from your AI data immediately.

I haven't got a proper graphics card so I haven't been doing much on
the AI side of things. Using my CPU is way too slow to get any real
testing done. Currently I'm focusing on using the Xapian library to do
semantic search through the ~6000 paper files. But my program will
have a few tabs for searching in a few different ways (e.g. a 'grep
regex' tab, an 'apache solr' tab, a 'manticore' tab). Actually I'm
spending most of my time right now on the tab that lets you view any
individual paper, which I have working perfectly on macOS but not
quite there yet on Linux with the javascript in the HTML files. Of
course my program will be entirely open-source and free to use.

I haven't read the small print on what you agreed to when you emailed
Nevin to get a document number, but I'll give consideration to it
before releasing version 1 of my program. I think you retain the
copyright of what you wrote, but obviously if you submit a paper,
other people have to be able to work with it. I can't just write a
paper and submit it and mandate that everyone has to be drinking
decaffeinated coffee when they read it, or that they can store it on
an USB stick but only if the USB stick was purchased on a night when
there was a full moon.

Obviously I want to be a nice person here and not use any person's
work in a way that they don't want me to, and normally I would always
respect a person's wishes whenever they say "Don't do that with my
work", but it muddies the waters a bit when you have submitted the
paper to be analysed by the entire worldwide C++ community. I feel an
entitlement to be able to work with your paper since you put it in the
public domain to be scrutinised and voted on. Whether or not this
entitles me to use an AI model to do retrieval-augmented generation on
it, I haven't come to a conclusion on yet.

But look, for the time being I've excluded your papers from the AI
processing, such as P2656 and P2810. In the future if I re-visit this
decision, I will at least email you to let you know. And maybe the ISO
committee should clarify in black and white whether, when you submit a
paper, you're agreeing to it being processed by a large language
model. (Or maybe they've done that already and I just haven't seen it
in the small print).

By the way I presume you don't mind me using the likes of the Xapian
library to do a semantic search through your papers; correct me if I'm
wrong though.

Received on 2025-05-27 23:07:00