~cnx/loang

1

AI scrapping bot ban

Adolfo Santiago <epoch@adol.pw>
Details
Message ID
<ohf2h2ji46eis3wp6tvmbdtncfluld4p2stl33agl7zczdhaoe@4cyllbaxehva>
DKIM signature
missing
Download raw message
Hello, loang users.

I write to the list in order to propose a wide-block of AI bots scrapping
anything from loang services.

Recently I came accross the GPTBot[1] documentation, where it says how to block
the bot if you don't want it to scrap anything.

The idea would be to not allow the bot the posibility of scrapping anything,
regardless of the loang service(s) you're using.

Hope to hear back from you.

Thank you for your time,
Adolph

[1]: https://platform.openai.com/docs/gptbot
Details
Message ID
<CUUOGO9BK2TM.9XKM0TV7IDAT@guix>
In-Reply-To
<ohf2h2ji46eis3wp6tvmbdtncfluld4p2stl33agl7zczdhaoe@4cyllbaxehva> (view parent)
DKIM signature
missing
Download raw message
On 2023-08-16 at 16:19+02:00, Adolfo Santiago wrote:
> I write to the list in order to propose a wide-block
> of AI bots scrapping anything from loang services.
>
> Recently I came accross the GPTBot[1] documentation,
> where it says how to block the bot
> if you don't want it to scrap anything.
>
> The idea would be to not allow the bot the posibility
> of scrapping anything, regardless of the loang service(s)
> you're using.
>
> [1]: https://platform.openai.com/docs/gptbot

Thanks for reminding me of this!
I saw this on fedi a few days ago and forgot about it.

Personally I don't object scraping, but given OpenAI
doing it to mass-launder copyright and its oligopolistic power,
I am open to blocking it server-wide.

Until anyone wants to opt into the OpenAI dataset,
I will set up the firewall to block its IP ranges.

Do note that whatever one puts out to the public interwebs
will eventually be scraped.  A resource could be mirrored
by a archiving initiative and OpenAI could scrape from there.
Other big tech corps are also building their own LLM.
Think of the block more as an act of protest, if anything.
Reply to thread Export thread (mbox)