mouthporn.net
#these ai guys are so gdamn lazy – @zenosanalytic on Tumblr
Avatar

Racing Turtles

@zenosanalytic / zenosanalytic.tumblr.com

"Why run, my little Phoenician?"
Avatar
Avatar
ralfmaximus

To understand what's going on here, know these things:

  1. OpenAI is the company that makes ChatGPT
  2. A spider is a kind of bot that autonomously crawls the web and sucks up web pages
  3. robots.txt is a standard text file that most web sites use to inform spiders whether or not they have permission to crawl the site; basically a No Trespassing sign for robots
  4. OpenAI's spider is ignoring robots.txt (very rude!)
  5. the web.sp.am site is a research honeypot created to trap ill-behaved spiders, consisting of billions of nonsense garbage pages that look like real content to a dumb robot
  6. OpenAI is training its newest ChatGPT model using this incredibly lame content, having consumed over 3 million pages and counting...

It's absurd and horrifying at the same time.

You are using an unsupported browser and things might not work as intended. Please make sure you're using the latest version of Chrome, Firefox, Safari, or Edge.
mouthporn.net