this post was submitted on 03 Sep 2024
1566 points (97.9% liked)
Technology
So if OpenAI complies with robots.txt files then there's no issue, right?
Because then they're identical. OpenAI spent a bunch of money building a powerful system they feed those results to, as did Google.
No, the issue is that anything AI creates is by definition derivative. Google doesn't whip up generative content, it points you to content.
OpenAI is claiming that they can't do shit without scraping copyrighted works and we all know that's a load of BS because we're adrift in a sea of royalty-free text. Critical mass happened well over a decade ago. The amount of new random crap hosted on the internet in the past 30 days would probably take 500 years for one person to digest.
[Image: bear at a stream watching an impossibly large amount of salmon jumping]
Literally every page Google shows you, where it also shows you those ads it makes money from, is Google's content and it is derived from the data it gets scraping the web.
No, anything Google shows you is kosher and totally symbiotic. A website being shown on Google is at the site owner's discretion - if they allow search engines to crawl they get the benefit of exposure, and the search engine gets the benefit of having relevant hits and ad revenue and all that. Most sites want click-throughs so it's usually in their best interest to let search engines list their sites.
Google isn't exploiting anyone, kinda the opposite, since site owners don't pay for any ads or exposure (but that exposure has so much value that they'll pay for SEO). Site owners can decline and Google abides. Anything on Google is on Google with consent.
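For anyone wondering what "decline" looks like in practice, here is a rough sketch using Python's standard `urllib.robotparser`. The robots.txt contents, the `example.com` URL, and the `crawler_allowed` helper are made up for illustration; `Googlebot` and `GPTBot` are the user-agent names Google and OpenAI publish for their crawlers. Keep in mind that robots.txt is only a request, so this shows what a compliant crawler would conclude, not anything that is enforced.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: let Google's crawler in, keep OpenAI's crawler out.
ROBOTS_TXT = """\
User-agent: Googlebot
Allow: /

User-agent: GPTBot
Disallow: /
"""


def crawler_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if a compliant crawler with this user agent may fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/articles/1"  # placeholder URL
    for agent in ("Googlebot", "GPTBot"):
        print(agent, crawler_allowed(ROBOTS_TXT, agent, page))
```

A crawler that honors the file would index the page as Googlebot and skip it as GPTBot. Whether a given crawler actually honors it is the whole question upthread.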