this post was submitted on 09 Mar 2025
344 points (98.9% liked)
Technology
My worry is that these social media alternatives might get scraped by these AI companies as well.
Sure, a company handing it over is much easier (e.g. Reddit). But with the decentralized nature, everyone needs to protect their instances themselves, and I’m not sure how well everyone will be able to do that.
Definitely much more difficult, so it’s a step in the right direction.
Everything on the Fediverse is almost certainly scraped, and will be repeatedly. You can't "protect" content that is freely available on a public website.
I do not entirely agree.
While what you said might be true for content that we post, things like view history and tracking data are much more difficult to scrape. That metadata does help with tagging content.
Yeah, fair enough. I was referring to posts and comments, not other metadata, because that isn't publicly available via just a GET request (as far as I'm aware).