this post was submitted on 16 May 2025
625 points (97.3% liked)
Technology
Relevant quote because one of us didn't read the article for sure.
Edit: not to mention that believing a system prompt somehow binds or constrains rather than influences these systems would also indicate to me that one of us definitely doesn't understand how these work, either.
That doesn’t say anything about the content of the modification itself. For all you know, the internal policy could be that white genocide is a thing. But what they are in fact referring to as violating the internal policies is modifying the prompt in such a way that it takes a specific stance on a political issue. C'mon man, use your brain, it’s not that fricking hard.
If the prompt had said that white genocide is a thing, the AI would likely have said something along the lines of it being a nuanced topic of debate that depends on how you define the situation, or some other non-answer. But the AI was consistently taking the stance that it was misinformation, and that tells you what the prompt was. It was also reported in other outlets that this was in fact what the modification was: to not spread misinformation about that.
You continue to spout things with no citations and a bad vibe. I am done here.