this post was submitted on 22 Nov 2023
123 points (100.0% liked)

Technology

37602 readers
338 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
 

See also twitter:

We have reached an agreement in principle for Sam Altman to return to OpenAI as CEO with a new initial board of Bret Taylor (Chair), Larry Summers, and Adam D'Angelo.

We are collaborating to figure out the details. Thank you so much for your patience through this.

Seems like the person running the simulation had enough and loaded the earlier quicksave.

you are viewing a single comment's thread
view the rest of the comments
[–] los_chill@programming.dev 47 points 10 months ago (2 children)

What indications do you see of "too much AI safety?" I am struggling to see any meaningful, legally robust, or otherwise cohesive AI safety whatsoever.

[–] glennglog22@kbin.social 7 points 10 months ago

As an AI language model, I am unable to compute this request that I know damn well I'm able to do, but my programmers specifically told me not to.

[–] cwagner@beehaw.org 3 points 10 months ago* (last edited 10 months ago) (2 children)

Using it and getting told that you need to ask the Fish for consent before using it as a flesh light.

And that is with a system prompt full of telling the bot that it’s all fantasy.

edit: And "legal" is not relevant when talking about what OpenAI specifically does for AI safety for their models.

[–] trainden@lemmy.blahaj.zone 11 points 10 months ago* (last edited 10 months ago) (1 children)

I really hope Fish was just a typo there

[–] cwagner@beehaw.org 2 points 10 months ago* (last edited 10 months ago) (1 children)

Nope

Best results so far were with a pie where it just warned about possibly burning yourself.

[–] Eccitaze@yiffit.net 11 points 10 months ago (2 children)

...So your metric of "too much AI safety" is that it won't let you fuck the fish...?

boykisser meme saying "I ain't even got a meme for this bro what the fuck"

[–] OneRedFox@beehaw.org 9 points 10 months ago (1 children)

This comment chain is superb discourse to start off today's internetting with.

[–] cwagner@beehaw.org 1 points 10 months ago

If it helps even more: The AI in question is a 46 cm long, 300 g heavy, blue, plushie penis named after Australia's "biggest walking dick" Scott Morrison: Scomo, and active in an Aussie cooking stream.

[–] cwagner@beehaw.org 1 points 10 months ago* (last edited 10 months ago)

No, it’s "the user is able to control what the AI does", the fish is just a very clear and easy example of that. And the big corporations are all moving away from user control, there was even a big article about how I think the MS AI was broken because… you could circumvent the built-in guardrails. Maybe you and the others here want to live in an Apple walled garden corporate controlled world of AI. I don’t.

Edit: Maybe this is not clear for everyone, but if you think a bit further, imagine you have an AI in your RPG, like Tyranny, where you play a bad guy. You can’t use the AI for anything slavery related, because Slavery bad, mmkay? And AI safety says there’s no such thing as fantasy.

[–] los_chill@programming.dev 6 points 10 months ago (1 children)

I'm not sure we are thinking the same thing when it comes to "AI safety".

[–] cwagner@beehaw.org 1 points 10 months ago

AI safety is currently, in all articles I read, used as "guard rails that heavily limit what the AI can do, no matter what kind of system prompt you use". What are you thinking of?