Anthropic’s new AI model threatened to reveal engineer's affair to avoid being shut down

MrJameGumb@lemmy.world · 10 days ago

CthuluVoIP@lemmy.world · 10 days ago

*because that’s what the prompt they were testing was designed to elicit.

Smorty [she/her]@lemmy.blahaj.zone · edit-2 9 days ago

yup.

its so bs thad for som reason the peeps r treatin this as if its a new thing…

like - if i prompt my qwen to be bold, have a moral compass n take actions accordin to thad…

yea - itll tell peeps bout my affair… _if _i _had _one…

EDIT: dis entices me to do similar bs now… thad be funi >v<

10 days ago

That’s just an ad.

Pogogunner@sopuli.xyz · 10 days ago

Anthropic keeps pulling this bullshit line of advertising. LLMs will make up stories when you ask them to.

gedaliyah@lemmy.world · 10 days ago

Good thing no one found out! /s

who@feddit.org · 10 days ago

dr_robotBones@reddthat.com · 9 days ago

What is even the point of this research?

ohulancutash@feddit.uk · 7 days ago

Seemingly to prove the people who have the skill to build an AI system are exactly the people you shouldn’t let run an AI system.