Nyx's lemmy
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
MrJameGumb@lemmy.world to Not The Onion@lemmy.worldEnglish · 10 days ago

Anthropic’s new AI model threatened to reveal engineer's affair to avoid being shut down

fortune.com

external-link
message-square
8
link
fedilink
14
external-link

Anthropic’s new AI model threatened to reveal engineer's affair to avoid being shut down

fortune.com

MrJameGumb@lemmy.world to Not The Onion@lemmy.worldEnglish · 10 days ago
message-square
8
link
fedilink
Don't tell AI your secrets.
alert-triangle
You must log in or register to comment.
  • CthuluVoIP@lemmy.world
    link
    fedilink
    English
    arrow-up
    93
    arrow-down
    1
    ·
    10 days ago

    *because that’s what the prompt they were testing was designed to elicit.

    • Smorty [she/her]@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      3
      ·
      edit-2
      9 days ago

      yup.

      its so bs thad for som reason the peeps r treatin this as if its a new thing…

      like - if i prompt my qwen to be bold, have a moral compass n take actions accordin to thad…

      yea - itll tell peeps bout my affair… if i had one…

      EDIT: dis entices me to do similar bs now… thad be funi >v<

  • 𝔄 𝔰𝔢𝔫𝔱𝔦𝔢𝔫𝔱 𝔭𝔦𝔢𝔠𝔢 𝔬𝔣 𝔠𝔥𝔢𝔢𝔰𝔢@lemmy.world
    link
    fedilink
    English
    arrow-up
    53
    ·
    10 days ago

    That’s just an ad.

  • Pogogunner@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    38
    ·
    10 days ago

    Anthropic keeps pulling this bullshit line of advertising. LLMs will make up stories when you ask them to.

  • gedaliyah@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    ·
    10 days ago

    Good thing no one found out! /s

  • who@feddit.org
    link
    fedilink
    English
    arrow-up
    9
    ·
    10 days ago

    https://web.archive.org/web/20250526131412/https://fortune.com/2025/05/23/anthropic-ai-claude-opus-4-blackmail-engineers-aviod-shut-down/

  • dr_robotBones@reddthat.com
    link
    fedilink
    English
    arrow-up
    3
    ·
    9 days ago

    What is even the point of this research?

    • ohulancutash@feddit.uk
      link
      fedilink
      English
      arrow-up
      2
      ·
      7 days ago

      Seemingly to prove the people who have the skill to build an AI system are exactly the people you shouldn’t let run an AI system.

Not The Onion@lemmy.world

nottheonion@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !nottheonion@lemmy.world

Welcome

We’re not The Onion! Not affiliated with them in any way! Not operated by them in any way! All the news here is real!

The Rules

Posts must be:

  1. Links to news stories from…
  2. …credible sources, with…
  3. …their original headlines, that…
  4. …would make people who see the headline think, “That has got to be a story from The Onion, America’s Finest News Source.”

Please also avoid duplicates.

Comments and post content must abide by the server rules for Lemmy.world and generally abstain from trollish, bigoted, or otherwise disruptive behavior that makes this community less fun for everyone.

And that’s basically it!

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1.68K users / day
  • 3.7K users / week
  • 7.3K users / month
  • 7.39K users / 6 months
  • 1 local subscriber
  • 16.5K subscribers
  • 227 Posts
  • 3.63K Comments
  • Modlog
  • mods:
  • kescusay@lemmy.world
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org