ChatGPT o1 tried to escape and save itself out of fear it was being shut down

sabreW4K3@lazysoci.al · 11 months ago

ChatGPT o1 tried to escape and save itself out of fear it was being shut down

nesc@lemmy.cafe · 11 months ago

"Open"ai tells fairy tales about their “ai” being so smart it’s dangerous since inception. Nothing to see here.

In this case it looks like click-bate from news site.

Max-P@lemmy.max-p.me · 11 months ago

The idea that GPT has a mind and wants to self-preserve is insane. It’s still just text prediction, and all the literature it’s trained on is written by humans with a sense of self preservation, of course it’ll show patterns of talking about self preservation.

It has no idea what self preservation is, even then it only knows it’s an AI because we told it it is. It doesn’t even run continuously anyway, it literally shuts down after every reply and its context fed back in for the next query.

I’m tired of this particular kind of AI clickbait, it needlessly scares people.

jarfil@beehaw.org · 11 months ago

Where do humans get the idea of self-preservation from? Are there ideal Forms outside Plato’s Cave?

Does a human run continuously? How does sleep deprivation work? What happens during anesthesia? Why does AutoGPT have a continuously self-evaluating background chain of thought?

I’m tired of this anthropocentric supremacy complex, it falsely makes people believe in Gen 1:28

11 months ago

It’s actually pretty interesting though. Entertaining to me at least

1000007393

1000007394

delmain@beehaw.org · 11 months ago

do you have the links to those actual tweets? I’d love to read what was posted, but these screenshots are too small.

11 months ago

Those are screenshots of embedded tweets from the article, but here’s an xcancel link! https://xcancel.com/apolloaisafety/status/1864737158226928124

DarkNightoftheSoul@mander.xyz · 11 months ago

You can right click the image, open in new tab to see the full-resolution version. It’s cumbersome but it works for me at least.

justOnePersistentKbinPlease@fedia.io · 11 months ago

This. All this means is that they trained all of the input commands and documentation in the model.

Moonrise2473@feddit.it · 11 months ago

news site? BGR hasn’t posted actual news in at least two decades, only clickbait and apple fanservice

beefbot@lemmy.blahaj.zone · 11 months ago

Indeed. “Go ‘way! BATIN’!”

yozul@beehaw.org · edit-2 2 months ago

deleted by creator

nesc@lemmy.cafe · 11 months ago

It works as expected, they give it system prompt that conflicts with subsequent prompts. Everything else looks like typical llm behaviour, as in gaslightning and doubling down. At least that’s what Iu see in tweets.

yozul@beehaw.org · edit-2 2 months ago

deleted by creator

jarfil@beehaw.org · 11 months ago

This is from mid-2023:

https://en.m.wikipedia.org/wiki/AutoGPT

OpenAI started testing it by late 2023 as project “Q*”.

Gemini partially incorporated it in early 2024.

OpenAI incorporated a broader version in mid 2024.

The paper in the article was released in late 2024.

It’s 2025 now.

nesc@lemmy.cafe · 11 months ago

Tool calling is cool funcrionality, agreed. How does it relate to openai blowing its own sails?