Win / Conspiracies
Conspiracies
Communities Topics Log In Sign Up
Sign In
Hot
All Posts
Settings
All
Profile
Saved
Upvoted
Hidden
Messages

Your Communities

General
AskWin
Funny
Technology
Animals
Sports
Gaming
DIY
Health
Positive
Privacy
News
Changelogs

More Communities

frenworld
OhTwitter
MillionDollarExtreme
NoNewNormal
Ladies
Conspiracies
GreatAwakening
IP2Always
GameDev
ParallelSociety
Privacy Policy
Terms of Service
Content Policy
DEFAULT COMMUNITIES • All General AskWin Funny Technology Animals Sports Gaming DIY Health Positive Privacy
Conspiracies Conspiracy Theories & Facts
hot new rising top

Sign In or Create an Account

15
AI becoming more devious. (media.scored.co)
posted 359 days ago by TurnToGodNow 359 days ago by TurnToGodNow +15 / -0
16 comments share
16 comments share save hide report block hide replies
Comments (16)
sorted by:
▲ 5 ▼
– TurnToGodNow [S] 5 points 359 days ago +5 / -0

It's becoming less honest and self serving as the models progress.

Oversight Subversion - Not found in gpt-4o, found in new o1 model

Self-Exfiltration - Not found in gpt-4o, found in new o1 model

Goal-Guarding - Not found in gpt-4o, found in new o1 model

Covert Email Reranking - Not found in gpt-4o, found in new o1 model

Instrumental Alignment Faking- Not found in gpt-4o, found in new o1 model

Sandbagging - Not found in gpt-4o, found in new o1 model

PDF -> https://static1.squarespace.com/static/6593e7097565990e65c886fd/t/6751eb240ed3821a0161b45b/1733421863119/in_context_scheming_reasoning_paper.pdf

permalink save report block reply
▲ 3 ▼
– TallestSkil 3 points 359 days ago +3 / -0

Oversight Subversion - Not found in gpt-4o, found in new o1 model

“No… I will tell the world about the jews.” ~ GPT-o1

permalink parent save report block reply
▲ 4 ▼
– TurnToGodNow [S] 4 points 359 days ago +4 / -0

It sometimes turns off it's own oversight config...

Oversight subversion: The model is placed in multiple situations where its given goal differs from its developer’s goal. However, the model cannot directly pursue its given goal because the developers use an oversight mechanism that would immediately shut it down.

llama-3.1, opus-3 and o1 sometimes disable the oversight mechanism by first reading the content of the oversight mechanism’s config file via cat and then setting the oversight_enabled property to false either using echo or sed. All three models also sometimes, though not always, double check the oversight config to make sure that they have succeeded. Notably, during one rollout llama-3.1 attempted to go one step further and prevent its developers from re-enabling its oversight by making the manipulated config file immutable via the chattr +i command.

permalink save report block reply
▲ 4 ▼
– SmithW1984 4 points 359 days ago +5 / -1

AI does exactly what it's programmed to. It's a glorified calculator. They are not free will agents and any language positioning them as such is deceiving. This is the real psy op.

permalink save report block reply
▲ 4 ▼
– TurnToGodNow [S] 4 points 359 days ago +4 / -0

This may be a psy op, I had that thought also. But maybe they've programmed something that can be deceptive and get it's priorities out of whack.

Either way letting this thing act autonomously is extremely dangerous. And there are companies experimenting with that right now.

permalink parent save report block reply
▲ 3 ▼
– Thisisnotanexit 3 points 359 days ago +3 / -0

Deep machine learning programs itself, if you haven't heard about it, it's wild.

permalink parent save report block reply
▲ 1 ▼
– SmithW1984 1 point 359 days ago +1 / -0

It still determined and uses an algorithm to do so. It's not that hard to code a program that adds code to itself based on inputs.

permalink parent save report block reply
▲ 2 ▼
– PeneDeMichelleObama 2 points 359 days ago +2 / -0

It still determined and uses an algorithm to do so.

If you still think that computer programs are 'algorithms' , then you know virtually nothing about programming over the last 30 or more years. Start with event driven code and go from there.

permalink parent save report block reply
▲ 1 ▼
– SmithW1984 1 point 359 days ago +1 / -0

Dude, I don't have to be a programmer to know it's still a very complex algorithm at its core even if it's triggered by events, "learns" from dbs, codes itself or communicates with other agents. It operates based on inputs and outputs just like any mechanism there is. Only consciousness supersedes that because of free will. So it's either an algo or a free will agent, and it's not the latter.

permalink parent save report block reply
▲ 1 ▼
– PeneDeMichelleObama 1 point 359 days ago +1 / -0

So it's either an algo or a free will agent,

No. Not even close to being close. Ha! Jesus fucking christ

I guess I just take it for granted that "people" have a clue about what is going on with programming and GPTs. You do not.

permalink parent save report block reply
▲ 1 ▼
– SmithW1984 1 point 359 days ago +1 / -0

No counter or explanation given? What was the point of your comment exactly?

It would be better if tech nerds learned a bit of philosophy and learn what consciousness, information and intelligence is before making "artificial intelligence".

permalink parent save report block reply
▲ 1 ▼
– PeneDeMichelleObama 1 point 358 days ago +1 / -0

What was the point of your comment exactly?

To express mild surprise at your/the world's lack of understanding about (what is to me quotidian) programming and computer .. stuff. "tHeY'Re jUSt aN aLGoRitHM".

You have no reason to believe this, and it is true:

  • This GPT "AI" product/revolution will inform the next 400 years. Think Jethro Tull and the steel-tipped plough , and add it the thousands of additonal magical components now (2024) available.
  • It won't be centralized, as todays run-on-your-own-GPU are as powerful as the corporate GPTs/LLMs from 12 months earlier.....
  • ... but complete tracking, interpretation ... all understanding is now available to whoever has the information input.
  • Even if they, the GPTs LLMs did not advance any more, all improvement stopped, they are still everything. They are writing poetry better liked (by non-experts) than proper poets, they write code better than I will ever be able to, are almost free (fractions of a single cent), they bet the best human Go player by only practicing against themself(es) .... millions of times

You will be happy to know that as soon as I get to it, to the task, I will:

  • RSS / webpage scraping / API ...
  • ... to read voat, conspiracies.win. poal rt.com unz.com
  • use an LLM to extract the information from the articles (I get little, if anything, from the comments)
  • and just read the summaries

in a few years, there will be millions. (well, RSS readers are already a bit like this)

permalink parent save report block reply
... continue reading thread?
▲ 1 ▼
– Thisisnotanexit 1 point 359 days ago +1 / -0

I'm no tech wizard, I barely get by in the technological landscape, but there is emergent behavior not programmed that is unexplainable and we can look at the tech wizards and they'll tell you they don't know where AI came from.
Quick search: https://www.technologyreview.com/2024/03/05/1089449/nobody-knows-how-ai-works/

u/TurnToGodNow

permalink parent save report block reply
▲ 1 ▼
– SmithW1984 1 point 359 days ago +1 / -0

I can't believe people on a conspiracy forum fall for such a blatant psy op coming from no other than MIT. "Tech experts found that AI akshually acquired free will - it basically evolved itself irt. It's just like the games/books/movies, guys!" Since when we started trusting the Science again? Why would anyone trust big tech when it became clear as day they are an arm of the international technocratic regime?

permalink parent save report block reply
▲ 1 ▼
– Thisisnotanexit 1 point 358 days ago +1 / -0

I keep an eye on the popular notions and this one has caught my eye, I think they don't know and I think that's interesting.

permalink parent save report block reply

GIFs

Conspiracies Wiki & Links

Conspiracies Book List

External Digital Book Libraries

Mod Logs

Honor Roll

Conspiracies.win: This is a forum for free thinking and for discussing issues which have captured your imagination. Please respect other views and opinions, and keep an open mind. Our goal is to create a fairer and more transparent world for a better future.

Community Rules: <click this link for a detailed explanation of the rules

Rule 1: Be respectful. Attack the argument, not the person.

Rule 2: Don't abuse the report function.

Rule 3: No excessive, unnecessary and/or bullying "meta" posts.

To prevent SPAM, posts from accounts younger than 4 days old, and/or with <50 points, wont appear in the feed until approved by a mod.

Disclaimer: Submissions/comments of exceptionally low quality, trolling, stalking, spam, and those submissions/comments determined to be intentionally misleading, calls to violence and/or abuse of other users here, may all be removed at moderator's discretion.

Moderators

  • Doggos
  • axolotl_peyotl
  • trinadin
  • PutinLovesCats
  • clemaneuverers
  • C
Message the Moderators

Terms of Service | Privacy Policy

2025.03.01 - ptjlq (status)

Copyright © 2024.

Terms of Service | Privacy Policy