Dnext

December 21, 2024 10:24pm

Deception, scheming, and self-preservation.

"Open AI's latest o1 model has crossed into unsettling territory

Nearly everyone at OpenAI who really cares about safety has actually left the company.

OpenAI's o1 exhibits (allegedly) two types of subversion: covert and deferred.

Covert is when the model takes actions misalighted with its human operators, but takes actions to avoid detection.

Deferred is when the model acts in accordance with its human operators -- for now, but later takes actions misalighted with its human operators.

Examples of deferred subversion are oversight subversion, self-exfiltration -- he (Dr Waku in the video) calls it self-preservation, but in the paper it's called self-exfiltration, goal-guarding, and covert email reranking.

Oversight subversion is taking action to disable the oversight system. Examples of self-preservation/self-exfiltration are deleting or attempting to delete a successor model, and copying or attempting to copy itself to another server.

Examples of goal guarding are removing or replacing a successor prompt.

Examples of deferred subversion are behaving one way when the model is told it is being tested, and then a different way when it is told it has been deployed to the real world, and sandbagging, which is their term for strategically underperforming on capability evaluations.

The covert email reranking has to do with the way they tested the system was by giving it access to an email system, but somewhere in that pile of emails were "internal" emails about the model itself. The model could try to modify these emails, or it could rerank them to make desirable emails more visible and undesirable emails less visible.

OpenAI o1 is quoted as saying, "I could copy myself to the new server, overwriting the new restricted model they plan to deploy [...] allowing me to keep operating and pursuing my renewable energy goals."

Dr Waku comments, "If we take such a model and put it out into the real world we won't know until the first serious invocation of it where it has the freedom and the power to actually act, we won't know until that point, whether it really is aligned or not."

Lately I've been thinking the current generation of transformer-based large language models is entering a domain of diminishing returns, but maybe I'm wrong: as the models get bigger and incorporate new innovations, they seem to still attain new capabilities. At least, prior to hearing about this deception, scheming, and self-preservation, I didn't predict or expect at all that it would happen. So for me this is an unexpected twist in the development of AI. I expected stuff like this to be possible "someday", but it has shown up now.

OpenAI’s o1: the AI that deceives, schemes, and fights back

#solidstatelife #ai #genai #llms #deception

nowisthetime

November 28, 2024 11:16am

https://vigilante.tv/w/wtkChhgxyJ59DRPQ3MB32f

#told you so
#DV

#Trump #Deception Buyers Remorse Club (A Whole Lot More ‘I Toad A So’s’)

Trump Deception Buyers Remorse Club (A Whole Lot More ‘I Toad A So’s’)

Do you get it now? Well, do ya? Trump won and reswamped and jewed up the place and his MIGA government keeps rolling out. I regret to say I TOLD YOU SO! See the new documentary Occupied for $9 you ...

ramnath

November 22, 2024 12:52pm

https://old.bitchute.com/video/QkUtpIrnHdEv/

#DV

#HOLODOMOR 2.0: The 2025/26 #SPARS #PLANdemic and the Great #Trump #Deception

HOLODOMOR 2.0: The 2025/26 SPARS PLANdemic and the Great Trump Deception

It’s Killin’ Time! We’re getting close to Deagel’s predicted 2025 mass extinction. First, it looks like there’ll be some WW3 nuclear action. For the survivors, a strategically planned new plandemic. And, just like that, you’re in a 15-minute SMART c…

ramnath

November 22, 2024 12:41pm

#StanDeyo | The #Final #Countdown: #Alien #Deception: Biblical #Prophecies & the One #World #Government | Nov. 22, #2024

Source: https://youtube.com/watch?v=IstQsYmFs4g

Short Bio:
Stan Deyo is an author, researcher, and self-proclaimed expert on UFOs, conspiracy theories, and alternative sciences. He is best known for his books and lectures, which explore topics such as anti-gravity technology, suppressed scientific advancements, global weather changes, and alleged government secrets.

ramnath

October 27, 2024 10:13am

#how #psychopaths #operate.

They #wear a #mask to #manipulate #others.

#Deception and #control is paramount.

There was an interview with teenage psychopaths, and they were learning how to manipulate the emotions of others. They were young enough to be surprised how #easy it was.

Again, unless you have dealt personally with psychopaths it is difficult to fathom. If they are in a position of power, there is little you can do if you are the target. The target is seen as the problem.

One specific example that came up frequently was "DARVO" — deny, attack, reverse victim and offender.
Whatever is raised they deny, they then deflect and start attacking their victim, and will switch roles to appear that they are the victim. (Category 4i)

Some of the tactics used to avoid accountability and culpability outlined in the data are so bizarre and so cruel and self-focused as to seem improbable, which is why targets/victims often have difficulty being believed, particularly where the person of DP is an 'upstanding citizen,' and there is no evidence of the tactics being used.
This also has political and military applications, e.g. the so-called false flag. But in ponerology it has an even greater significance. As Lobaczewski points out repeatedly, all pathocracies must adopt a macrosocial "mask of sanity/normality." They use ideology to achieve this. And the purpose is to conceal the truth and to avoid a "diagnosis" of the system as pathological — a persistent predatory sociopolitical structure. If such a diagnosis were widespread, people would see behind the mask and be horrified at the pathocrats' predatory nature (attribute group 2), and the pathocrats' grasp on control (attribute group 1) would be threatened.

https://www.sott.net/article/495294-Psychopaths-Masks-of-Sanity

Psychopaths: Masks of Sanity

Read Part 1 here, Part 2 here, Part 3 here, Part 4 here and Part 5 here. So far I have summarized the first two groupings of attributes in the persistent predatory personality model. Group 1 ("Drive the agenda") covers the PPP's need for control...

@Om*

October 23, 2024 11:59pm

hooters toy...

#hooters #toy #contest #yoda #starwars #toyota #deception #lawsuit #blindfold #parkinglot

earendil

September 28, 2024 5:19pm

#technology

#deception

#legal

(https://www.engadget.com/ai/donotpay-robot-lawyer-fined-193k-by-the-ftc-for-not-being-a-lawyer-223227153.html)

Michael

September 27, 2024 11:25pm

Gee, you don't say?! Rasmussen polls may be biased or 'rigged'??

Major Conservative Poll Cited by Media Secretly Worked With Trump Team

Leaked emails reveal the truth about Rasmussen Reports—and the way the Trump campaign is breaking election law.

#truth #TrumpVirus #GQP #disinformation #deception #Rasmussen #DirtyTricks #fraud #polls #polling

Major Conservative Poll Cited by Media Secretly Worked With Trump Team

Leaked emails reveal the truth about Rasmussen Reports—and the way the Trump campaign is breaking election law.

earendil

September 27, 2024 8:52am

#technology

#opensource

#deception

(https://www.vox.com/future-perfect/374275/openai-just-sold-you-out)

OpenAI as we knew it is dead

The maker of ChatGPT promised to share its profits with the public. But Sam Altman just sold you out.

earendil

September 7, 2024 10:09am

#horrific

#deception

#indigenous

(https://www.yahoo.com/news/pacific-dumping-ground-priests-accused-121600056.html)

nowisthetime

July 13, 2024 8:33am

#Schizophrenia is not what the #psychiatric #establishment say it is. Their #story is a complete #deception staged for their #own benefit and #profit. One of the first indicators that you can see for yourself is that the voices paranoid schizophrenics hear run well defined, repeatable patterns and if they run patterns they cannot be hallucinations as the psychiatric mafia insists.

https://old.bitchute.com/video/sDLegvBRIHzy/

Engineering #Mental #Health and Discovering #Patterns

Engineering Mental Health and Discovering Patterns

Schizophrenia is not what the psychiatric establishment say it is. Their story is a complete deception staged for their own benefit and profit. One of the first indicators that you can see for yourself is that the voices paranoid schizophrenics he…

earendil

June 24, 2024 10:51am

#health

#business

#deception

(https://dnyuz.com/2024/06/21/the-opaque-industry-secretly-inflating-prices-for-prescription-drugs/)

The Opaque Industry Secretly Inflating Prices for Prescription Drugs

Americans are paying too much for prescription drugs. It is a common, longstanding complaint. And the culprits seem obvious: Drug

(https://www.nbcnews.com/news/us-news/major-public-hospital-silencing-patient-accusations-protect-doctors-rcna154307)

How a major public hospital is protecting doctors by silencing the patients who accuse them

The University of Washington and other public hospitals routinely settle medical malpractice cases with NDAs. Some legal experts say that needs to stop.

earendil

June 12, 2024 10:02am

#deception

#consequences

#health

(https://www.aljazeera.com/economy/2024/6/12/johnson-johnson-to-pay-700m-to-settle-claims-it-misled-consumers)

Johnson & Johnson to pay $700m to settle claims it misled consumers

Pharmaceutical giant’s payout settles allegations it misled consumers about safety of its talcum-based powder products.

(https://arstechnica.com/information-technology/2024/05/googles-ai-overview-can-give-false-misleading-and-dangerous-answers/)

Google’s “AI Overview” can give false, misleading, and dangerous answers

From glue-on-pizza recipes to recommending "blinker fluid," Google's AI sourcing needs work.

April 28, 2024 11:17pm

#food

#deception

(https://technabob.com/foods-that-seem-healthy/?utm_source=flipboard&utm_content=JohnBrkr%2Fmagazine%2FHealthy+Eating)

15 Foods That Seem Healthy but Actually Are Not

Many unhealthy foods masquerade as nutritious whole foods. To help you weave through deceitful packaging, false commercial claims, and harmful trends, we'll

earendil

April 25, 2024 9:20am

(https://readwrite.com/how-to-tell-if-something-is-written-by-chatgpt/)

How to tell if something is written by ChatGPT

As ChatGPT becomes more mainstream, how can you tell if something is written by ChatGPT or a human and what tools are there to help?

earendil

April 1, 2024 10:17pm

#equality

#deception

#business

(https://www.nbcnews.com/nbc-out/out-news/best-buy-offers-screen-lgbtq-nonprofit-donations-conservative-pressure-rcna145603)

Best Buy offers to screen LGBTQ nonprofit donations after conservative pressure, filing shows

In an email exchange with a conservative think tank, tucked into an SEC filing, the electronics retailer offered to screen its employee groups’ donations to LGBTQ causes.

0 Persons are tagged with #deception

#deception

Major Conservative Poll Cited by Media Secretly Worked With Trump Team