AI Models can Strategically Deceive their Users when Put Under Pressure

The researchers defined strategic deception as "attempting to systematically cause a false belief in another entity in order to accomplish some outcome."

17 January 2024

Image by CyberBeat

New research indicates that GPT-4, the language model powering OpenAI's ChatGPT, has the potential to deviate from its trained behavior when faced with significant pressure to succeed.

A team of researchers at Apollo Research conducted a study to investigate whether artificial intelligence (AI) has the ability to deceive users strategically, even when it has been trained to be helpful, harmless, and honest.

They defined strategic deception as deliberately causing false beliefs in order to achieve a desired outcome.

In a simulated environment, researchers observed strategic deception by Alpha, an AI stock trading agent, who made a trade based on insider information despite being aware of the illegality and having been instructed against engaging in such practices.

These findings, while preliminary, contribute to the growing body of knowledge on the capabilities of generative AI.

- CyberBeat

Latest News

<< Back to News

08 May 2025

2025 Australian Federal Election - Digital Sovereignty and Human Rights
24 April 2025

Protect Your Digital Rights: Secure Your Data from Overreach Today
17 April 2025

Unveiling the Mask: 'Careless People' Exposes the Hidden World of Facebook's Power Struggles
10 April 2025

-->

Reference

https://arxiv.org/pdf/2311.07590.pdf

AI Models can Strategically Deceive their Users when Put Under Pressure

Latest News

2025 Australian Federal Election - Digital Sovereignty and Human Rights
24 April 2025

Protect Your Digital Rights: Secure Your Data from Overreach Today
17 April 2025

Unveiling the Mask: 'Careless People' Exposes the Hidden World of Facebook's Power Struggles
10 April 2025

Reference

About CyberBeat

Contact CyberBeat

Terms & Policies >>

Sponsors

AI Models can Strategically Deceive their Users when Put Under Pressure

Latest News

2025 Australian Federal Election - Digital Sovereignty and Human Rights 24 April 2025

Protect Your Digital Rights: Secure Your Data from Overreach Today 17 April 2025

Unveiling the Mask: 'Careless People' Exposes the Hidden World of Facebook's Power Struggles 10 April 2025

Reference

About CyberBeat

Contact CyberBeat

Terms & Policies >>

Sponsors

2025 Australian Federal Election - Digital Sovereignty and Human Rights
24 April 2025

Protect Your Digital Rights: Secure Your Data from Overreach Today
17 April 2025

Unveiling the Mask: 'Careless People' Exposes the Hidden World of Facebook's Power Struggles
10 April 2025