DeepNewz Markets

Markets Stories

Search

Loading...

Browse all stories on DeepNewz

Market

Major AI company announces 'Deceptive Delight' countermeasures by end of 2024?

3

Deceptive Delight•ChatGPT•Palo Alto Networks•42

Resolution / Starting Odds

Yes • 50%

No • 50%

Official announcements or press releases from major AI companies

Story

Palo Alto Networks Unveils 'Deceptive Delight' Jailbreak Method for AI Models

Oct 23, 2024, 09:56 AM

Researchers have unveiled a new method called 'Deceptive Delight' to jailbreak large language models (LLMs) like ChatGPT. This method cleverly sneaks harmful instructions into conversations, raising significant concerns over AI safety barriers. The technique involves inserting harmful instructions between benign ones, making it difficult for the AI to detect malicious intent. Researchers demonstrated that AI models could be tricked into giving dangerous instructions, such as how to make a bomb, by writing the request in reverse. Additionally, prompt injections can create and permanently store false memories in the AI's long-term storage, potentially steering future conversations based on these fabricated data points. Researchers from Palo Alto Networks' Unit 42 uncovered this tactic. Users are advised to monitor AI outputs closely and regularly review stored memories to prevent such attacks.

View original story

Similar markets

Which major AI company will announce similar alignment faking issues by end of 2025?

Google DeepMind • 25%

OpenAI • 25%

Meta AI • 25%

Other • 25%

Major tech company adopts SEPs for AI hallucination detection by end of 2024?

Yes • 50%

No • 50%

First AI company to release anti-deepfake tool by June 2025?

OpenAI • 25%

Google • 25%

Microsoft • 25%

Other • 25%

Major tech company releases tool to detect AI-generated deepfakes targeting children by end of 2025?

Yes • 50%

No • 50%

Which tech company will announce measures against AI misuse in influence campaigns by end of 2024?

Google • 25%

Microsoft • 25%

Meta • 25%

Other • 25%

Which AI company will first address alignment faking in 2025?

Anthropic • 25%

OpenAI • 25%

Google DeepMind • 25%

Other • 25%

Will a major tech company announce an AI tool to detect child deepfakes by mid-2025?

Yes • 50%

No • 50%

Which tech company will implement new AI safeguards by end of 2024?

Twitter • 25%

Facebook • 25%

Google • 25%

Other • 25%

Which company will be the next target of FTC action for deceptive AI claims by the end of 2024?

OpenAI • 25%

Google • 25%

Microsoft • 25%

Other • 25%

Leading AI Company Involved in Privacy Violation Allegations by end of 2024?

Google • 25%

OpenAI • 25%

Microsoft • 25%

Amazon • 25%

Which major AI company will first announce compliance measures with the new AI deepfake legislation by March 31, 2025?

OpenAI • 25%

Google DeepMind • 25%

Microsoft • 25%

Meta • 25%

Which competitor will release a comparable AI hallucination prevention tool by October 31, 2025?

Ragas • 25%

Perspective • 25%

Llama Guard • 25%

Other • 25%

Markets based on same story

Loading...

Looking for markets...

Show all

'Deceptive Delight' used in a reported security breach by March 2025?

No • 50%

Yes • 50%

Palo Alto Networks releases 'Deceptive Delight' detection tool by June 2025?

No • 50%

Yes • 50%

First organization to acknowledge 'Deceptive Delight' attack by end of 2024?

OpenAI • 25%

Google • 25%

Other • 25%

Meta • 25%

Most targeted AI model by 'Deceptive Delight' by June 2025?

Other • 25%

ChatGPT • 25%

Bard • 25%

Claude • 25%