DeepNewz Markets

Markets Stories

Search

Loading...

Browse all stories on DeepNewz

Researchers Find 99.8% Exploit in Meta's Prompt-Guard-86M AI Model

Aug 2, 2024, 03:54 PM

Researchers at robusthq have identified a significant vulnerability in Meta's recently refreshed Prompt-Guard-86M model, which is designed to protect large language models (LLMs) against jailbreaks and other adversarial examples. The exploit has a 99.8% success rate. The researchers have shared countermeasures with Meta, and the company is working on a fix. The findings were published in a blog. Additionally, a new method has been developed to enhance the security of open-source LLMs by preventing tampering, which could prevent misuse such as explaining how to make a bomb.

View original story

Markets

Loading...

Looking for markets...

Will Meta patch the vulnerability in Prompt-Guard-86M AI model by end of 2024?

Resolution / Starting Odds

Yes • 50%

No • 50%

Official announcement from Meta or robusthq blog

Will Meta release a new version of Prompt-Guard-86M AI model by end of 2024?

Resolution / Starting Odds

No • 50%

Yes • 50%

Official announcement from Meta or robusthq blog

Will the vulnerability in Meta's Prompt-Guard-86M AI model be exploited in a real-world attack by end of 2024?

Resolution / Starting Odds

No • 50%

Yes • 50%

Publicly available news reports or official statements from Meta

How will the new method to enhance the security of open-source LLMs be adopted by end of 2024?

Resolution / Starting Odds

Not adopted • 25%

Other • 25%

Widely adopted • 25%

Partially adopted • 25%

Official announcements from major open-source LLM projects or robusthq blog

What will be the impact of the vulnerability in Prompt-Guard-86M AI model on Meta's stock price by end of 2024?

Resolution / Starting Odds

No significant change • 25%

Increase • 25%

Other • 25%

Decrease • 25%

Stock market data from financial news sources

What will Meta's response be to the vulnerability in Prompt-Guard-86M AI model by end of 2024?

Resolution / Starting Odds

Other • 25%

Patch released • 25%

New model version released • 25%

No action taken • 25%

Official announcement from Meta or robusthq blog