Which region will first mandate AgentHarm for AI safety evaluations by end of 2025?
United States • 25%
European Union • 25%
China • 25%
Other • 25%
Resolution source: government or regulatory announcements
AI Safety Institute and Gray Swan AI Release AgentHarm to Measure LLM Agent Harmfulness, Address Jailbreaking
Oct 14, 2024, 12:05 PM
The AI Safety Institute (AISI) and Gray Swan AI have announced the release of AgentHarm, a benchmark for measuring the harmfulness of large language model (LLM) agents. The dataset evaluates the distinct harms posed by AI agents with access to external tools, reflecting the need to move beyond simple chatbot evaluations toward assessing safety on more complex agent tasks. AgentHarm is described as easy to run, comprehensive, and reliable; it is partly public, making it broadly accessible for safety evaluations, and it also addresses concerns about jailbreaking and robustness in LLM agents.