DeepNewz Markets

Market

Will AgentHarm be updated with new safety metrics by June 30, 2025?

AI Safety Institute • AISI • Gray Swan AI • AgentHarm

Resolution / Starting Odds

Yes • 50%

No • 50%

Resolution source: announcements from the AI Safety Institute or Gray Swan AI

Story

AI Safety Institute and Gray Swan AI Release AgentHarm to Measure LLM Agent Harmfulness, Address Jailbreaking

Oct 14, 2024, 12:05 PM

The AI Safety Institute (AISI) and Gray Swan AI have released AgentHarm, a benchmark for measuring the harmfulness of large language model (LLM) agents. The dataset evaluates the harms specific to AI agents that have access to external tools. The collaborators emphasize moving beyond simple chatbot evaluations to assess the safety of more complex agent tasks. They describe AgentHarm as easy to run, comprehensive, and reliable, and part of it is public so that more groups can use it for safety evaluations. The dataset also addresses concerns about jailbreaking and robustness in LLM agents.

Similar markets

Will AgentHarm dataset receive a major update by June 30, 2025?

Yes • 50%

No • 50%

Will AgentHarm be adopted as a standard benchmark by three major AI companies by March 31, 2025?

Yes • 50%

No • 50%

Which AI company will first integrate AgentHarm into safety evaluations by July 31, 2025?

OpenAI • 25%

Google DeepMind • 25%

Anthropic • 25%

Other • 25%

Will a significant vulnerability be discovered in AgentHarm's methodology by December 31, 2024?

Yes • 50%

No • 50%

First sector to report significant impact from AgentHarm by May 31, 2025?

Healthcare • 25%

Finance • 25%

Technology • 25%

Other • 25%

Will Anthropic implement new safety measures for Claude AI by June 30, 2025?

Yes • 50%

No • 50%

Will Character AI implement new safety features by March 31, 2025?

Yes • 50%

No • 50%

Will Character AI introduce new safety features by March 31, 2025?

Yes • 50%

No • 50%

What type of new AI safety metric will be developed by December 31, 2024?

Fairness metric • 25%

Robustness metric • 25%

Transparency metric • 25%

Other metric • 25%

Will a new AI safety guideline be released by December 31, 2024?

Yes • 50%

No • 50%

Timeline for Character AI's next major safety update by October 2025

By Q1 2025 • 25%

By Q2 2025 • 25%

By Q3 2025 • 25%

By Q4 2025 • 25%

Will Anthropic announce a new AI model as safe for public release after U.S. government evaluation by June 30, 2025?

Yes • 50%

No • 50%

Markets based on same story

Will AgentHarm be adopted by a major AI company for internal evaluations by March 31, 2025?

No • 50%

Yes • 50%

Will AgentHarm be a standard benchmark in AI safety research by end of 2025?

No • 50%

Yes • 50%

What will be the primary use case for AgentHarm by October 14, 2025?

Chatbot Safety Evaluation • 25%

Other • 25%

Jailbreaking Resistance Testing • 25%

Tool-using Agent Safety • 25%

Which major AI conference will first feature AgentHarm by end of 2025?

ICML 2025 • 25%

Other • 25%

NeurIPS 2024 • 25%

AAAI 2025 • 25%