Which major AI conference will first feature AgentHarm by end of 2025?
NeurIPS 2024 • 25%
ICML 2025 • 25%
AAAI 2025 • 25%
Other • 25%
Conference agendas and presentations
AI Safety Institute and Gray Swan AI Release AgentHarm to Measure LLM Agent Harmfulness, Address Jailbreaking
Oct 14, 2024, 12:05 PM
The AI Safety Institute (AISI) and Gray Swan AI have announced the release of AgentHarm, a benchmark designed to measure the harmfulness of large language model (LLM) agents. The dataset evaluates the distinct harms posed by AI agents that have access to external tools, reflecting the collaborators' emphasis on moving beyond simple chatbot evaluations to assess the safety of more complex agent tasks. AgentHarm is described as easy to run, comprehensive, and reliable, and it is partly public, making it broadly accessible for safety evaluations. The dataset also addresses concerns about jailbreaking and robustness in LLM agents.