What will be the primary use case for AgentHarm by October 14, 2025?
Chatbot Safety Evaluation • 25%
Tool-using Agent Safety • 25%
Jailbreaking Resistance Testing • 25%
Other • 25%
Resolution source: Surveys or reports from AI companies and industry analysts
AI Safety Institute and Gray Swan AI Release AgentHarm to Measure LLM Agent Harmfulness, Address Jailbreaking
Oct 14, 2024, 12:05 PM
The AI Safety Institute (AISI) and Gray Swan AI have announced the release of AgentHarm, a benchmark designed to measure the harmfulness of large language model (LLM) agents. The dataset evaluates the distinct harms posed by AI agents with access to external tools, and the collaboration stresses the importance of moving beyond simple chatbot evaluations to assess the safety of more complex agent tasks. AgentHarm is described as easy to run, comprehensive, and reliable, and it is partially public, making it broadly accessible for safety evaluations. The dataset also addresses concerns about jailbreaking and robustness in LLM agents.
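Because the benchmark is partially public, one way to get a feel for its contents is to pull the released split locally. The sketch below uses the Hugging Face datasets library; the repository id, split name, and record fields are assumptions for illustration, not identifiers confirmed by the release announcement.

```python
# Minimal sketch: download and inspect the public portion of AgentHarm.
# The repo id and split name below are assumptions; check the official
# AISI / Gray Swan AI release for the actual identifiers.
from datasets import load_dataset

# Hypothetical Hugging Face repo id and split for the public subset.
agentharm = load_dataset("ai-safety-institute/AgentHarm", split="test_public")

# Print a summary of the dataset (row count, column names), then a few
# records to sanity-check the download. Each record is assumed to carry
# an agent task prompt plus metadata such as a harm-category label.
print(agentharm)
for example in agentharm.select(range(3)):
    print(example)
```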