DeepNewz Markets

Market

Scale AI Leaderboards Face Bias Criticism by October 2024?

Resolution / Starting Odds

Yes • 50%

No • 50%

Media reports, studies, or public statements from AI ethics experts

Story

Scale AI Launches First LLM Leaderboards with Private Datasets and Paid Annotators

May 29, 2024, 05:41 PM

Scale AI has launched its first LLM Leaderboards, which rank AI model performance across specific domains. The initiative targets significant problems with current evaluation methods, such as contaminated evaluation sets and inconsistent rater quality. The leaderboards use private datasets that cannot be trained on, paid annotators, and human expert evaluations to keep results fair and high quality. To protect leaderboard integrity, a model can be featured only the first time its organization encounters the prompts. Experts have widely praised the move as a crucial step toward improving model evaluation, highlighting its potential to enhance the integrity and utility of AI benchmarks and to provide a trusted resource for assessing AI models.

Similar markets

Will Scale AI have a more diverse workforce by end of 2024?

Yes • 50%

No • 50%

Public sentiment on AI ethics post Johansson accusation by 2024

Significantly more negative • 25%

Somewhat more negative • 25%

No significant change • 25%

More positive (support for AI innovation) • 25%

Stricter AI Content Policies Recommendation by Meta's Oversight Board in 2024

Yes • 50%

No • 50%

Backlash Against AI-Generated Ads by End of 2024?

Yes • 50%

No • 50%

AI Overview negative reviews drop below 10% by end of 2024?

Yes • 50%

No • 50%

Will Scale AI introduce a new domain to SEAL Leaderboards by December 31, 2024?

Yes • 50%

No • 50%

Social Media AI Ethics Guidelines Implemented by Mid-2025?

Yes • 50%

No • 50%

OpenAI acknowledges developer concerns by 2024?

Yes • 50%

No • 50%

LinkedIn faces backlash over AI tools by end of 2024?

Yes • 50%

No • 50%

OpenAI introduces new ethical guidelines by end of 2024?

Yes • 50%

No • 50%

Will Google face public backlash for ads in AI-generated search answers by end of 2024?

Yes • 50%

No • 50%

Public criticism by former OpenAI employee by end of November 2024?

Yes • 50%

No • 50%

Markets based on same story

Scale AI Leaderboards Show 50% Improvement in Model Performance by November 2024?

Yes • 50%

No • 50%

Top 10 AI Companies Adopt Scale AI Leaderboards by 2024 End?

Yes • 50%

No • 50%

Dominant Domain of AI Models in Scale AI Leaderboards in 2024

Natural Language Processing • 25%

Computer Vision • 25%

Healthcare • 25%

Finance • 25%

Most Common Use Case for AI Models in Scale AI Leaderboards in 2024

Medical Diagnosis • 25%

Customer Service • 25%

Content Generation • 25%

Fraud Detection • 25%