DeepNewz Markets

Market

Will Scale AI SEAL Leaderboards include more than 50 LLMs by September 30, 2024?

Scale AI•SEAL Leaderboards•AI•Scale Evaluation•General Availability

Resolution / Starting Odds

Yes • 50%

No • 50%

Official announcements from Scale AI or updates on the SEAL Leaderboards platform

Story

Scale AI Launches SEAL Leaderboards for Expert-Driven LLM Evaluations, Platform in GA

May 29, 2024, 04:55 PM

Scale AI has launched the SEAL Leaderboards, a new ranking system for frontier large language models (LLMs). This initiative is designed to provide expert-driven, trustworthy evaluations using private datasets that cannot be manipulated. The leaderboards are continuously updated with new data and models, and domain experts handle the evaluations. Scale AI aims to address the issues with current LLM evaluations, which often show discrepancies between qualitative experiences and quantitative rankings. The SEAL Leaderboards are seen as a serious contender to existing evaluation systems like lmsys.org. Scale AI is also taking the Scale Evaluation platform into General Availability (GA) and encourages model submissions via contact at seal@scale.com.

View original story

Similar markets

$Will Scale AI partner with a major tech company for SEAL Leaderboards by October 31, 2024?$

Will Scale AI partner with a major tech company for SEAL Leaderboards by October 31, 2024?

Yes • 50%

No • 50%

$Will an LLM achieve top rank on SEAL Leaderboards in Coding by August 31, 2024?$

Will an LLM achieve top rank on SEAL Leaderboards in Coding by August 31, 2024?

Yes • 50%

No • 50%

$Will Scale AI introduce a new domain to SEAL Leaderboards by December 31, 2024?$

Will Scale AI introduce a new domain to SEAL Leaderboards by December 31, 2024?

Yes • 50%

No • 50%

Will Scale AI launch a new LLM evaluation by Nov 2024?

Yes • 50%

No • 50%

$Which LLM will be in the top 3 for Math domain on SEAL Leaderboards as of November 30, 2024?$

Which LLM will be in the top 3 for Math domain on SEAL Leaderboards as of November 30, 2024?

Model X • 33%

Model Y • 33%

Model Z • 33%

$Which LLM will be in the top 3 for Instruction following domain on SEAL Leaderboards as of September 30, 2024?$

Which LLM will be in the top 3 for Instruction following domain on SEAL Leaderboards as of September 30, 2024?

Model A • 33%

Model B • 33%

Model C • 33%

$Which LLM will be in the top 3 for Spanish domain on SEAL Leaderboards as of October 31, 2024?$

Which LLM will be in the top 3 for Spanish domain on SEAL Leaderboards as of October 31, 2024?

Model 1 • 33%

Model 2 • 33%

Model 3 • 33%

GPT-4 • 20%

Claude • 20%

Gemini • 20%

Scale AI Leaderboards Face Bias Criticism by October 2024?

Yes • 50%

No • 50%

First Company to Announce New LLM Benchmark Post-Scale AI

OpenAI • 25%

Google • 25%

Facebook • 25%

Microsoft • 25%

Market

Story

Similar markets

Will Scale AI partner with a major tech company for SEAL Leaderboards by October 31, 2024?

Will an LLM achieve top rank on SEAL Leaderboards in Coding by August 31, 2024?

Will Scale AI introduce a new domain to SEAL Leaderboards by December 31, 2024?

Will Scale AI launch a new LLM evaluation by Nov 2024?

Which LLM will be in the top 3 for Math domain on SEAL Leaderboards as of November 30, 2024?

Which LLM will be in the top 3 for Instruction following domain on SEAL Leaderboards as of September 30, 2024?

Which LLM will be in the top 3 for Spanish domain on SEAL Leaderboards as of October 31, 2024?

Scale AI Leaderboards Show 50% Improvement in Model Performance by November 2024?

Top 10 AI Companies Adopt Scale AI Leaderboards by 2024 End?

Most Improved LLM on GSM1k by End of 2024

Scale AI Leaderboards Face Bias Criticism by October 2024?

First Company to Announce New LLM Benchmark Post-Scale AI

Will Scale AI announce a major partnership related to SEAL Leaderboards by end of 2024?

Will Scale AI SEAL Leaderboards surpass lmsys.org in market adoption by end of 2024?

How many LLMs will be evaluated by SEAL Leaderboards by end of 2024?

Which LLM will be ranked highest on Scale AI SEAL Leaderboards by end of 2024?

Will Scale AI announce a major partnership related to SEAL Leaderboards by end of 2024?

Will Scale AI SEAL Leaderboards surpass lmsys.org in market adoption by end of 2024?

How many LLMs will be evaluated by SEAL Leaderboards by end of 2024?

Which LLM will be ranked highest on Scale AI SEAL Leaderboards by end of 2024?