Loading...
Loading...
Browse all stories on DeepNewz
VisitWill Scale AI SEAL Leaderboards include more than 50 LLMs by September 30, 2024?
Yes • 50%
No • 50%
Official announcements from Scale AI or updates on the SEAL Leaderboards platform
Scale AI Launches SEAL Leaderboards for Expert-Driven LLM Evaluations, Platform in GA
May 29, 2024, 04:55 PM
Scale AI has launched the SEAL Leaderboards, a new ranking system for frontier large language models (LLMs). This initiative is designed to provide expert-driven, trustworthy evaluations using private datasets that cannot be manipulated. The leaderboards are continuously updated with new data and models, and domain experts handle the evaluations. Scale AI aims to address the issues with current LLM evaluations, which often show discrepancies between qualitative experiences and quantitative rankings. The SEAL Leaderboards are seen as a serious contender to existing evaluation systems like lmsys.org. Scale AI is also taking the Scale Evaluation platform into General Availability (GA) and encourages model submissions via contact at seal@scale.com.
View original story
Yes • 50%
No • 50%
Model X • 33%
Model Y • 33%
Model Z • 33%
Model A • 33%
Model B • 33%
Model C • 33%
Model 1 • 33%
Model 2 • 33%
Model 3 • 33%
OpenAI • 25%
Google • 25%
Facebook • 25%
Microsoft • 25%
No • 50%
Yes • 50%
No • 50%
Yes • 50%
Less than 20 • 33%
More than 40 • 33%
20-40 • 33%
PaLM 2 • 25%
LLaMA • 25%
Claude • 25%
GPT-4 • 25%