Top Performing AI Model on Scale AI Leaderboards in 2024
OpenAI • 25%
Google DeepMind • 25%
Anthropic • 25%
Microsoft • 25%
Scale AI Leaderboards official rankings
Scale AI Launches First LLM Leaderboards with Private Datasets and Paid Annotators
May 29, 2024, 05:41 PM
Scale AI has launched its first LLM Leaderboards, which rank AI model performance across specific domains. The initiative addresses significant issues in current evaluation methods, such as contaminated evaluation sets and inconsistent rater quality. The leaderboards use private datasets that models cannot be trained on and rely on paid annotators, alongside human expert evaluations, to keep the evaluations fair and high quality. To protect leaderboard integrity, a model can be featured only the first time its organization encounters the prompts. The effort is seen as a crucial step toward improving the evaluation field by providing a trusted resource for assessing AI models. Experts have widely praised the move, highlighting its potential to enhance the integrity and utility of AI benchmarks.