DeepNewz Markets

Markets Stories

Search

Loading...

Browse all stories on DeepNewz

Market

Which AI model will achieve highest score in MMLU Social Sciences benchmark by end of 2024?

2

Meta•Llama•OpenAI•Anthropic•Claude Sonnet•Mark Zuckerberg•Groq

Resolution / Starting Odds

Llama 3.1 405B • 25%

GPT-4o • 25%

Claude Sonnet 3.5 • 25%

Other • 25%

Official MMLU Social Sciences benchmark results

Story

Meta Releases Llama 3.1 405B, First Open-Source AI Model to Rival GPT-4o on July 23, 2024

Jul 23, 2024, 03:54 PM

Meta has officially released its highly anticipated Llama 3.1 model, including the 405B parameter version, on July 23, 2024. This model is the first open-source AI model to rival top proprietary models such as OpenAI's GPT-4o and Anthropic's Claude Sonnet 3.5 across several benchmarks. Leaked benchmarks indicate that Llama 3.1 405B outperforms GPT-4o in many areas, although it falls short in specific benchmarks like human_eval and mmlu_social_sciences. The model was trained on 15 trillion tokens and fine-tuned with public datasets and 15 million synthetic samples, requiring 30.84 million GPU hours. It supports a 128K context length and offers multilingual capabilities. Meta's CEO, Mark Zuckerberg, emphasized that the release of Llama 3.1 marks a significant moment in AI history, advocating for the benefits of open-source AI. The model is available through various platforms, including Groq, and is expected to be a game-changer in the AI landscape. Additionally, improved 70B and 8B parameter versions were also released.

View original story

Similar markets

Which open-source AI model will have the highest MMLU score by December 31, 2024?

Apple's 7B AI model • 25%

Mistral 7B • 25%

Llama 3 8B • 25%

Google's Gemma • 25%

Which AI model will achieve the highest score in the MATH benchmark by end of 2024?

DeepSeek-R1-Lite-Preview • 25%

OpenAI's o1-preview • 25%

Google DeepMind's model • 25%

Other • 25%

Which AI model will achieve the highest performance benchmark by December 31, 2024?

Meta's Llama 3.1-70B • 25%

OpenAI's GPT-4 • 25%

Google's Bard • 25%

Other • 25%

Which AI model will have the highest score on the MATH dataset by the end of 2025?

OpenAI's model • 25%

Deepseek-Math (7B) • 25%

Gemini 1.5 Pro (May) • 25%

GPT-4o • 25%

Which AI model will be top-performing in benchmarks by end of 2024?

Llama 3.1 405B • 25%

GPT-4o • 25%

Claude Sonnet 3.5 • 25%

Other • 25%

Which AI model will have the best performance in public benchmarks by end of 2024?

Claude 3.5 Sonnet • 33%

GPT-4o • 33%

Google's AI Model • 33%

Which AI model will be rated highest in performance by June 30, 2025?

Google's Gemini • 25%

OpenAI's GPT • 25%

Microsoft's Azure AI • 25%

Other • 25%

Which AI model will be rated as the best performer by December 31, 2024?

OpenAI's O1 model • 25%

GPT-4 • 25%

Gemini • 25%

Anthropic's Claude • 25%

Which AI model will be the best performing in 2024 benchmarks?

Claude 3.5 Sonnet • 33%

GPT-4o • 33%

Gemini • 34%

Which model will have the highest citation F1 score in a major independent benchmark by end of 2024?

ContextCite • 25%

LongCite-8B • 25%

LongCite-9B • 25%

GPT-4o • 25%

Which AI model will have the highest lm-sys Elo score by the end of 2024?

OpenAI o1 • 25%

GPT-4 • 25%

Gemini • 25%

Claude • 25%

Which AI model will be the top performer in RewardBench by end of 2024?

Llama 3-70B • 25%

GPT-4 • 25%

Claude 2.0 • 25%

Other • 25%

Markets based on same story

Loading...

Looking for markets...

Show all

Will Llama 3.1 405B achieve over 1 million downloads by end of 2024?

Yes • 50%

No • 50%

Will Llama 3.1 405B be integrated into more than 10 major platforms by end of 2024?

Yes • 50%

No • 50%

Will Llama 3.1 405B surpass GPT-4o in OpenAI benchmarks by end of 2024?

No • 50%

Yes • 50%

Which AI model will be the top performer in MLPerf benchmark by end of 2024?

Other • 25%

Llama 3.1 405B • 25%

GPT-4o • 25%

Claude Sonnet 3.5 • 25%