Which AI model will achieve the highest score on the MMLU Social Sciences benchmark by the end of 2024?
Llama 3.1 405B • 25%
GPT-4o • 25%
Claude 3.5 Sonnet • 25%
Other • 25%
Official MMLU Social Sciences benchmark results
Meta Releases Llama 3.1 405B on July 23, 2024, the First Open-Source AI Model to Rival GPT-4o
Jul 23, 2024, 03:54 PM
Meta officially released its Llama 3.1 model family, including the 405B-parameter version, on July 23, 2024. Llama 3.1 405B is the first open-source AI model to rival top proprietary models such as OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet across several benchmarks. Leaked benchmarks indicate that it outperforms GPT-4o in many areas, although it falls short on specific benchmarks such as human_eval and mmlu_social_sciences.

The model was trained on 15 trillion tokens and fine-tuned with public datasets and 15 million synthetic samples; training required 30.84 million GPU hours. It supports a 128K context length and offers multilingual capabilities. Improved 70B and 8B parameter versions were released alongside it.

Meta's CEO, Mark Zuckerberg, called the release of Llama 3.1 a significant moment in AI history and argued for the benefits of open-source AI. The model is available through various platforms, including Groq, and is expected to have a major impact on the AI landscape.