Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich company will lead in AI model performance benchmarks by the end of 2024?
Nvidia • 25%
OpenAI • 25%
Anthropic • 25%
Other • 25%
Performance results from major AI benchmarks like Arena Hard, AlpacaEval 2, and MT-Bench
Nvidia's Nemotron 70B Surpasses ChatGPT4o and Sonnet 3.5 in Benchmarks, Now Available on Hugging Face MLX
Oct 16, 2024, 02:45 PM
Nvidia has introduced Nemotron 70B, an open-source AI language model that reportedly surpasses OpenAI's ChatGPT4o and Anthropic's Sonnet 3.5 in various performance benchmarks. Early assessments indicate that Nemotron 70B achieved higher scores on the Arena Hard benchmark (85.0) compared to ChatGPT3.5 (79.2) and GPT-4o (79.3), as well as outperforming them on AlpacaEval 2 and MT-Bench evaluations. The model is now accessible through the Hugging Face MLX community, allowing users to experience its capabilities firsthand. This development underscores Nvidia's growing influence in the AI sector and could reshape the competitive landscape dominated by established technology giants.
View original story
Google • 25%
OpenAI • 25%
Microsoft • 25%
Other • 25%
DeepSeek • 25%
OpenAI • 25%
Google DeepMind • 25%
Other • 25%
Meta • 25%
OpenAI • 25%
Anthropic • 25%
Other • 25%
Meta (Llama 3) • 25%
OpenAI (GPT-4o) • 25%
Anthropic (Claude 3.5 Sonnet) • 25%
Other • 25%
OpenAI • 25%
Google • 25%
Meta • 25%
Other • 25%
Llama 3.1 405B • 25%
GPT-4o • 25%
Claude Sonnet 3.5 • 25%
Other • 25%
Meta leads • 33%
OpenAI leads • 33%
Google leads • 33%
ChatGPT-4o • 25%
Google's Gemini • 25%
Another AI model • 25%
No clear leader • 25%
Claude 3.5 Sonnet • 33%
GPT-4o • 33%
Google's AI Model • 33%
Amazon • 25%
Nvidia • 25%
Microsoft • 25%
Alphabet • 25%
Meta's Llama 3.1-70B • 25%
OpenAI's GPT-4 • 25%
Google's Bard • 25%
Other • 25%
xAI • 25%
Google DeepMind • 25%
OpenAI • 25%
Nvidia • 25%
No • 50%
Yes • 50%
Yes • 50%
No • 50%
Nemotron 70B • 25%
Other • 25%
Sonnet 3.5 • 25%
ChatGPT4o • 25%