Loading...
Loading...
Browse all stories on DeepNewz
VisitWhat will be the average MMLU score for NVIDIA's Mistral-NeMo-Minitron 8B model in 2024?
Below 69.0 • 25%
69.0 to 69.4 • 25%
69.5 to 69.9 • 25%
70.0 or higher • 25%
Benchmark results published by NVIDIA or independent benchmarking organizations
NVIDIA's Mistral-NeMo-Minitron 8B Model Achieves High Accuracy
Aug 21, 2024, 04:08 PM
NVIDIA has announced the release of the Mistral-NeMo-Minitron 8B, a small language model derived from the Mistral NeMo 12B model through pruning and distillation techniques. This new model is designed to offer state-of-the-art accuracy while being more computationally efficient. The Mistral-NeMo-Minitron 8B achieves high accuracy in nine popular benchmarks, including those for chatbots, virtual assistants, content generation, and coding. The model was developed using 400 billion tokens for distillation, significantly reducing the computational cost compared to training from scratch. NVIDIA's innovation in pruning and distillation has enabled the creation of a commercially permissive model that maintains high performance levels. The model achieved an MMLU score of 69.5.
View original story
1B • 25%
3B • 25%
11B • 25%
90B • 25%
Yes • 50%
No • 50%
GPT-4 Turbo • 25%
Claude 3 Opus • 25%
Llama 3.1 405B • 25%
Other • 25%
Yes • 50%
No • 50%
Apple's 7B AI model • 25%
Mistral 7B • 25%
Llama 3 8B • 25%
Google's Gemma • 25%
Above 40% • 25%
30-40% • 25%
20-30% • 25%
Below 20% • 25%
Above 90 • 25%
87-90 • 25%
84-86 • 25%
Below 84 • 25%
DeepSeek-R1-Lite-Preview • 25%
OpenAI's o1-preview • 25%
Google DeepMind's model • 25%
Other • 25%
Llama 3.1 405B • 25%
GPT-4o • 25%
Claude Sonnet 3.5 • 25%
Other • 25%
Top 1 • 25%
Top 3 • 25%
Top 5 • 25%
Below Top 5 • 25%
Coding • 25%
Hard Prompts • 25%
Math • 25%
Longer Queries • 25%
Top 1 • 25%
Top 5 • 25%
Top 10 • 25%
Below Top 10 • 25%
No • 50%
Yes • 50%
Yes • 50%
No • 50%
No • 50%
Yes • 50%
Coding • 25%
Chatbots • 25%
Virtual assistants • 25%
Content generation • 25%