Loading...
Loading...
Browse all stories on DeepNewz
VisitWhat will be the performance ranking of AI2's Molmo 72B model on the VLN benchmark by mid-2025?
Top 1 • 25%
Top 3 • 25%
Top 5 • 25%
Below Top 5 • 25%
VLN benchmark results published by AI2 or a reputable AI research organization
AI2 Launches Molmo: Open-Source AI Models in 1B, 7B, 72B Sizes, Using 1000x Less Data
Sep 25, 2024, 03:42 PM
The Allen Institute for AI (AI2) has launched Molmo, a family of open-source multimodal AI models. These models, available in 1B, 7B, and 72B-parameter sizes, are designed to outperform proprietary systems such as GPT-4V, Claude 3.5 Sonnet, and Gemini 1.5 Pro. Molmo models excel in vision and language tasks, leveraging a novel dataset called PixMo, which includes high-quality image-caption pairs and multimodal instruction data. The models are capable of rich interactions in both physical and virtual worlds, using 1000x less data compared to their closed-source counterparts. AI2's Molmo aims to democratize access to advanced AI capabilities by providing open weights and allowing researchers and developers to build upon them. Molmo also shows impressive performance on RealworldQA and OOD robotics perception tasks.
View original story
Ranked 1st • 25%
Ranked 2nd • 25%
Ranked 3rd to 5th • 25%
Ranked below 5th • 25%
Top 1 • 25%
Top 2 • 25%
Top 3 • 25%
Below Top 3 • 25%
Rank 1 • 25%
Rank 2 • 25%
Rank 3 • 25%
Rank 4 or lower • 25%
Top 1 • 25%
Top 2 to 3 • 25%
Top 4 to 5 • 25%
Below top 5 • 25%
Top 10% • 25%
Top 25% • 25%
Top 50% • 25%
Below 50% • 25%
Top-1 • 25%
Top-3 • 25%
Top-5 • 25%
Not in Top-5 • 25%
Llama 3.1 405B • 25%
GPT-4o • 25%
Claude Sonnet 3.5 • 25%
Other • 25%
Top 3 • 25%
Top 5 • 25%
Top 10 • 25%
Outside Top 10 • 25%
Claude 3.5 Sonnet • 33%
GPT-4o • 33%
Google's AI Model • 33%
Top 5 • 25%
Top 10 • 25%
Top 20 • 25%
Outside Top 20 • 25%
Yes • 50%
No • 50%
MMLU • 25%
ARC • 25%
GSM8K • 25%
None by June 30, 2024 • 25%
No • 50%
Yes • 50%
Yes • 50%
No • 50%
50,001 to 100,000 • 25%
More than 100,000 • 25%
Less than 10,000 • 25%
10,000 to 50,000 • 25%