Which proprietary model will Molmo outperform next in a publicly available benchmark test by June 30, 2025?
GPT-4V • 25%
Claude 3.5 Sonnet • 25%
Gemini 1.5 Flash • 25%
Other • 25%
Resolution source: results from publicly available AI benchmark tests
AI2 Releases Molmo, Open-Source AI Model in 1B, 7B, and 72B Sizes Outperforming Proprietary Systems
Sep 25, 2024, 01:50 PM
The Allen Institute for AI (AI2) has released the Multimodal Open Language Model (Molmo), a state-of-the-art open-source AI model. Molmo is available in 1B, 7B, and 72B-parameter sizes and has been shown to outperform proprietary models such as GPT-4V, Claude 3.5 Sonnet, and Gemini 1.5 Flash on public benchmarks. Its performance is attributed to a focus on data quality over quantity: the models were trained on PixMo, a meticulously curated dataset assembled over nine months. Molmo can understand and act on multimodal data, enabling rich interactions in both physical and virtual worlds. The release comprises four model checkpoints, including MolmoE-1B and Molmo-7B-O, making Molmo the most capable open-source AI model to date; in human-preference evaluations, the 72B model is on par with top API models.