China's StepFun Unveils Step-2, Trillion-Parameter MoE Model Developed in Two Months, Ranks Fifth Globally
Nov 21, 2024, 05:00 AM
Chinese artificial intelligence startup StepFun has developed Step-2, a trillion-parameter Mixture of Experts (MoE) language model with a 16k context length that ranks fifth globally on LiveBench evaluations. The model has surpassed GPT-4o and trails only o1-mini, demonstrating China's rapid progress in AI despite GPU export restrictions. Remarkably, Step-2 was developed in just two months and trained at a fraction of the cost of OpenAI's GPT-4, with an estimated expenditure of $3 million versus GPT-4's $80–$100 million. The emergence of Step-2, China's top-performing large language model, highlights the country's significant investments in AI and its growing competitiveness in the field.