Meta Boosts AI Model Accuracy with New Iterative RPO Method
May 1, 2024, 02:17 AM
Meta has recently developed and applied a new method called Iterative Reasoning Preference Optimization (Iterative RPO) to enhance the reasoning capabilities of its AI models, specifically the Llama-2-70B-Chat. This method involves generating chain-of-thought candidates with a large language model, constructing preference pairs based on the correctness of answers, and training the model accordingly. Significant improvements were noted in model accuracy across various benchmarks: GSM8K (from 55.6% to 81.6%), MATH (from 12.5% to 20.8%), and ARC-Challenge (from 77.8% to 86.7%). Additionally, the LLM2Vec approach was applied to the Meta-Llama-3-8B model, enhancing its performance on embedding tasks.
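The preference-pair construction at the heart of Iterative RPO can be sketched as follows. This is a minimal illustration, not Meta's implementation: `generate_candidates` is a toy stand-in for sampling chain-of-thought completions from an LLM, and all names and the example data are hypothetical.

```python
import itertools

def generate_candidates(question, n=4):
    """Toy stand-in for sampling n chain-of-thought candidates from an LLM.

    Each candidate is a (reasoning, final_answer) tuple. A real pipeline
    would sample these from the model being trained.
    """
    return [
        ("3 * 4 = 12, so the answer is 12", "12"),
        ("3 + 4 = 7, so the answer is 7", "7"),
        ("Multiplying 3 by 4 gives 12", "12"),
        ("I think it's 10", "10"),
    ][:n]

def build_preference_pairs(question, gold_answer, n=4):
    """Pair each correct chain-of-thought (chosen) with each incorrect
    one (rejected), based on whether the final answer matches the gold
    label -- the correctness signal Iterative RPO uses to form pairs.
    """
    candidates = generate_candidates(question, n)
    correct = [c for c in candidates if c[1] == gold_answer]
    incorrect = [c for c in candidates if c[1] != gold_answer]
    return [
        {"prompt": question, "chosen": win[0], "rejected": lose[0]}
        for win, lose in itertools.product(correct, incorrect)
    ]

pairs = build_preference_pairs("What is 3 * 4?", "12")
print(len(pairs))  # 2 correct x 2 incorrect -> 4 pairs
```

In the full method, these pairs feed a preference-optimization training step, and the generate-pair-train loop is repeated over several iterations, with each round's model producing the next round's candidates.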