DeepNewz Markets

Market

What will be the next major milestone in LLM self-correction achieved by Google DeepMind by the end of 2024?

Google DeepMind•Hacker News

Resolution / Starting Odds

25% gain in MATH • 25%

15% gain in other domains • 25%

Integration into multiple commercial products • 25%

Other • 25%

Published research papers or official announcements from Google DeepMind

Story

$Google DeepMind's SCoRe Achieves 15.6% Gain in LLM Self-Correction for MATH$

Google DeepMind's SCoRe Achieves 15.6% Gain in LLM Self-Correction for MATH

Sep 20, 2024, 05:06 PM

Google DeepMind has developed a multi-turn online reinforcement learning (RL) approach to improve the self-correction capabilities of large language models (LLMs). The new method, named SCoRe, utilizes entirely self-generated data and achieves state-of-the-art performance in self-correction. This approach addresses the limitations of supervised fine-tuning (SFT), which has been found ineffective for self-correction due to a distribution mismatch. The research, titled 'Training Language Models to Self-Correct via Reinforcement Learning,' has gained significant attention, including being highlighted on Hacker News for AI papers. SCoRe achieved a 15.6% gain on self-correction for reasoning problems from MATH and a 9.1% improvement overall.

View original story

Similar markets

Will Google DeepMind's new LLM approach achieve significant benchmark improvement by end of 2024?

Yes • 50%

No • 50%

What will be the next significant milestone achieved by Google DeepMind's SCoRe approach by December 31, 2024?

25% gain in self-correction for MATH dataset • 25%

15% gain in self-correction for other datasets • 25%

Adoption by three major tech companies • 25%

Other • 25%

Which benchmark will Google DeepMind's new LLM approach top first by end of 2024?

GLUE • 25%

SuperGLUE • 25%

SQuAD • 25%

Other • 25%

What will be DeepMind's next major AI advancement announcement by end of 2024?

New AI system for medical research • 25%

New AI system for financial modeling • 25%

New AI system for climate modeling • 25%

Other • 25%

$What will be the next major achievement by Google DeepMind AI in a competitive setting by December 31, 2024?$

What will be the next major achievement by Google DeepMind AI in a competitive setting by December 31, 2024?

Gold medal at 2025 IMO • 25%

Winning a Kaggle competition • 25%

Breakthrough in protein folding • 25%

Other • 25%

What will be the next major competition for DeepMind's AI by end of 2024?

International Mathematical Olympiad • 25%

Kaggle Competition • 25%

DARPA Challenge • 25%

Other • 25%

Which Google product will first integrate DeepMind's new LLM approach by end of 2024?

Google Search • 25%

Google Assistant • 25%

Google Cloud AI • 25%

Other • 25%

Level 4: Innovators • 25%

Level 5: Organizations • 25%

What will be Google DeepMind's table tennis robot's next major achievement by December 31, 2024?

Wins a match against a professional player • 25%

Achieves 70% win rate against intermediate players • 25%

Participates in a national tournament • 25%

Other • 25%

What significant milestone will OpenAI's AI achieve by the end of 2024?

Surpasses human performance in a specific task • 25%

Achieves a breakthrough in natural language processing • 25%

Reaches a new level of general AI • 25%

Other • 25%

Market

Story

Similar markets

Will Google DeepMind's new LLM approach achieve significant benchmark improvement by end of 2024?

What will be the next significant milestone achieved by Google DeepMind's SCoRe approach by December 31, 2024?

Which benchmark will Google DeepMind's new LLM approach top first by end of 2024?

What will be DeepMind's next major AI advancement announcement by end of 2024?

What will be the next major achievement by Google DeepMind AI in a competitive setting by December 31, 2024?

What will be the next major competition for DeepMind's AI by end of 2024?

Which Google product will first integrate DeepMind's new LLM approach by end of 2024?

Will DeepMind's GenRM improve LLM benchmark scores by a significant margin by end of Q1 2025?

Will Google integrate DeepMind's new LLM approach into Google Search by mid-2025?

What will be the next major milestone achieved by OpenAI in AGI development by end of 2024?

What will be Google DeepMind's table tennis robot's next major achievement by December 31, 2024?

What significant milestone will OpenAI's AI achieve by the end of 2024?

Will another major tech company adopt Google DeepMind's SCoRe method by the end of 2024?

Will Google DeepMind's SCoRe method achieve a 20% gain in self-correction for MATH by mid-2025?

Will Google DeepMind's SCoRe method be integrated into a commercial product by the end of Q3 2024?

What will be the primary application domain for Google DeepMind's SCoRe method by the end of 2024?

Will another major tech company adopt Google DeepMind's SCoRe method by the end of 2024?

Will Google DeepMind's SCoRe method achieve a 20% gain in self-correction for MATH by mid-2025?

Will Google DeepMind's SCoRe method be integrated into a commercial product by the end of Q3 2024?

What will be the primary application domain for Google DeepMind's SCoRe method by the end of 2024?