DeepNewz Markets

Market

Will another major tech company adopt Google DeepMind's SCoRe method by the end of 2024?

Google DeepMind•Hacker News

Resolution / Starting Odds

Yes • 50%

No • 50%

Official announcements from major tech companies or credible news sources

Story

$Google DeepMind's SCoRe Achieves 15.6% Gain in LLM Self-Correction for MATH$

Google DeepMind's SCoRe Achieves 15.6% Gain in LLM Self-Correction for MATH

Sep 20, 2024, 05:06 PM

Google DeepMind has developed a multi-turn online reinforcement learning (RL) approach to improve the self-correction capabilities of large language models (LLMs). The new method, named SCoRe, utilizes entirely self-generated data and achieves state-of-the-art performance in self-correction. This approach addresses the limitations of supervised fine-tuning (SFT), which has been found ineffective for self-correction due to a distribution mismatch. The research, titled 'Training Language Models to Self-Correct via Reinforcement Learning,' has gained significant attention, including being highlighted on Hacker News for AI papers. SCoRe achieved a 15.6% gain on self-correction for reasoning problems from MATH and a 9.1% improvement overall.

View original story

Similar markets

Will another major tech company adopt Google DeepMind's SCoRe approach for their LLMs by March 31, 2024?

Yes • 50%

No • 50%

Which major tech company will be the first to adopt Google DeepMind's SCoRe approach by March 31, 2024?

Microsoft • 25%

Apple • 25%

Amazon • 25%

Other • 25%

Meta • 25%

Other • 25%

What will be the next significant milestone achieved by Google DeepMind's SCoRe approach by December 31, 2024?

25% gain in self-correction for MATH dataset • 25%

15% gain in self-correction for other datasets • 25%

Adoption by three major tech companies • 25%

Other • 25%

Meta • 25%

Other • 25%

Market

Story

Similar markets

Will another major tech company adopt Google DeepMind's SCoRe approach for their LLMs by March 31, 2024?

Which major tech company will be the first to adopt Google DeepMind's SCoRe approach by March 31, 2024?

Will a major tech company adopt GoogleDeepMind's JEST AI training technique by the end of 2024?

Will three major tech companies adopt Google DeepMind's new LLM approach by end of 2024?

Will Google DeepMind release a new version of its SCoRe approach by June 30, 2024?

Will a major tech company adopt GoogleDeepMind's IRL method by end of 2024?

Which major tech company will first adopt Google DeepMind's new LLM approach by end of 2024?

What will be the next significant milestone achieved by Google DeepMind's SCoRe approach by December 31, 2024?

Will a major tech company adopt OpenAI's Rule-Based Rewards by the end of 2024?

Will DeepMind's GenRM be adopted by at least one major tech company other than Google by end of Q2 2025?

Will Google DeepMind's MoNE framework be adopted by a major tech company by end of 2024?

Which major tech company will be the first to adopt GoogleDeepMind's JEST AI training technique by the end of 2024?

Will Google DeepMind's SCoRe method achieve a 20% gain in self-correction for MATH by mid-2025?

Will Google DeepMind's SCoRe method be integrated into a commercial product by the end of Q3 2024?

What will be the next major milestone in LLM self-correction achieved by Google DeepMind by the end of 2024?

What will be the primary application domain for Google DeepMind's SCoRe method by the end of 2024?

Will Google DeepMind's SCoRe method achieve a 20% gain in self-correction for MATH by mid-2025?

Will Google DeepMind's SCoRe method be integrated into a commercial product by the end of Q3 2024?

What will be the next major milestone in LLM self-correction achieved by Google DeepMind by the end of 2024?

What will be the primary application domain for Google DeepMind's SCoRe method by the end of 2024?