DeepNewz Markets

Market

Will another major tech company adopt Google DeepMind's SCoRe approach for their LLMs by March 31, 2024?

Google DeepMind

Resolution / Starting Odds

Yes • 50%

No • 50%

Official announcements from major tech companies or credible news reports

Story

Google DeepMind's SCoRe Achieves 15.6% Gain in Self-Correction for Language Models

Sep 20, 2024, 01:26 PM

Google DeepMind has developed a new multi-turn chain of thought online reinforcement learning (RL) approach called SCoRe to improve the self-correction capabilities of large language models (LLMs). This method uses entirely self-generated data and has achieved state-of-the-art performance in self-correction. The approach has shown a 15.6% gain in self-correction for reasoning problems from the MATH dataset and a 9.1% improvement in other areas. The research, authored by A Kumar, V Zhuang, R Agarwal, and Y Su, suggests that training with off-the-shelf datasets for RL is less effective compared to using on-policy data from the model being fine-tuned.

View original story

Similar markets

$Will another major tech company adopt Google DeepMind's SCoRe method by the end of 2024?$

Meta • 25%

Other • 25%

$Will Google DeepMind's SCoRe method be integrated into a commercial product by the end of Q3 2024?$

Market

Story

Similar markets

Will another major tech company adopt Google DeepMind's SCoRe method by the end of 2024?

Will three major tech companies adopt Google DeepMind's new LLM approach by end of 2024?

Which major tech company will first adopt Google DeepMind's new LLM approach by end of 2024?

Will Google DeepMind's SCoRe method be integrated into a commercial product by the end of Q3 2024?

Will a major tech company adopt OpenAI's O1 model by March 31, 2025?

Will DeepMind's GenRM be adopted by at least one major tech company other than Google by end of Q2 2025?

Will Google DeepMind's new LLM approach achieve significant benchmark improvement by end of 2024?

Will a major tech company adopt OpenAI's o1 model for commercial use by March 31, 2025?

Will a major tech company adopt OpenAI's Rule-Based Rewards by the end of 2024?

Will a major tech company adopt GoogleDeepMind's IRL method by end of 2024?

Will a major tech company adopt GoogleDeepMind's JEST AI training technique by the end of 2024?

Will Google DeepMind's MoNE framework be adopted by a major tech company by end of 2024?

Will Google DeepMind release a new version of its SCoRe approach by June 30, 2024?

Will Google DeepMind's SCoRe approach achieve a 20% gain in self-correction for the MATH dataset by December 31, 2024?

What will be the next significant milestone achieved by Google DeepMind's SCoRe approach by December 31, 2024?

Which area will see the highest improvement due to Google DeepMind's SCoRe approach by December 31, 2024?

Will Google DeepMind release a new version of its SCoRe approach by June 30, 2024?

Will Google DeepMind's SCoRe approach achieve a 20% gain in self-correction for the MATH dataset by December 31, 2024?

What will be the next significant milestone achieved by Google DeepMind's SCoRe approach by December 31, 2024?

Which area will see the highest improvement due to Google DeepMind's SCoRe approach by December 31, 2024?