DeepNewz Markets

Market

Will Google DeepMind release a new version of its SCoRe approach by June 30, 2024?

Google DeepMind

Resolution / Starting Odds

Yes • 50%

No • 50%

Resolution source: official announcements from Google DeepMind or credible news reports

Story

Google DeepMind's SCoRe Achieves 15.6% Gain in Self-Correction for Language Models

Sep 20, 2024, 01:26 PM

Google DeepMind has developed SCoRe, a multi-turn, chain-of-thought, online reinforcement learning (RL) approach that improves the self-correction capabilities of large language models (LLMs). The method trains entirely on self-generated data and achieves state-of-the-art self-correction performance: a 15.6% gain on reasoning problems from the MATH dataset and a 9.1% gain on the HumanEval coding benchmark. The research, authored by A Kumar, V Zhuang, R Agarwal, and Y Su, suggests that training on off-the-shelf datasets is less effective than RL on on-policy data generated by the model being fine-tuned.

Similar markets

Will Google DeepMind's SCoRe method be integrated into a commercial product by the end of Q3 2024?

Yes • 50%

No • 50%

Will another major tech company adopt Google DeepMind's SCoRe method by the end of 2024?

Yes • 50%

No • 50%

Will Google DeepMind release additional methodological details about AlphaProteo by March 31, 2025?

Yes • 50%

No • 50%

Will Google DeepMind release a major update to Imagen 3 addressing numerical reasoning and action depiction by June 30, 2025?

Yes • 50%

No • 50%

Will Google DeepMind release a public statement about 'Strawberry' AI model's performance by December 31, 2024?

Yes • 50%

No • 50%

Will Google DeepMind publish benchmarks showing two-fold reduction in inference time using MoNE framework by Nov 30, 2024?

Yes • 50%

No • 50%

What will be the primary application domain for Google DeepMind's SCoRe method by the end of 2024?

Education • 25%

Healthcare • 25%

Finance • 25%

Other • 25%

Will Google DeepMind's MoNE framework be integrated into a popular open-source AI library by June 30, 2024?

Will Google release an AI model that surpasses OpenAI's o1 model by June 30, 2024?

Will Google DeepMind publicly release V2A technology by end of 2024?

Will DeepMind publicly express concerns about Google's AI progress by February 28, 2025?

Will Google DeepMind release a commercial version of its table tennis robot by August 31, 2025?

Will another major tech company adopt Google DeepMind's SCoRe approach for their LLMs by March 31, 2024?

Will Google DeepMind's SCoRe approach achieve a 20% gain in self-correction for the MATH dataset by December 31, 2024?

What will be the next significant milestone achieved by Google DeepMind's SCoRe approach by December 31, 2024?

Which area will see the highest improvement due to Google DeepMind's SCoRe approach by December 31, 2024?
