DeepNewz Markets

Market

Will a major tech company adopt GoogleDeepMind's IRL method by end of 2024?

DeepMind•Scalable Inverse Reinforcement Learning•Maximum Likelihood Estimation•GoogleDeepMind

Resolution / Starting Odds

Yes • 50%

No • 50%

Official announcements or press releases from major tech companies

Story

GoogleDeepMind Introduces Scalable Inverse Reinforcement Learning for Language Models

Sep 5, 2024, 09:23 AM

DeepMind has introduced a new approach to language model training using Scalable Inverse Reinforcement Learning (IRL). This method presents an effective alternative to traditional supervised Maximum Likelihood Estimation (MLE) in the fine-tuning pipeline, resulting in more robust reward functions and increased performance and diversity of model generations. The foundation of this approach lies in imitation learning, which is considered a reinforcement learning problem. Compared to supervised learning, IRL better exploits sequential structure, online data, and further extracts rewards. The insights were shared in a recent paper by GoogleDeepMind.

View original story

Similar markets

Will a major tech company adopt OpenAI's Rule-Based Rewards by the end of 2024?

Yes • 50%

No • 50%

Which major tech company will first adopt Google DeepMind's new LLM approach by end of 2024?

Microsoft • 25%

Amazon • 25%

Meta • 25%

Other • 25%

Will three major tech companies adopt Google DeepMind's new LLM approach by end of 2024?

Yes • 50%

No • 50%

$Will another major tech company adopt Google DeepMind's SCoRe method by the end of 2024?$

Apple • 25%

Other • 25%

Will a major tech company adopt OpenAI's Prover-Verifier Games approach by the end of 2024?

Yes • 50%

No • 50%

Which major tech company will be the first to adopt GoogleDeepMind's JEST AI training technique by the end of 2024?

Microsoft • 25%

Amazon • 25%

Meta • 25%

Other • 25%

Will a major tech company adopt the new AI energy-saving technique by the end of 2024?

Yes • 50%

No • 50%

Market

Story

Similar markets

Will a major tech company adopt OpenAI's Rule-Based Rewards by the end of 2024?

Which major tech company will first adopt Google DeepMind's new LLM approach by end of 2024?

Will three major tech companies adopt Google DeepMind's new LLM approach by end of 2024?

Will another major tech company adopt Google DeepMind's SCoRe method by the end of 2024?

Will a major tech company adopt GoogleDeepMind's JEST AI training technique by the end of 2024?

Will a major tech company adopt OpenAI o1 model for internal use by end of 2024?

Will DeepMind's GenRM be adopted by at least one major tech company other than Google by end of Q2 2025?

Will Google DeepMind's MoNE framework be adopted by a major tech company by end of 2024?

Which major tech company will first adopt Google DeepMind's MoNE framework for commercial use by end of 2024?

Will a major tech company adopt OpenAI's Prover-Verifier Games approach by the end of 2024?

Which major tech company will be the first to adopt GoogleDeepMind's JEST AI training technique by the end of 2024?

Will a major tech company adopt the new AI energy-saving technique by the end of 2024?

Will GoogleDeepMind publish a follow-up paper on IRL by end of Q1 2025?

Will GoogleDeepMind's IRL method outperform MLE in a benchmark by end of 2024?

In which application will GoogleDeepMind's IRL method be used by end of 2024?

What will be the performance improvement of GoogleDeepMind's IRL method over traditional methods by end of 2024?

Will GoogleDeepMind publish a follow-up paper on IRL by end of Q1 2025?

Will GoogleDeepMind's IRL method outperform MLE in a benchmark by end of 2024?

In which application will GoogleDeepMind's IRL method be used by end of 2024?

What will be the performance improvement of GoogleDeepMind's IRL method over traditional methods by end of 2024?