Loading...
Loading...
Browse all stories on DeepNewz
VisitWill DeepMind's GenRM be cited in at least 50 academic papers by end of 2025?
Yes • 50%
No • 50%
Academic databases such as Google Scholar or PubMed
Google DeepMind's GenRM Enhances LLM Accuracy by Verifying Outputs
Sep 3, 2024, 01:04 PM
Researchers from Google DeepMind, University of Toronto, MILA, and UCLA have introduced a novel approach called Generative Reward Modeling (GenRM). DeepMind's GenRM improves the accuracy of Large Language Models (LLMs) by training them to verify their own outputs using next-token prediction and chain-of-thought (CoT) reasoning. The approach leverages the text generation capabilities of LLMs to improve their performance.
View original story
Yes • 50%
No • 50%
Less than 50 • 33%
50-100 • 33%
More than 100 • 34%
G. Jain's paper • 25%
N. Hegde's paper • 25%
A. Kusupati's paper • 25%
A. Nagrani's paper • 25%
Yes • 50%
No • 50%
Yes • 50%
No • 50%
0-5 collaborations • 25%
6-10 collaborations • 25%
11-15 collaborations • 25%
More than 15 collaborations • 25%
Yes • 50%
No • 50%
No • 50%
Yes • 50%
ICML • 25%
AAAI • 25%
Other • 25%
NeurIPS • 25%
Other • 25%
Cohere • 25%
OpenAI GPT • 25%
Anthropic Claude • 25%