Loading...
Loading...
Browse all stories on DeepNewz
VisitGoogle DeepMind's GenRM Enhances LLM Accuracy by Verifying Outputs
Sep 3, 2024, 01:04 PM
Researchers from Google DeepMind, University of Toronto, MILA, and UCLA have introduced a novel approach called Generative Reward Modeling (GenRM). DeepMind's GenRM improves the accuracy of Large Language Models (LLMs) by training them to verify their own outputs using next-token prediction and chain-of-thought (CoT) reasoning. The approach leverages the text generation capabilities of LLMs to improve their performance.
View original story
Markets
Yes • 50%
No • 50%
Official announcements from major tech companies such as Microsoft, Amazon, or Apple
No • 50%
Yes • 50%
Academic databases such as Google Scholar or PubMed
No • 50%
Yes • 50%
Published benchmark results in academic papers or tech blogs
ICML • 25%
AAAI • 25%
Other • 25%
NeurIPS • 25%
Official announcements from AI research award organizations
Other • 25%
Cohere • 25%
OpenAI GPT • 25%
Anthropic Claude • 25%
Official announcements from companies like OpenAI, Anthropic, or Cohere
Other • 25%
Healthcare • 25%
Finance • 25%
Education • 25%
Official announcements from companies in various sectors such as healthcare, finance, and education