Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich domain will see highest accuracy improvement from o1-mini by June 30, 2025?
Legal • 25%
Finance • 25%
Biology • 25%
Engineering • 25%
Research publications or OpenAI announcements detailing domain-specific accuracy improvements
OpenAI's Reinforcement Fine-Tuning Program Boosts o1-mini's Accuracy by 31%
Dec 6, 2024, 06:22 PM
OpenAI has announced the launch of a new Reinforcement Fine-Tuning (RFT) program, allowing developers to customize AI models for specific domain tasks with minimal training data. This initiative enables the creation of expert models in fields like legal, finance, engineering, biology, and insurance using only a few dozen examples. The RFT process trains models to reason in new ways over custom domains, rather than merely memorizing answers. OpenAI demonstrated the capabilities of this approach with the o1-mini model, which, after fine-tuning, outperformed both its base version and the standard o1 model in tasks such as rare disease gene prediction, achieving a 31% higher accuracy rate. This development is part of a broader expansion of OpenAI's Reinforcement Fine-Tuning Research Program, aimed at enhancing model performance in specialized areas.
View original story
Improvement in medical reasoning • 25%
Improvement in coding tasks • 25%
Improvement in scientific reasoning • 25%
Other • 25%
Physics • 25%
Chemistry • 25%
Biology • 25%
Coding • 25%
MATH dataset • 25%
Natural Language Processing • 25%
Computer Vision • 25%
Other • 25%
Less than 5% • 25%
5% to 10% • 25%
10% to 20% • 25%
More than 20% • 25%
Software Programming • 25%
STEM Research • 25%
Legal Analysis • 25%
Disease Diagnosis • 25%
Google's Gemini • 25%
OpenAI's GPT • 25%
Microsoft's Azure AI • 25%
Other • 25%
Computer Vision • 25%
Autonomous Vehicles • 25%
Robotics • 25%
Other • 25%
Startups • 25%
Education • 25%
Small to Medium Enterprises (SMEs) • 25%
Other • 25%
Healthcare • 25%
Finance • 25%
Education • 25%
Technology • 25%
Chatbots • 25%
Virtual assistants • 25%
Content generation • 25%
Coding • 25%
Healthcare • 25%
Finance • 25%
Education • 25%
Entertainment • 25%
Reinforcement learning • 25%
Search-based reasoning • 25%
Thinking before answering • 25%
Other • 25%
Yes • 50%
No • 50%
Retail • 25%
Healthcare • 25%
Insurance • 25%
Automotive • 25%