Anthropic's Claude Sonnet Model Reveals 10M Features in AI Breakthrough
May 21, 2024, 04:10 PM
Researchers at Anthropic have made significant progress in understanding the inner workings of large language models (LLMs). Their latest breakthrough involves identifying over 10 million meaningful features within Claude 3 Sonnet, one of the models in the Claude 3 family. This advance, termed 'scaled interpretability,' could allow greater control over AI systems and make them more reliable by revealing how specific concepts such as San Francisco, lithium, or deception are represented within the model. The development marks a crucial step toward demystifying the 'black box' nature of generative AI, which has traditionally been difficult to interpret.