Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich AI research institution will publish the most papers using CRAB framework by end of 2024?
KAUST • 13%
UTokyo • 13%
CMU • 13%
Stanford • 13%
Harvard • 13%
Tsinghua • 13%
SUSTech • 13%
Oxford • 13%
Publications and research papers indexed in major academic databases such as Google Scholar or arXiv
CamelAIOrg Releases Open-Source CRAB Framework for AI Benchmarking
Aug 10, 2024, 10:44 AM
Researchers from KAUST, UTokyo, CMU, Stanford, Harvard, Tsinghua, SUSTech, and Oxford, in collaboration with CamelAIOrg, have developed the CRAB framework, an AI framework designed for building LLM agent benchmark environments in a Python-centric way. CRAB, which stands for Cross-environment Agent Benchmark, aims to become a general-purpose agent benchmark framework for Multimodal Language Model (MLM) agents. The framework includes CRAB Benchmark-v0, developed using the CRAB framework, which features 100 tasks across two environments, Ubuntu and Android. It provides an end-to-end and easy-to-use framework to build multimodal agents, operate environments, and create benchmarks to evaluate them. The CRAB framework is now open-sourced, allowing agents to control devices such as mobile phones, laptops, or desktops from a single prompt.
View original story
OpenAI • 25%
DeepMind • 25%
Google AI • 25%
Microsoft Research • 25%
Google DeepMind • 25%
OpenAI • 25%
Microsoft Research • 25%
Other • 25%
University of Oxford • 25%
University of British Columbia • 25%
MIT • 25%
Stanford University • 25%
Meta (Llama 3) • 25%
OpenAI (GPT-4o) • 25%
Anthropic (Claude 3.5 Sonnet) • 25%
Other • 25%
Nature • 25%
Science • 25%
Journal of Machine Learning Research • 25%
Other • 25%
University of Michigan • 25%
EPFL • 25%
MIT • 25%
Other • 25%
MIT • 25%
Stanford • 25%
Harvard • 25%
Other • 25%
Harvard University • 25%
MIT • 25%
Stanford University • 25%
University of Tokyo • 25%
Stanford University • 33%
Washington University • 33%
Google DeepMind • 34%
OpenAI • 25%
DeepMind • 25%
Anthropic • 25%
Other • 25%
Android • 50%
Ubuntu • 50%