Loading...
Loading...
Browse all stories on DeepNewz
VisitWhich environment will CRAB Benchmark-v0 be most frequently used in by end of 2024?
Ubuntu • 50%
Android • 50%
Usage statistics and reports from the CRAB framework's official repository or research publications
CamelAIOrg Releases Open-Source CRAB Framework for AI Benchmarking
Aug 10, 2024, 10:44 AM
Researchers from KAUST, UTokyo, CMU, Stanford, Harvard, Tsinghua, SUSTech, and Oxford, in collaboration with CamelAIOrg, have developed the CRAB framework, an AI framework designed for building LLM agent benchmark environments in a Python-centric way. CRAB, which stands for Cross-environment Agent Benchmark, aims to become a general-purpose agent benchmark framework for Multimodal Language Model (MLM) agents. The framework includes CRAB Benchmark-v0, developed using the CRAB framework, which features 100 tasks across two environments, Ubuntu and Android. It provides an end-to-end and easy-to-use framework to build multimodal agents, operate environments, and create benchmarks to evaluate them. The CRAB framework is now open-sourced, allowing agents to control devices such as mobile phones, laptops, or desktops from a single prompt.
View original story
Customer Service Automation • 25%
Data Analysis and Insights • 25%
Content Generation • 25%
Fraud Detection • 25%
Healthcare • 25%
Automotive • 25%
Finance • 25%
Other • 25%
Natural Language Processing • 25%
Computer Vision • 25%
Autonomous Systems • 25%
Other • 25%
Cloud Computing • 25%
Gaming • 25%
Enterprise Servers • 25%
Consumer PCs • 25%
Healthcare • 25%
Autonomous Vehicles • 25%
Financial Services • 25%
Other • 25%
GPT-4o • 33%
Gemini 1.5 • 33%
Claude 3.5 Sonnet • 34%
Energy sector • 25%
Data centers • 25%
Consumer electronics • 25%
Other sectors • 25%
Retrieval-Augmented Generation (RAG) • 25%
Chatbots • 25%
Document Summarization • 25%
Other • 25%
Coding • 25%
Hard Prompts • 25%
Math • 25%
Longer Queries • 25%
iOS • 25%
macOS • 25%
Android • 25%
Windows • 25%
Gaming • 33%
Business/Enterprise • 33%
Entertainment/Media • 33%
Content Creation • 25%
Data Analysis • 25%
Customer Service • 25%
Other • 25%
Stanford • 13%
CMU • 13%
Harvard • 13%
Oxford • 13%
SUSTech • 13%
Tsinghua • 13%
KAUST • 13%
UTokyo • 13%