A side-by-side comparison playground for AI browser automation agents. Compare how different AI models (Google, OpenAI, Anthropic) control browsers using natural language through Stagehand and Browserbase.
- 🥊 Side-by-Side Comparison: Run Google vs OpenAI or Google vs Anthropic simultaneously
 - 🤖 Multiple AI Models: Google Computer Use, OpenAI Computer Use, and Anthropic Claude
 - 🌐 Real Browser Environment: Powered by Browserbase with actual Chrome instances
 - 🎯 Natural Language Commands: Describe tasks in plain English
 - 📊 Real-time Streaming: Watch both agents work in parallel with live updates
 - 🔄 Flexible Provider Selection: Switch right-side provider between OpenAI and Anthropic
 
- Frontend: Next.js 15 with TypeScript, React 19, and Tailwind CSS
 - AI Models:
- Google: 
computer-use-preview-10-2025 - OpenAI: 
computer-use-preview-2025-03-11 - Anthropic: 
claude-sonnet-4-5-20250929 
 - Google: 
 - Browser Automation: Browserbase + Stagehand
 - Streaming: Server-Sent Events (SSE) for real-time updates
 - UI Components: Framer Motion animations, Lucide React icons
 - Analytics: PostHog for user tracking
 
- Node.js 18.x or later
 - pnpm (recommended) or npm
 - API keys for Google AI Studio, OpenAI, Anthropic, and Browserbase
 
- 
Clone the repository:
git clone https://github.com/browserbase/arena cd arena - 
Install dependencies:
pnpm install
 - 
Set up environment variables:
cp .env.example .env.local
Then edit
.env.localwith your API keys:# AI Provider API Keys GOOGLE_API_KEY=your_google_api_key_here OPENAI_API_KEY=your_openai_api_key_here ANTHROPIC_API_KEY=your_anthropic_api_key_here # Browserbase Configuration BROWSERBASE_API_KEY=your_browserbase_api_key_here BROWSERBASE_PROJECT_ID=your_browserbase_project_id_here # Optional: Analytics and monitoring NEXT_PUBLIC_POSTHOG_KEY=your_posthog_key NEXT_PUBLIC_POSTHOG_HOST=https://us.i.posthog.com # Site URL (for local development) NEXT_PUBLIC_SITE_URL=http://localhost:3000
Get your API keys:
- Google: Google AI Studio
 - OpenAI: OpenAI Platform
 - Anthropic: Anthropic Console
 - Browserbase: Browserbase Dashboard
 
 - 
Start the development server:
pnpm dev
 - 
Open your browser: Navigate to http://localhost:3000
 
- Select Providers: Choose between OpenAI or Anthropic for the right-side comparison (Google is always on the left)
 - Enter a Task: Type a natural language instruction or use one of the example prompts:
- Review a pull request on GitHub
 - Browse Hacker News for trending debates
 - Play a game of 2048
 - Get current crypto prices
 
 - Watch Both Agents: See real-time execution with screenshots and logs from both models
 - Compare Results: Observe how different models approach and solve the same task
 
# Development server with Turbopack
pnpm dev
# Production build
pnpm build
# Start production server
pnpm start
# Run linting
pnpm lintAgent behavior is configured via AGENT_INSTRUCTIONS:
- Atomic step-by-step execution
 - Direct navigation preferred over search
 - Risk-averse action selection
 - Automatic screenshots after key actions
 - Maximum 100 steps per execution
 
- Maximum session duration: 10 minutes (Vercel timeout)
 - Viewport locked at 1288x711 pixels
 - No keyboard shortcuts support
 - Browser sessions expire after completion
 - Left panel always runs Google (fixed)
 
- Session fails to start: Verify Browserbase API credentials and project ID
 - Agent not responding: Check that API keys are valid for all selected providers
 - Timeout errors: Complex tasks may exceed 10-minute limit
 - Connection issues: Ensure stable internet for browser streaming
 - Missing providers: Confirm all required API keys are set in 
.env.local 
This is a demo playground project showcasing multi-provider browser automation. Feel free to fork and experiment!
MIT
- Browserbase for browser infrastructure
 - Stagehand for automation framework
 - Google AI Studio, OpenAI, and Anthropic for AI capabilities
 - Vercel for hosting and edge functions