vercel-labs-agent-browser
๐ชMarketplacevercel-labs/agent-browser
Headless browser automation for AI agents
Overview
Agent Browser is a headless browser automation CLI designed specifically for AI agents. Built with a fast Rust core and a Node.js fallback, it provides a comprehensive set of commands for navigating, clicking, filling forms, taking screenshots, and extracting data from web pages. Its accessibility-tree-based snapshot system with element refs makes it particularly well-suited for AI-driven browser interactions.
Key Features
- Accessibility tree snapshots โ The
snapshotcommand returns a structured accessibility tree with element refs, enabling AI agents to interact with page elements by reference rather than fragile CSS selectors. - Comprehensive browser commands โ Supports clicking, typing, filling, scrolling, drag-and-drop, file uploads, screenshots, PDF export, and JavaScript evaluation all from the command line.
- Dual selector support โ Works with both ref-based selectors from snapshots (e.g.,
@e2) and traditional CSS selectors (e.g.,#submit), giving flexibility in how elements are targeted. - Session and tab management โ Supports multiple pages, persistent sessions via
--profile, and CDP connections, allowing complex multi-page workflows. - Fast Rust CLI โ Ships with a native Rust binary for performance, with automatic fallback to Node.js if needed.
Who is this for?
Agent Browser is built for developers and AI engineers who need reliable, programmatic browser automation for AI agent workflows. It is especially useful for teams building autonomous agents that need to interact with web applications, run end-to-end tests, or scrape structured data from websites.
Add this Marketplace
/plugin marketplace add vercel-labs/agent-browserPlugins in this Marketplace
More from this repository9
Automates web browsing tasks by navigating websites, extracting content, interacting with elements, and capturing screenshots programmatically
Systematically explores and tests web applications to find bugs, UX issues, and other problems.
Skill for automating Electron desktop apps (VS Code, Slack, Discord, Figma, Notion, Spotify, etc.) using agent-browser via Chrome DevTools Protocol. Connects to running Electron apps and controls them programmatically.
Skill from agent-browser, a headless browser automation CLI for AI agents with a fast Rust binary and Node.js fallback for cross-platform support.
Skill
A concise, practical guide for creating specialized skills that extend Claude's capabilities with domain-specific knowledge and workflows.
Run agent-browser browser automation on AWS Bedrock AgentCore cloud browser sessions, resolving credentials automatically from environment variables or the AWS CLI. All standard agent-browser commands (open, snapshot, click, screenshot, close) work identically โ only the browser runtime changes to a managed AWS-hosted session.
Browser automation CLI for AI agents, built as a fast native Rust binary. Supports opening pages, taking snapshots, clicking, typing, and extracting content, with automatic detection of existing Chrome, Brave, Playwright, and Puppeteer installations.
Skill