browser-use is a Python-based browser agent that provides programmable browser automation capabilities. It supports local Chrome, Lexmount cloud browsers, and AgentBay cloud browsers.
Installation
bubench run will create the agent venv defined in config.yaml (default .venvs/browser_use) and install the browser-use
extra on first use.
Activate .venv (or use uv run bubench ...) before running bubench commands.
Configuration
Configure browser-use in the root config.yaml under agents.browser-use.
Set active_model to the profile name you want to use by default, then switch at runtime with --model <name>.
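The relationship between active_model and --model can be sketched in a few lines of Python. This is only an illustration, not bubench's actual loader: the `models` key, the dictionary layout, and the `select_profile` helper are assumptions made for the example; only `active_model`, `--model`, and the parameter names come from this page.

```python
import argparse

# Hypothetical in-memory view of the agents.browser-use section of config.yaml.
# The "models" key and this nesting are assumptions, not the documented schema.
agent_config = {
    "active_model": "browser-use",
    "models": {
        "browser-use": {"model_type": "BROWSER_USE", "model_id": "bu-2-0"},
        "gpt": {"model_type": "OPENAI", "model_id": "gpt-4.1"},
    },
}


def select_profile(config, cli_model=None):
    """Return (name, profile), preferring the CLI override over active_model."""
    name = cli_model or config["active_model"]
    try:
        return name, config["models"][name]
    except KeyError:
        known = ", ".join(sorted(config["models"]))
        raise ValueError(f"unknown model profile {name!r}; known: {known}") from None


# Stand-in parser that mimics a --model flag for this sketch.
parser = argparse.ArgumentParser()
parser.add_argument("--model", default=None)

default_name, _ = select_profile(agent_config, parser.parse_args([]).model)
override_name, override_profile = select_profile(
    agent_config, parser.parse_args(["--model", "gpt"]).model
)
print(default_name, override_name, override_profile["model_id"])
```

Keeping the default in the config file and the override on the command line means a benchmark run can switch models without editing config.yaml.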
Supported Model Types
| model_type | Description | Additional Keys |
|---|---|---|
| BROWSER_USE | Browser Use official API | api_key ($BROWSER_USE_API_KEY) |
| OPENAI | OpenAI-compatible models | api_key, base_url |
| GEMINI | Gemini models | api_key, base_url |
Configuration Parameters
| Parameter | Description | Example |
|---|---|---|
| active_model | Default model profile | browser-use, gpt |
| model_type | Model provider type | BROWSER_USE, OPENAI, GEMINI |
| model_id | Model ID | bu-2-0, gpt-4.1 |
| api_key | API key (use $ENV_VAR form) | $BROWSER_USE_API_KEY |
| browser_id | Browser backend | Chrome-Local, lexmount, agentbay |
| use_vision | Pass screenshots to the LLM alongside DOM state | true, false |
| max_steps | Max steps per task | 40 |
| timeout | Task timeout (seconds) | 600 |
| lexmount_browser_mode | Lexmount browser mode | normal (default), light |
| agentbay_api_key | AgentBay API key (use $ENV_VAR form) | $AGENTBAY_API_KEY |
| agentbay_image_id | AgentBay session image | browser_latest (default) |
| agentbay_enable_browser_replay | Enable AgentBay replay | true (default), false |
| agentbay_browser_use_stealth | Enable AgentBay stealth | false (default), true |
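The $ENV_VAR form used for api_key and agentbay_api_key keeps secrets out of config.yaml. As a rough sketch of how such references could be expanded at load time (the `resolve_env_refs` helper is hypothetical, and bubench's real resolution logic may differ):

```python
import os


def resolve_env_refs(value, env=None):
    """Recursively replace strings of the form "$NAME" with env["NAME"]."""
    env = os.environ if env is None else env
    if isinstance(value, dict):
        return {key: resolve_env_refs(item, env) for key, item in value.items()}
    if isinstance(value, list):
        return [resolve_env_refs(item, env) for item in value]
    if isinstance(value, str) and value.startswith("$") and len(value) > 1:
        name = value[1:]
        if name not in env:
            raise KeyError(f"environment variable {name} is not set")
        return env[name]
    return value  # numbers, booleans, and plain strings pass through unchanged


# Example profile using parameter names from the table above.
profile = {
    "model_type": "BROWSER_USE",
    "api_key": "$BROWSER_USE_API_KEY",
    "max_steps": 40,
}
# An explicit env mapping is passed here so the example does not depend on
# the real process environment.
resolved = resolve_env_refs(profile, env={"BROWSER_USE_API_KEY": "example-key"})
print(resolved["api_key"])
```

Failing loudly on an unset variable, rather than passing the literal `$NAME` string to a provider, tends to surface misconfiguration before a task starts.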
Why no browser_control?
browser-use always operates with DOM and vision running in parallel internally, so there is no separate "DOM-only" or "vision-only" mode to expose. use_vision only controls whether screenshots are included in the LLM's context; the underlying interaction strategy is fixed by the framework itself.
Browser Modes
Local Browser: Use local Chrome, suitable for development and debugging. No extra parameters required.
Lexmount Cloud Browser: Set LEXMOUNT_API_KEY / LEXMOUNT_PROJECT_ID in .env, then reference them under browser in config.yaml.
AgentBay Cloud Browser: Set AGENTBAY_API_KEY in .env, then reference it under browser in config.yaml.
- AgentBay SDK is treated as an optional dependency. Missing packages or incompatible exports fail only when browser_id: agentbay is selected; other browser modes continue to work.
- Session cleanup failures in the AgentBay backend are logged and do not mask task execution errors.
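Both AgentBay notes describe common Python patterns: a lazy import for an optional dependency, and cleanup that logs its own failures instead of replacing the task's exception. The sketch below illustrates the idea only; the module name `agentbay`, the logger name, and both helper functions are assumptions, not the project's actual code.

```python
import importlib
import logging

logger = logging.getLogger("browser_backend")  # hypothetical logger name


def load_backend_sdk(browser_id):
    """Import the AgentBay SDK only when the agentbay backend is selected."""
    if browser_id != "agentbay":
        return None  # other backends never touch the optional package
    try:
        # "agentbay" as the importable module name is an assumption.
        return importlib.import_module("agentbay")
    except ImportError as exc:
        raise RuntimeError(
            "browser_id 'agentbay' requires the AgentBay SDK to be installed"
        ) from exc


def run_with_cleanup(task, cleanup):
    """Run task(); log cleanup failures so they cannot hide a task error."""
    try:
        return task()
    finally:
        try:
            cleanup()
        except Exception:
            # Logged and swallowed: the task's result or exception wins.
            logger.exception("session cleanup failed")
```

With this shape, a run using Chrome-Local never attempts the import, and an exception raised by `task` still reaches the caller even if `cleanup` also fails.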
Usage Examples
Basic Run
Run Specific Tasks
Evaluation
Supported Benchmarks
- ✅ LexBench-Browser
- ✅ Online-Mind2Web
- ✅ BrowseComp