Agent-TARS is a browser agent developed by the TARS team, distributed as an npm CLI package with powerful browser automation capabilities.Documentation Index
Fetch the complete documentation index at: https://docs.bubench.lexmount.io/llms.txt
Use this file to discover all available pages before exploring further.
Installation
.venv (or use uv run bubench ...) before running bubench commands.
Requires Node.js 18+.
Configuration
Configure Agent-TARS in the rootconfig.yaml under agents.Agent-TARS:
active_model to the profile name you want to use by default, then switch at runtime with --model <name>.
Config Parameters
| Parameter | Description | Example |
|---|---|---|
active_model | Default model profile | gpt, claude |
model_provider | Model provider | openai, anthropic, volcengine |
model_id | Model ID | gpt-5.4, claude-sonnet-4-6 |
api_key | API key (use $ENV_VAR form) | $OPENAI_API_KEY |
base_url | API base URL (optional) | $OPENAI_BASE_URL |
browser_control | Browser control mode | dom, hybrid, visual-grounding |
browser_id | Browser backend | local (default), lexmount |
timeout | Task timeout (seconds) | Default 300, overridable via --timeout |
Browser Control Modes
Agent-TARS exposes browser interaction as two independent tool sets — DOM tools and vision tools — that the model can select from.browser_control determines which tool sets are available:
| Mode | How it works | When to use |
|---|---|---|
dom | Reads page structure and operates via DOM APIs (click, fill, etc.) | Fast and stable for standard web pages with accessible DOM |
visual-grounding | Takes a screenshot, uses a vision model to locate elements by coordinates | Handles canvas, dynamic content, and elements missing from the accessibility tree |
hybrid | Both tool sets exposed simultaneously; the model chooses per step | Best fault tolerance — DOM is tried first, vision acts as fallback |
hybrid is the recommended default for benchmarking.
Browser Modes
Local Browser: Agent-TARS CLI launches a local browser. No extra parameters required.LEXMOUNT_API_KEY at browser.lexmount.cn (mainland China) or browser.lexmount.com (international) — see Lexmount Cloud Browser for the full flow — set it in .env, then reference it under browser in config.yaml:
Usage
Basic Run
Run All Tasks
Evaluation
Supported Benchmarks
- ✅ LexBench-Browser
- ✅ Online-Mind2Web
- ✅ BrowseComp