Skyvern is a Python-based browser automation agent built on the Skyvern SDK. It supports multiple execution engines (skyvern_v1, skyvern_v2, openai-cua, anthropic-cua, ui-tars) and browser backends (local, cdp, lexmount, skyvern-cloud).
Installation
# Install core dependencies and register the bubench CLI
uv sync
The skyvern and browser-use extras conflict. Use a separate venv (default .venvs/skyvern for skyvern, and .venvs/browser_use for browser-use in config.yaml).
bubench run will auto-create that venv and install the skyvern extra on first use.
Activate .venv (or use uv run bubench ...) before running bubench commands.
Configuration
Copy the example config, then edit it:
cp configs/agents/skyvern/config.yaml.example configs/agents/skyvern/config.yaml
Edit configs/agents/skyvern/config.yaml:
# LLM settings
ENABLE_OPENAI_COMPATIBLE: true
OPENAI_COMPATIBLE_MODEL_NAME: gemini-3-flash-preview
OPENAI_COMPATIBLE_MAX_TOKENS: 16000
OPENAI_COMPATIBLE_TEMPERATURE: 0.0
OPENAI_COMPATIBLE_SUPPORTS_VISION: true
# Engine
ENGINE: skyvern_v2
# Browser
BROWSER_ID: local
HEADLESS: false
LEXMOUNT_BROWSER_MODE: normal
# Runtime
MAX_STEPS: 25
TIMEOUT: 600
Store sensitive keys in the repo root .env, such as OPENAI_COMPATIBLE_API_KEY or SKYVERN_API_KEY.
Engine Options
| ENGINE | Description |
|---|
skyvern_v1 | Skyvern v1 engine |
skyvern_v2 | Skyvern v2 engine |
openai-cua | OpenAI CUA engine |
anthropic-cua | Anthropic CUA engine |
ui-tars | UI-TARS engine |
Browser Backends
| BROWSER_ID | Description |
|---|
local | Local browser |
cdp | Connect to an external CDP browser (set CDP_ADDRESS) |
lexmount | Lexmount cloud browser (set LEXMOUNT_BROWSER_MODE) |
skyvern-cloud | Skyvern cloud browser |
Common Parameters
| Parameter | Description | Example |
|---|
ENABLE_OPENAI_COMPATIBLE | Enable OpenAI-compatible mode | true |
OPENAI_COMPATIBLE_MODEL_NAME | Model name | gemini-3-flash-preview |
OPENAI_COMPATIBLE_MAX_TOKENS | Max output tokens | 16000 |
OPENAI_COMPATIBLE_TEMPERATURE | Temperature | 0.0 |
OPENAI_COMPATIBLE_SUPPORTS_VISION | Model supports vision | true |
HEADLESS | Headless mode for local browser | true / false |
TIMEOUT | Task timeout (seconds) | 600 |
MAX_STEPS | Max steps per task | 25 |
MAX_SCREENSHOT_SCROLLS | Max scroll screenshots | 5 |
INCLUDE_ACTION_HISTORY_IN_VERIFICATION | Include action history in verification | true |
MAX_CONSECUTIVE_REPEATS | Max consecutive repeats | 3 |
MAX_ACTION_OCCURRENCES | Max occurrences of one action | 5 |
Usage Examples
Basic Run
bubench run \
--agent skyvern \
--benchmark LexBench-Browser \
--mode first_n \
--count 3
Run All Tasks
bubench run \
--agent skyvern \
--benchmark LexBench-Browser \
--mode all \
--skip-completed
Evaluation
bubench eval --agent skyvern --benchmark LexBench-Browser
Supported Benchmarks
- ✅ LexBench-Browser
- ✅ Online-Mind2Web
- ✅ BrowseComp
Links