Contribution Guide

Welcome to contribute to the browseruse-bench project! This guide will help you understand how to participate in project development.

Development Environment Setup

Fork Repository

Fork the browseruse-bench repository on GitHub

Clone Code

git clone https://github.com/your-username/browseruse-bench.git
cd browseruse-bench

Install Dependencies

# Using uv (Recommended)
uv sync --all-extras

# Or using pip
pip install -e ".[all,dev]"

Create Branch

git checkout -b feature/your-feature-name

Contribution Types

Adding New Agents

Create a new Agent directory in agents/
Implement the Agent interface
Add config.yaml.example and keep config.yaml local
Update documentation

See Adding New Agents for details.

Adding New Benchmarks

Create a new Benchmark directory in benchmarks/
Prepare task data and data_info.json
Implement evaluator (optional)
Update documentation

See Custom Benchmark for details.

Fixing Bugs

Create an Issue to describe the problem
Submit a PR with the fix
Ensure tests pass

Improving Documentation

Modify documentation in docs/ directory
Submit PR

Adding New Agents

Directory Structure

agents/
└── YourAgent/
    ├── config.yaml.example  # Committed example config
    ├── config.yaml          # Local config (ignored)
    ├── requirements.txt # Dependencies (Optional)
    └── run.py           # Entry Script (Optional)

Use config.yaml.example in the repo and keep config.yaml local for secrets.

Implement Interface

New Agents need to implement the following interface:

class YourAgent:
    def __init__(self, config: dict):
        """Initialize Agent"""
        pass
    
    async def execute_task(self, task: dict) -> dict:
        """
        Execute task
        
        Args:
            task: Task dictionary, containing task_id, task, etc.
            
        Returns:
            Result dictionary, containing action_history, metrics, etc.
        """
        pass

Register Agent

AGENTS = {
    "browser-use": ...,
    "Agent-TARS": ...,
    "YourAgent": {
        "module": "agents.YourAgent.run:YourAgent",
        "config_file": "agents/YourAgent/config.yaml"
    }
}

Code Standards

Formatting

# Use ruff to format
ruff format .

# Check code style
ruff check .

Type Checking

# Use mypy for type checking
mypy browseruse_bench

Testing

# Run all tests
pytest tests/

# Run specific test
pytest tests/test_eval.py -v

Submitting PR

Ensure Tests Pass

pytest tests/
ruff check .

Commit Changes

git add .
git commit -m "feat: add your feature description"

Push Branch

git push origin feature/your-feature-name

Create PR

Create a Pull Request on GitHub and describe your changes

Commit Convention

Use Conventional Commits format:

feat: New feature
fix: Bug fix
docs: Documentation update
refactor: Code refactoring
test: Add test
chore: Other changes

Example:

feat: add WebArena benchmark support
fix: resolve timeout issue in browser-use agent
docs: update quickstart guide

Get Started

Features

Examples

Development

Development Environment Setup

Contribution Types

Adding New Agents

Adding New Benchmarks

Fixing Bugs

Improving Documentation

Adding New Agents

Directory Structure

Implement Interface

Register Agent

Code Standards

Formatting

Type Checking

Testing

Submitting PR

Commit Convention

Get Started

Features

Examples

Development

​Development Environment Setup

​Contribution Types

​Adding New Agents

​Adding New Benchmarks

​Fixing Bugs

​Improving Documentation

​Adding New Agents

​Directory Structure

​Implement Interface

​Register Agent

​Code Standards

​Formatting

​Type Checking

​Testing

​Submitting PR

​Commit Convention

Development Environment Setup

Contribution Types

Adding New Agents

Adding New Benchmarks

Fixing Bugs

Improving Documentation

Adding New Agents

Directory Structure

Implement Interface

Register Agent

Code Standards

Formatting

Type Checking

Testing

Submitting PR

Commit Convention