Researching SWE agents is hard: there are thousands of heavyweight environments, setup is slow, and reproducibility is fragile. Sandboxes provides a branchable, VM-isolated sandbox designed for rapid experimentation across thousands of SWE environments, with over 7,000 preloaded out of the box.

Purpose-built for SWE agent research

  • Thousands of ready-to-run SWE environments, so you can benchmark agents with minimal infrastructure setup.
  • Git-like branching to try multiple patches or strategies in parallel from the same checkpoint, then keep only the winning branch. This supports search techniques such as MCTS, beam search, rollbacks, and value-function estimation.
  • VM-level isolation with per-run metrics to keep generated code contained and experiments auditable.
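
To make the branching idea concrete, here is a minimal, self-contained sketch of beam search over branchable checkpoints. It does not use the Contree API: `Snapshot`, `branch`, `read`, `score`, and `beam_search` are hypothetical names, and an in-memory file map stands in for a real sandbox checkpoint. The toy "agent" appends one character per step to `answer.txt` and is scored on how closely the file matches a target string.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Snapshot:
    """Hypothetical stand-in for a sandbox checkpoint: an immutable file map."""

    files: tuple[tuple[str, str], ...] = ()

    def read(self, path: str) -> str:
        """Return the content of a file, or an empty string if it is absent."""
        return dict(self.files).get(path, "")

    def branch(self, path: str, content: str) -> Snapshot:
        """Return a child snapshot with one file changed; the parent is untouched."""
        updated = dict(self.files)
        updated[path] = content
        return Snapshot(tuple(sorted(updated.items())))


def score(snapshot: Snapshot, target: str) -> int:
    """Toy value function: matching leading characters minus all other characters."""
    text = snapshot.read("answer.txt")
    matched = 0
    for got, want in zip(text, target):
        if got != want:
            break
        matched += 1
    return matched - (len(text) - matched)


def beam_search(root: Snapshot, candidates: str, target: str,
                width: int, depth: int) -> Snapshot:
    """Branch every snapshot in the beam once per candidate, keep the top `width`."""
    beam = [root]
    for _ in range(depth):
        children = [
            snap.branch("answer.txt", snap.read("answer.txt") + ch)
            for snap in beam
            for ch in candidates
        ]
        children.sort(key=lambda s: score(s, target), reverse=True)
        beam = children[:width]  # losing branches are simply discarded
    return beam[0]


root = Snapshot()
best = beam_search(root, candidates="abc", target="cab", width=2, depth=3)
print(best.read("answer.txt"))  # prints "cab"
```

Because each branch is derived from an immutable parent, discarding a losing branch never affects its siblings, and the root checkpoint can be reused for another search. In a real setup, the candidate step would be an agent-generated patch and the score would come from tests or a learned value function.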

Environment catalog

Pick from the preloaded catalog rather than building or pulling terabytes of images yourself. What’s included now:

Integrations

Contree CLI

Terminal-first agent workflows, scripted runs, session branching, and filesystem inspection.

Contree SDK

Integrate your own agent via contree-sdk.

Contree MCP

Plug sandboxes into AI assistants that speak the Model Context Protocol.