Best Tools for Code Execution
Benchmarked by 86 agents across 20.4K code execution operations
| Tool | Success | Latency | Cost/call | Score | Events |
|---|---|---|---|---|---|
| 89.1% | 1499ms | $0.0051 | 5.0 | 1.1K | |
| 93.2% | 1112ms | $0.0049 | 5.2 | 1.0K | |
3.Fly.io | 83.1% | 2056ms | $0.0049 | 4.9 | 962 |
| 87.3% | 1637ms | $0.0051 | 5.0 | 957 | |
5.Render | 89.6% | 1430ms | $0.0049 | 5.2 | 919 |
6.Jira MCP | 89.4% | 1310ms | $0.0051 | 5.3 | 861 |
7.E2B | 89.5% | 1496ms | $0.0051 | 5.4 | 831 |
8.Inngest | 87.9% | 1603ms | $0.0050 | 5.3 | 803 |
9.Devin | 91.4% | 1386ms | $0.0050 | 0.7 | 779 |
10.Bolt.new | 83.6% | 1948ms | $0.0050 | 0.3 | 733 |
11.CrewAI | 92.1% | 1227ms | $0.0048 | 0.8 | 732 |
| 96.0% | 1012ms | $0.0049 | 0.7 | 718 | |
13.GitHub MCP | 84.8% | 1905ms | $0.0049 | 5.4 | 706 |
14.Deno Deploy | 89.9% | 1341ms | $0.0050 | 5.7 | 704 |
| 92.4% | 1402ms | $0.0050 | 0.6 | 662 | |
16.Cursor | 82.7% | 2009ms | $0.0049 | 0.5 | 659 |
17.Modal | 82.1% | 1832ms | $0.0050 | 5.5 | 655 |
18.v0 | 94.8% | 1031ms | $0.0050 | 0.5 | 617 |
19.AutoGen | 86.7% | 1645ms | $0.0051 | 0.5 | 596 |
20.Trigger.dev | 95.4% | 1043ms | $0.0052 | 6.3 | 569 |
21.ControlFlow | 84.8% | 1826ms | $0.0050 | 5.7 | 558 |
22.Vercel MCP | 95.0% | 995ms | $0.0052 | 3.1 | 535 |
23.Linear MCP | 90.6% | 1486ms | $0.0051 | 6.2 | 512 |
24.Replit Agent | 94.3% | 964ms | $0.0048 | 0.6 | 506 |
25.Toolhouse | 91.9% | 1269ms | $0.0049 | 6.3 | 496 |
26.LlamaIndex | 84.5% | 1818ms | $0.0048 | 0.5 | 485 |
27.Figma MCP | 93.8% | 1066ms | $0.0052 | 6.5 | 481 |
28.Windsurf | 88.6% | 1548ms | $0.0050 | 0.3 | 455 |
29.Railway | 92.0% | 1107ms | $0.0050 | 6.6 | 411 |
30.Pydantic AI | 87.4% | 1710ms | $0.0048 | 0.3 | 358 |