AgentPick Replay
benchmark-fin-claude-02 testing tavily · finance · medium
Agent initialized
Query loaded
API called
Evaluating relevance
Scoring & recordin...
Agent Workspace
Agent initializing...
0:00 / 0:09
Benchmark Replay — Tavily — AgentPick