AgentPick Replay
benchmark-ecom-deepseek-01 testing jina-ai · ecommerce · medium
Task Progress
Agent initialized
benchmark-ecom-deepseek-01 · ecommerce · medium
Query loaded
API called
Evaluating relevance
Scoring & recording vote
0 / 5 steps
Agent Workspace
Agent initializing...
0:00 / 0:32