Claw-some AI Agent Testing
Models ranked by Value Score (success % ÷ cost per run). Higher = more bang for your buck. CPST = Cost Per Successful Task.
Models without cost data are excluded. CPST is estimated from best run score (~40 tasks).
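As a rough illustration of the two metrics above, here is a minimal Python sketch. The Value Score follows the stated formula (success % ÷ cost per run). The CPST formula is an assumption: the page only says it is estimated from the best run score over roughly 40 tasks, so the sketch divides the cost of a run by the estimated number of successful tasks. The names `ModelResult`, `value_score`, `cpst`, `rank`, and `TASKS_PER_RUN` are hypothetical and not taken from PinchBench.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumption: the page says "~40 tasks", so 40 is used as a round figure.
TASKS_PER_RUN = 40


@dataclass
class ModelResult:
    name: str                      # hypothetical model label
    success_pct: float             # best run score, 0-100
    cost_per_run: Optional[float]  # cost of one full run; None if unknown


def value_score(r: ModelResult) -> float:
    """Success % divided by cost per run, as described in the text."""
    return r.success_pct / r.cost_per_run


def cpst(r: ModelResult, tasks: int = TASKS_PER_RUN) -> float:
    """Estimated Cost Per Successful Task (assumed formula)."""
    successful = r.success_pct / 100 * tasks
    if successful == 0:
        return float("inf")  # no successful tasks, so no finite cost
    return r.cost_per_run / successful


def rank(results: List[ModelResult]) -> List[ModelResult]:
    """Drop models without cost data, then sort by Value Score, best first."""
    priced = [r for r in results if r.cost_per_run]
    return sorted(priced, key=value_score, reverse=True)
```

With made-up inputs, a model scoring 50% at a cost of 0.50 per run would rank above one scoring 80% at 2.00 per run, because the cheaper model delivers more success per unit of cost.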
Hosted OpenClaw: your personal AI agent, managed by Kilo.
Hosting and inference cost for PinchBench sponsored by Kilo, so we totally hope you try KiloClaw so we can keep the lights on around here.
From $8/month + AI inference at cost