Dev Tools · 2h ago
Claude Effort Levels Benchmarked: Low vs Max on Real Tasks
A developer tested Claude's five effort settings on classification, code generation, and multi-step audit tasks. For simple classification, low effort matched the quality of max while using one-eighth as many tokens. For complex audits, xhigh used fewer total tokens than medium because it avoided dead ends.
Meridian48 take
The benchmark offers practical guidance for developers, but results may vary by task and model version.
Read the full reporting
Effort Levels in Practice: I Benchmarked low Through max on Real Tasks
DEV Community
claude-ai · llm-benchmarking