Claude Effort Levels Benchmarked: Low vs Max on Real Tasks

By Meridian48 News Desk · Summarised from DEV Community · July 3, 2026

A developer tested Claude's five effort settings on classification, code generation, and multi-step audit tasks. For simple classification, low effort matched max quality but used 8x fewer tokens. For complex audits, xhigh used fewer total tokens than medium by avoiding dead ends.

Meridian48 take

The benchmark offers practical guidance for developers, but results may vary by task and model version.

Read the full reporting

Effort Levels in Practice: I Benchmarked low Through max on Real Tasks →

DEV Community

claude-aillm-benchmarking

Claude Effort Levels Benchmarked: Low vs Max on Real Tasks

PxPipe cuts AI inference costs 60% by converting code to images

Glass Box vs. Black Box: Debugging Your Data Layer at 2 AM

Zero-Downtime NestJS Deployments on DigitalOcean with GitLab CI/CD and PM2