AI · 2h ago

MiniMax M3: Open-Weight Model with 1M-Token Context and Sparse Attention

By Meridian48 News Desk · Summarised from DEV Community · June 24, 2026

MiniMax released M3, the first open-weight model combining 59% SWE-Bench Pro score, 1M-token context, and native multimodal input. Its MiniMax Sparse Attention reduces compute by 28.4x at 1M tokens by attending only to 2,048 relevant tokens per query. API pricing at $0.30/M input tokens undercuts rivals by 10-20x.

Meridian48 take

The sparse attention breakthrough is real, but self-reported benchmarks and restrictive licensing temper the open-weight promise.

Read the full reporting

MiniMax M3 Explained: The Sparse Attention Breakthrough →

DEV Community

minimax-m3sparse-attention

MiniMax M3: Open-Weight Model with 1M-Token Context and Sparse Attention

New Benchmark DiffusionBench Tests Generative Diffusion Transformers

Google's Gemini-3-Flash Model Prioritizes Speed Over Depth

Why an ISO 42001 course kept failing—and what it reveals about AI compliance