Vercel Adds GLM 5.2 Fast Model to AI Gateway with 2x Throughput

By Meridian48 News Desk · Summarised from Vercel · June 24, 2026

Vercel's AI Gateway now supports GLM 5.2 Fast, served via Wafer, which Vercel claims delivers 2x the throughput of other providers. Reported benchmarks show 170+ tokens per second (tok/s) for small contexts and 200+ tok/s for large contexts. The gateway offers unified API access, cost tracking, and zero data retention, with no markup on pricing.
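To put the quoted throughput figures in concrete terms, here is a rough back-of-the-envelope sketch in Python. It treats "170+" and "200+" as lower bounds, assumes a steady decode rate, ignores time to first token, and reads "2x throughput" as meaning a comparison provider runs at half the rate. The 2,000-token response length is a hypothetical value chosen only for illustration; none of these assumptions come from Vercel's announcement.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Seconds to stream `output_tokens` at a steady rate (ignores time to first token)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Figures quoted in the summary, treated here as lower bounds.
SMALL_CONTEXT_TPS = 170.0
LARGE_CONTEXT_TPS = 200.0

# Hypothetical response length, for illustration only.
OUTPUT_TOKENS = 2_000

for label, tps in [("small context", SMALL_CONTEXT_TPS),
                   ("large context", LARGE_CONTEXT_TPS)]:
    fast = generation_seconds(OUTPUT_TOKENS, tps)
    # Assumption: a "2x slower" comparison provider runs at half the rate.
    baseline = generation_seconds(OUTPUT_TOKENS, tps / 2)
    print(f"{label}: {fast:.1f}s vs {baseline:.1f}s at half the rate")
```

Under these assumptions the saving scales linearly with output length, so short responses gain only a fraction of a second while long generations gain the most.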

Meridian48 take

The performance claims are notable, but real-world gains depend on workload patterns and whether Wafer's architecture justifies switching from existing providers.

Read the full reporting

GLM 5.2 Fast via Wafer now available on AI Gateway →

Vercel

ai-gateway, model-deployment
