Question 1

How often is this tracker updated?

Accepted Answer

We refresh the data whenever a provider changes their published pricing. Major providers (Anthropic, OpenAI, Google) tend to update quarterly; smaller players move more often. The last-updated date at the top of the page reflects the most recent verified change.

Question 2

What does input vs output pricing mean?

Accepted Answer

AI providers charge separately for tokens you send (input) and tokens the model returns (output). Output is typically 3 to 5 times more expensive than input. A typical chat workload has 5 to 10 times more input tokens than output, so input pricing usually dominates the bill.

Question 3

What is cached input pricing?

Accepted Answer

Most major providers offer a discount when you re-send a system prompt or document the model has seen recently. Cached input pricing applies to that re-used portion, typically at 10 percent of the standard input rate. For RAG and agent workloads, cached pricing can cut bills by 40 to 70 percent.

Question 4

Which model is actually cheapest?

Accepted Answer

It depends on the workload. For pure text at scale, DeepSeek V4 and Gemini 3 Flash Lite are the cheapest serious models on the market. For reasoning, DeepSeek R2 and o3 Mini are the lowest-cost. For vision, Gemini 3 Flash beats most competitors on $/MP. Use our cost calculator to estimate your specific workload.

Question 5

Why are the prices in PKR rounded?

Accepted Answer

We convert at a fixed reference USD/PKR rate and round to whole rupees for readability. Your real cost will track the prevailing interbank rate at the time your card is charged, which can move 1 to 3 percent in either direction.

				Cached /1M		Modalities
Gemini 3 Flash LiteFree tier	Google	$0.10	$0.40	$0.03	1M	TextVision	2026-01
Mistral Small 3Free tier	Mistral	$0.20	$0.60	—	128K	Text	2026-01
DeepSeek V4Free tier	DeepSeek	$0.27	$1.10	$0.07	128K	Text	2026-02
Grok 4 MiniFree tier	xAI	$0.30	$0.50	—	131K	Text	2026-03
Gemini 3 FlashFree tier	Google	$0.35	$1.40	$0.09	1M	TextVisionAudio inVideo	2026-01
DeepSeek R2 (reasoning)	DeepSeek	$0.55	$2.19	$0.14	64K	Text	2026-04
Claude 4.5 HaikuFree tier	Anthropic	$0.80	$4.00	$0.08	200K	TextVision	2025-10
o3 Mini (reasoning)	OpenAI	$1.10	$4.40	$0.55	200K	Text	2025-01
GPT-5 MiniFree tier	OpenAI	$1.50	$6.00	$0.15	200K	TextVision	2025-12
Qwen 3 MaxFree tier	Alibaba	$1.60	$6.40	$0.16	256K	TextVision	2026-02
GPT-4.1	OpenAI	$2.00	$8.00	$0.50	1M	TextVision	2025-04
Mistral Large 3	Mistral	$2.00	$6.00	—	256K	Text	2025-11
Claude 4.6 SonnetFree tier	Anthropic	$3.00	$15	$0.30	1M	TextVision	2026-02
Gemini 3 ProFree tier	Google	$3.50	$14	$0.88	2M	TextVisionAudio inVideo	2026-01
Llama 4 405B (Together)	Meta	$3.50	$3.50	—	128K	Text	2025-09
Grok 4Free tier	xAI	$5.00	$15	—	256K	TextVision	2026-03
GPT-5Free tier	OpenAI	$13	$50	$1.25	400K	TextVisionAudio inAudio out	2025-12
Claude 4.7 Opus	Anthropic	$15	$75	$1.50	1M	TextVision	2026-04

AI Model Pricing Tracker

How we keep this current

What this tracker covers (and doesn't)

Frequently asked questions

How often is this tracker updated?

What does input vs output pricing mean?

What is cached input pricing?

Which model is actually cheapest?

Why are the prices in PKR rounded?

Related on Meridian48

The week in AI, tech business, and the tools worth knowing.