DeepSeek offer 75% discount on new models

April 28, 2026

DeepSeek has officially ignited a massive “AI price war” by announcing a 75% discount on its newly released flagship model, DeepSeek-V4-Pro, on Monday, April 27, 2026.

This aggressive promotional period runs until May 5, 2026, and positions DeepSeek as the most cost-effective frontier-level intelligence provider in the world, significantly undercutting Western rivals like OpenAI, Anthropic, and Google.

1. The 75% Discount Breakdown

The promotional pricing applies specifically to the DeepSeek-V4-Pro API, which features a massive 1.6 trillion parameter Mixture-of-Experts (MoE) architecture.

Token Type	Standard Price (per 1M)	Discounted Price (per 1M)
Input (Cache Miss)	$1.74	$0.435
Input (Cache Hit)	$0.145	$0.036
Output	$3.48	$0.87

Comparison: At $0.87 per million output tokens, DeepSeek-V4-Pro is roughly 34x cheaper than OpenAI’s GPT-5.5 ($30/1M) and 28x cheaper than Anthropic’s Claude 4.7 Opus ($25/1M).

2. The “Deep-Water Bomb”: 90% Cache Reduction

Alongside the V4-Pro discount, DeepSeek launched a permanent structural change to its entire API suite (including the V4-Flash variant) by reducing the cost of input cache hits to one-tenth of their original price.

The Logic: This specifically targets AI Agents that frequently send similar or repeated long-context requests (like a coding agent reading the same codebase repeatedly).
Result for Flash: Before May 5, the input cache hit price for DeepSeek-V4-Flash has dropped to an astonishing $0.0029 per million tokens (approximately ¥0.02).

3. Strategic Rationale

Industry analysts view this as a “dimensional reduction strike” aimed at forcing developers to switch ecosystems.

Incentivizing Switchers: By making the world’s top-tier intelligence virtually “too cheap to ignore,” DeepSeek aims to capture the developer market for agentic frameworks like OpenClaw and Claude Code.
Domestic Chip Optimization: The V4 series is natively optimized for Huawei Ascend 950 chips. This move proves that high-performance AI can be deployed at scale without relying on Nvidia hardware, a critical pivot for the Chinese tech stack amidst export restrictions.
Market Pressure: Even affordable providers like Alibaba Cloud (Qwen3) are suddenly finding their pricing uncompetitive compared to the V4-Pro promotional rates.

4. Technical Specs: DeepSeek-V4 Series

Released in preview on April 24, 2026, the V4 series represents the first major architecture update since December.

Context Window: 1 million tokens for both Pro and Flash.
Architecture: Mixture-of-Experts (MoE).
- Pro: 1.6T total / 49B active parameters.
- Flash: 284B total / 13B active parameters.
License: MIT (Open Weights).

Lapaas Voice

Subscribe to newsletter

Startup

Artificial Intelligence

Funding

Case Studies

Lapaas Voice

Startup

Artificial Intelligence

Funding

Case Studies

Lapaas Voice

1. The 75% Discount Breakdown

2. The “Deep-Water Bomb”: 90% Cache Reduction

3. Strategic Rationale

4. Technical Specs: DeepSeek-V4 Series

LEAVE A REPLY Cancel reply

Lapaas Voice

About us

Latest Articles

Most Popular

Subscribe