DeepSeek offer 75% discount on new models

0
21

DeepSeek has officially ignited a massive “AI price war” by announcing a 75% discount on its newly released flagship model, DeepSeek-V4-Pro, on Monday, April 27, 2026.

This aggressive promotional period runs until May 5, 2026, and positions DeepSeek as the most cost-effective frontier-level intelligence provider in the world, significantly undercutting Western rivals like OpenAI, Anthropic, and Google.


1. The 75% Discount Breakdown

The promotional pricing applies specifically to the DeepSeek-V4-Pro API, which features a massive 1.6 trillion parameter Mixture-of-Experts (MoE) architecture.

Token TypeStandard Price (per 1M)Discounted Price (per 1M)
Input (Cache Miss)$1.74$0.435
Input (Cache Hit)$0.145$0.036
Output$3.48$0.87
  • Comparison: At $0.87 per million output tokens, DeepSeek-V4-Pro is roughly 34x cheaper than OpenAI’s GPT-5.5 ($30/1M) and 28x cheaper than Anthropic’s Claude 4.7 Opus ($25/1M).

2. The “Deep-Water Bomb”: 90% Cache Reduction

Alongside the V4-Pro discount, DeepSeek launched a permanent structural change to its entire API suite (including the V4-Flash variant) by reducing the cost of input cache hits to one-tenth of their original price.

  • The Logic: This specifically targets AI Agents that frequently send similar or repeated long-context requests (like a coding agent reading the same codebase repeatedly).
  • Result for Flash: Before May 5, the input cache hit price for DeepSeek-V4-Flash has dropped to an astonishing $0.0029 per million tokens (approximately ¥0.02).

3. Strategic Rationale

Industry analysts view this as a “dimensional reduction strike” aimed at forcing developers to switch ecosystems.

  • Incentivizing Switchers: By making the world’s top-tier intelligence virtually “too cheap to ignore,” DeepSeek aims to capture the developer market for agentic frameworks like OpenClaw and Claude Code.
  • Domestic Chip Optimization: The V4 series is natively optimized for Huawei Ascend 950 chips. This move proves that high-performance AI can be deployed at scale without relying on Nvidia hardware, a critical pivot for the Chinese tech stack amidst export restrictions.
  • Market Pressure: Even affordable providers like Alibaba Cloud (Qwen3) are suddenly finding their pricing uncompetitive compared to the V4-Pro promotional rates.

4. Technical Specs: DeepSeek-V4 Series

Released in preview on April 24, 2026, the V4 series represents the first major architecture update since December.

  • Context Window: 1 million tokens for both Pro and Flash.
  • Architecture: Mixture-of-Experts (MoE).
    • Pro: 1.6T total / 49B active parameters.
    • Flash: 284B total / 13B active parameters.
  • License: MIT (Open Weights).
Advertisement

LEAVE A REPLY

Please enter your comment!
Please enter your name here