Technical Deep Dive
DeepSeek V4 builds on its predecessor's Mixture-of-Experts (MoE) architecture but introduces several key innovations. The model reportedly employs a dynamic routing mechanism that cuts per-token computation by 15-20% relative to V3 while maintaining or improving accuracy on complex reasoning tasks. The architecture retains 256 experts with top-2 gating, and adds a learned 'expert affinity' matrix that lets the router predict which experts will be most useful for a given input without running full forward passes through them. This reportedly reduces latency by an average of 30% in production environments.
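The details of the affinity matrix are not public, but the mechanism described above can be sketched as follows. This is a minimal illustration, not DeepSeek's implementation: it assumes the affinity matrix is a single learned projection from a token's hidden state to per-expert scores, from which the router selects the top-2 experts and softmax-normalizes their gate weights.

```python
import numpy as np

rng = np.random.default_rng(0)

HIDDEN, NUM_EXPERTS, TOP_K = 64, 256, 2

# Hypothetical learned affinity matrix: maps a token's hidden state
# directly to per-expert affinity scores, so the router can rank
# experts without evaluating every expert's forward pass.
affinity = rng.standard_normal((HIDDEN, NUM_EXPERTS)) * 0.02

def route(hidden_state: np.ndarray, k: int = TOP_K):
    """Return indices and normalized gate weights of the top-k experts."""
    scores = hidden_state @ affinity                      # (NUM_EXPERTS,)
    top_idx = np.argpartition(scores, -k)[-k:]            # unordered top-k
    top_idx = top_idx[np.argsort(scores[top_idx])[::-1]]  # sort descending
    gates = np.exp(scores[top_idx] - scores[top_idx].max())
    gates /= gates.sum()                                  # softmax over the selected experts
    return top_idx, gates

token = rng.standard_normal(HIDDEN)
experts, gates = route(token)
```

The point of the trick is cost: ranking experts here is a single matrix-vector product rather than 256 expert evaluations, which is consistent with the latency reduction the report claims.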
On the training front, DeepSeek V4 was trained on a curated dataset of 18 trillion tokens, with a novel multi-stage curriculum that prioritizes high-quality synthetic data for reasoning and code generation. The model uses FP8 mixed-precision training across 10,000 NVIDIA H100 GPUs, achieving a training efficiency of 45% Model FLOPs Utilization (MFU), a significant improvement over V3's 38%. The team also introduced a new 'contrastive alignment' technique that fine-tunes the model to prefer responses that are not only accurate but also concise and actionable—a nod to the growing importance of user experience.
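DeepSeek has not published the loss used for 'contrastive alignment', but techniques in this family typically reduce to a pairwise preference loss: given a preferred response (accurate and concise) and a rejected one (e.g. accurate but verbose), push the model's reward for the preferred response above the rejected one. A generic Bradley-Terry-style sketch, with illustrative scores:

```python
import numpy as np

def pairwise_preference_loss(score_preferred: float, score_rejected: float) -> float:
    """-log(sigmoid(margin)): small when the preferred response
    already outscores the rejected one, large when it doesn't."""
    margin = score_preferred - score_rejected
    return float(-np.log(1.0 / (1.0 + np.exp(-margin))))

# Concise-and-accurate response outscoring a verbose one -> small loss;
# the inverted ordering is penalized heavily.
loss_good = pairwise_preference_loss(2.0, 0.5)
loss_bad = pairwise_preference_loss(0.5, 2.0)
```

Minimizing this loss over pairs labeled for both accuracy and conciseness would produce exactly the preference the article describes, including its failure mode: if the labels overvalue brevity, the model learns to trade completeness for it.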
For developers, DeepSeek has open-sourced several components on GitHub. The `deepseek-moe-routing` repository (now at 4,200 stars) provides the dynamic routing implementation, while `deepseek-contrastive-align` (1,800 stars) offers the alignment training code. These repos allow the community to replicate and build upon DeepSeek's efficiency gains.
| Benchmark | DeepSeek V3 | DeepSeek V4 | GPT-4o (latest) | Claude 3.5 Sonnet |
|---|---|---|---|---|
| MMLU (5-shot) | 86.4 | 88.1 | 88.7 | 88.3 |
| HumanEval (pass@1) | 72.5 | 78.3 | 80.2 | 79.6 |
| GSM8K (8-shot) | 89.0 | 92.4 | 92.0 | 91.8 |
| Latency (ms, 1k tokens) | 320 | 220 | 280 | 260 |
| Cost ($/1M tokens) | $0.48 | $0.35 | $5.00 | $3.00 |
Data Takeaway: DeepSeek V4 closes the gap with frontier models on key benchmarks while offering dramatically lower latency and cost. The ~31% latency reduction and 27% cost decrease matter more than the 1-2 point accuracy gains, underscoring that efficiency and user experience are now the battlegrounds.
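The takeaway's percentages follow directly from the benchmark table above:

```python
# Recomputing the deltas from the benchmark table.
v3_latency, v4_latency = 320, 220    # ms per 1k tokens
v3_cost, v4_cost = 0.48, 0.35        # $ per 1M tokens
gpt4o_cost = 5.00

latency_drop = (v3_latency - v4_latency) / v3_latency       # ~31% vs V3
cost_drop_vs_v3 = (v3_cost - v4_cost) / v3_cost             # ~27% vs V3
cost_drop_vs_gpt4o = (gpt4o_cost - v4_cost) / gpt4o_cost    # 93% vs GPT-4o

print(f"{latency_drop:.0%} {cost_drop_vs_v3:.0%} {cost_drop_vs_gpt4o:.0%}")
```

The 93% figure is the one cited later for GPT-4o migrations.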
Key Players & Case Studies
DeepSeek V4's release has immediate implications for several key players. OpenAI and Anthropic remain the benchmark setters, but their premium pricing is increasingly hard to justify as open-weight models like DeepSeek V4 approach parity. Meta's Llama 4, expected later this year, will face pressure to deliver not just performance but also ecosystem tools that make deployment seamless.
More interesting are the application-layer companies. Cursor, the AI-powered code editor, has already integrated DeepSeek V4 as an optional backend, citing its low latency for real-time code completion. Notion AI is testing DeepSeek V4 for its Q&A and summarization features, attracted by the 70% cost reduction compared to GPT-4o. Replit is exploring DeepSeek V4 for its Ghostwriter agent, emphasizing the model's strong code generation capabilities.
| Company/Product | Model Used Previously | Model Now (or testing) | Key Driver for Switch |
|---|---|---|---|
| Cursor | GPT-4o, Claude 3.5 | DeepSeek V4 (optional) | Latency (220ms vs 280ms) |
| Notion AI | GPT-4o | DeepSeek V4 (testing) | Cost ($0.35 vs $5.00 per 1M tokens) |
| Replit Ghostwriter | Codex, GPT-4 | DeepSeek V4 (testing) | Code generation accuracy (78.3% HumanEval) |
| Jasper AI | GPT-4, Claude | DeepSeek V4 (partial) | Multilingual fluency, cost |
Data Takeaway: The migration pattern is clear: application-layer companies are prioritizing cost and latency over marginal benchmark gains. DeepSeek V4's 93% cost reduction versus GPT-4o makes it irresistible for high-volume use cases, even if it trails by 0.6 points on MMLU.
Industry Impact & Market Dynamics
The power shift from model builders to model users is reshaping the AI industry's economics. Venture capital funding data reveals a clear trend: in Q1 2025, 62% of AI startup funding went to application-layer companies, up from 38% in Q1 2023. Over the same period, model-layer startups saw their share halve from 30% to 15%, while infrastructure and tooling edged up from 15% to 22%.
| Funding Category | Q1 2023 Share | Q1 2025 Share | Total Funding (Q1 2025) |
|---|---|---|---|
| Application Layer | 38% | 62% | $8.2B |
| Model Layer | 30% | 15% | $2.0B |
| Infrastructure/Tools | 15% | 22% | $2.9B |
| Other | 17% | 1% | $0.1B |
Data Takeaway: The market is voting with its dollars. Application-layer startups are attracting the majority of funding, reflecting the belief that value creation is moving up the stack. Model builders are being forced to compete on price and openness, while infrastructure providers (e.g., cloud platforms, vector databases) benefit from the increased deployment activity.
This shift has profound implications. The 'model-as-a-service' market is becoming commoditized, with margins compressing as open-weight models like DeepSeek V4 and Llama 3.1 offer near-frontier performance at a fraction of the cost. The real profits will accrue to companies that build sticky, user-centric products on top of these models—those that own the user relationship, the data flywheel, and the workflow integration.
Risks, Limitations & Open Questions
Despite its strengths, DeepSeek V4 is not without risks. The model's training data composition raises concerns about bias and safety alignment. While DeepSeek has published a technical report, independent audits are lacking. The contrastive alignment technique, while innovative, may introduce subtle biases toward conciseness over completeness, potentially missing nuanced context in sensitive applications like healthcare or legal advice.
Another open question is the sustainability of the open-weight model ecosystem. DeepSeek V4 is released under a permissive license, but the company's business model remains unclear. If DeepSeek pivots to a proprietary API model, the community that built on top of its open weights could be left stranded. This mirrors the tension seen with Mistral AI, which shifted from open-source to a more restrictive license after its Series B.
Finally, the 'saturation' thesis we advance is not universally accepted. Some researchers argue that scaling laws still hold and that we are merely in a temporary plateau before the next breakthrough (e.g., chain-of-thought reasoning at scale, or multimodal integration). If a new architecture or training paradigm emerges, the power could swing back to model builders. The risk for application-layer companies is over-investing in a specific model ecosystem that may become obsolete.
AINews Verdict & Predictions
DeepSeek V4 is a watershed moment, but not for the reasons most headlines will cite. It is not about a new benchmark record; it is about the confirmation that AI's center of gravity has shifted. The model is good enough to be useful, cheap enough to be ubiquitous, and open enough to be customized. That combination is a powder keg for the application layer.
Our predictions:
1. By Q4 2026, the majority of new AI startups will build on open-weight models like DeepSeek V4 or Llama 4, not on proprietary APIs. The cost advantage is simply too large to ignore.
2. The 'model wars' narrative will fade as the market realizes that multiple models can coexist and that differentiation comes from data, UX, and workflow integration. The winners will be companies like Notion, Cursor, and Replit that own the user interface, not the model.
3. We will see a wave of M&A as large enterprises acquire application-layer startups to gain AI capabilities, rather than building their own models. Expect Google, Microsoft, and Salesforce to be active buyers.
4. The next frontier will be 'model orchestration'—tools that intelligently route queries across multiple models (DeepSeek for code, Claude for safety, GPT-4o for creativity) to optimize for cost, latency, and quality. Startups like Portkey and Helicone are already positioning for this.
5. DeepSeek itself faces a strategic choice: embrace its role as an infrastructure provider and double down on openness, or try to move up the stack into applications. The latter would put it in direct competition with its own ecosystem—a risky move.
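The orchestration pattern in prediction 4 can be sketched in a few lines. The registry below is illustrative, with strengths and prices drawn loosely from the tables above (not real pricing or real router logic from Portkey or Helicone):

```python
# Hypothetical model registry; names, strengths, and prices are illustrative.
MODELS = {
    "deepseek-v4": {"strengths": {"code"},     "cost": 0.35, "latency_ms": 220},
    "claude-3.5":  {"strengths": {"safety"},   "cost": 3.00, "latency_ms": 260},
    "gpt-4o":      {"strengths": {"creative"}, "cost": 5.00, "latency_ms": 280},
}

def route_query(task: str, budget_per_1m: float) -> str:
    """Prefer a model whose strengths match the task and fits the budget;
    if none does, fall back to the cheapest in-budget model."""
    candidates = [
        name for name, m in MODELS.items()
        if task in m["strengths"] and m["cost"] <= budget_per_1m
    ]
    if not candidates:
        candidates = [n for n, m in MODELS.items() if m["cost"] <= budget_per_1m]
    return min(candidates, key=lambda n: MODELS[n]["cost"])
```

With a $1/1M-token budget, a code query routes to deepseek-v4; a creative query also falls back to deepseek-v4 because gpt-4o exceeds the budget. That fallback behavior is precisely the commoditization dynamic the section describes: when budgets bind, traffic drains to the cheapest adequate model.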
The crack that DeepSeek V4 has opened will not close. The power to define AI's value now belongs to those who use it, not those who build it. The industry must adapt, or be left behind.