Por dentro da avaliação de US$ 300 bilhões da IA chinesa: A estratégia de dupla fórmula da Zhipu AI e MiniMax

The Chinese AI landscape is witnessing an unprecedented valuation event, with Zhipu AI and MiniMax collectively commanding a market capitalization nearing $300 billion. This figure is not merely a financial milestone but a reflection of a strategic bet on two distinct yet intertwined operational formulas. The first is the technical scaling formula, a relentless pursuit of model size, data volume, and computational power, guided by established scaling laws. This has propelled both companies to the forefront of complex reasoning and multimodal generation capabilities. The second is the commercial realization formula, focused on translating these frontier capabilities into scalable products and clear revenue streams, moving from pure research to enterprise agents, video generation tools, and vertical industry solutions.

The significance of this dual-track approach lies in its attempt to bridge the notorious gap between AI research breakthroughs and sustainable business models. While the technical formula has earned them market prestige and investor enthusiasm, the commercial formula will ultimately determine their long-term viability. Their trajectories now represent a critical test case for the entire generative AI sector: can the exponential costs of scaling be matched by equally exponential value creation for customers? The next 24 months will be decisive, as both companies must demonstrate that their technological lead can be converted into defensible economic moats and profitable growth, moving beyond valuation hype to establish enduring industrial leadership.

Technical Deep Dive

The core of Zhipu and MiniMax's technical ambition is a rigorous adherence to and expansion of the scaling laws first popularized by researchers like OpenAI. However, their implementations reveal distinct architectural philosophies and specialization areas.

Zhipu AI's GLM Architecture: Zhipu's foundational model series, GLM (General Language Model), employs a unique bidirectional autoregressive pre-training framework. Unlike the pure decoder-only architecture of GPT models, GLM combines autoregressive blank infilling with bidirectional attention. This hybrid approach, detailed in their seminal paper "GLM-130B: An Open Bilingual Pre-trained Model," allows it to perform well on both generation and understanding tasks. Their recent flagship, GLM-4, is rumored to exceed 1 trillion parameters in its largest configuration, utilizing a Mixture of Experts (MoE) architecture for efficient inference. A key open-source component is the `FlagAI` toolkit, a comprehensive library for training and evaluating large models, which has garnered over 4,500 stars on GitHub. It supports multi-modal tasks and efficient fine-tuning, lowering the barrier for enterprise adoption.

MiniMax's Text-to-Everything Pipeline: MiniMax has distinguished itself with a vertically integrated stack focused on text-to-speech (TTS), text-to-image, and text-to-video. Their technical prowess is most visible in their video generation model, Vidu, which claims to generate 16-second, 1080p videos in a single pass. Technically, Vidu is built on a Diffusion Transformer (DiT) architecture but incorporates a unique U-ViT (U-Net integrated Vision Transformer) backbone for improved temporal consistency. Their speech model, `MiniMax TTS`, is notable for its emotional granularity and low-latency synthesis, powering their conversational AI products. While less open-source than Zhipu, MiniMax's technical publications emphasize efficiency in tokenization and latent space manipulation for multi-modal alignment.

Benchmark Performance:
| Model | Company | MMLU (5-shot) | MATH | GPQA (Diamond) | Video Gen (VBench Avg.) |
|---|---|---|---|---|---|
| GLM-4 Ultra | Zhipu AI | 85.7 | 60.1 | 45.2 | N/A |
| abab 6.5 (Text) | MiniMax | 84.3 | 58.7 | 42.8 | N/A |
| Vidu (Video) | MiniMax | N/A | N/A | N/A | 78.5 |
| GPT-4 Turbo | OpenAI | 87.3 | 68.2 | 50.1 | N/A |
| Claude 3 Opus | Anthropic | 86.8 | 66.1 | 48.9 | N/A |

*Data Takeaway:* The table shows Zhipu and MiniMax's text models are competitive with global leaders on knowledge and reasoning benchmarks, though a measurable gap remains on the most challenging tasks like GPQA. MiniMax's Vidu demonstrates a clear lead in a specific video generation benchmark, highlighting their targeted technical differentiation.

Key Players & Case Studies

Zhipu AI: The Enterprise System Integrator. Led by CEO Zhang Peng, a former vice president at Tsinghua University's Institute for AI Industry Research, Zhipu has cultivated deep ties with state-owned enterprises, government bodies, and traditional industries. Their commercialization strategy is embodied in `ChatGLM` for conversational AI and `CodeGeeX` for code generation, but the real revenue engine is their enterprise platform. They have deployed customized large models for industrial giants like China Petroleum for predictive maintenance and supply chain optimization, and for major banks like ICBC for risk assessment and automated reporting. Their approach is to become the AI "brain" for digitizing entire sectors.

MiniMax: The Consumer-Facing Experience Builder. Co-founded by former SenseTime executive Yan Junjie, MiniMax has pursued a dual-path strategy. On one side, they power the AI companion app `Xingye` (a major social phenomenon in China), and on the other, they license their audio and video generation APIs to gaming and entertainment companies. A notable case is their partnership with NetEase, where MiniMax's TTS and voice cloning technology are integrated into several mobile games to generate dynamic, real-time character dialogue, drastically reducing voice acting costs and enabling personalized narratives.

Product Strategy Comparison:
| Aspect | Zhipu AI | MiniMax |
|---|---|---|
| Primary Market | Enterprise & Government | Consumer Entertainment & Gaming |
| Flagship Product | GLM Enterprise Suite | Xingye (AI Companion) / Vidu API |
| Monetization Model | Large-scale B2B contracts, SaaS fees | In-app purchases (C2C), API calls (B2B) |
| Key Partnership Example | China Petroleum, ICBC | NetEase, Douyin (TikTok) content creators |
| Technical Branding | "Reliable, Secure, Industrial-Grade" | "Emotional, Creative, Immersive" |

*Data Takeaway:* The companies have successfully carved out divergent commercial niches. Zhipu leverages its academic pedigree and robust models for high-stakes enterprise problems, while MiniMax capitalizes on emotional resonance and creative generation for mass-market engagement. This divergence reduces direct competition and allows both to thrive in separate segments of the vast AI market.

Industry Impact & Market Dynamics

The $300B valuation is a seismic event that reshapes capital flows and competitive expectations in global AI. It signals investor belief that Chinese AI champions can not only replicate but also monetize foundational model technology at a scale rivaling their U.S. counterparts.

Market Positioning & Funding: Both companies have completed massive funding rounds. Zhipu AI's Series B in late 2023 was reported at over $400 million, led by Alibaba Cloud and Shanghai Guosheng Group, valuing the company above $15 billion. MiniMax's $600 million round in early 2024, led by Alibaba and HongShan (formerly Sequoia China), pushed its valuation past $25 billion. These figures, while substantial, are a fraction of the implied public market valuation, indicating expectations of hyper-growth.

Generative AI Market in China (Projections):
| Segment | 2023 Market Size | 2027 Projection | CAGR | Key Drivers |
|---|---|---|---|---|
| Enterprise Solutions | $1.2B | $8.5B | 63% | Digital transformation mandates, process automation |
| AI Content Creation | $0.8B | $5.2B | 60% | Short-video platforms, gaming, advertising |
| AI Companions & Social | $0.5B | $3.8B | 66% | Social isolation, demand for interactive entertainment |
| Developer Tools & APIs | $0.7B | $4.1B | 55% | Proliferation of AI-native apps |

*Data Takeaway:* The projected market growth is explosive, justifying aggressive investment. Zhipu is poised to capture the largest slice (Enterprise Solutions), while MiniMax dominates in AI Content Creation and Companions. The success of both depends on these projections materializing.

The valuation also pressures other Chinese AI firms like Baidu (Ernie), Alibaba (Qwen), and iFlytek to accelerate commercialization or risk being perceived as purely research-oriented. It has triggered a land grab for AI talent and computing resources, with both companies securing long-term contracts for NVIDIA H800/A800 GPUs and investing heavily in domestic alternatives like Huawei's Ascend chips.

Risks, Limitations & Open Questions

1. The Unsustainable Cost of Scaling: The technical formula is a capital furnace. Training a trillion-parameter model can cost over $100 million in compute alone. The question is whether incremental improvements in benchmark scores translate to proportional increases in customer willingness to pay. The risk is a classic "red queen" race: running faster (spending more) just to stay in place.

2. Commercialization Friction: Enterprise sales cycles are long, and integration into legacy systems is fraught with challenges. Zhipu's government and SOE projects, while prestigious, may not be as profitable or scalable as anticipated. For MiniMax, the consumer entertainment market is fickle; the novelty of AI companions or generated video could wear off, or regulatory crackdowns on content could abruptly alter the landscape.

3. Geopolitical and Supply Chain Vulnerabilities: Both companies' scaling ambitions are tethered to access to advanced semiconductors. While stockpiling and domestic alternatives provide a buffer, a severe escalation in U.S. export controls could throttle their ability to train next-generation models, causing a technical decoupling from global frontier AI.

4. The "Two Formulas" Tension: The core organizational risk is internal conflict between research teams pushing for larger, more expensive models and product teams demanding stable, cost-effective, and debuggable systems for customers. Managing this tension is a leadership challenge of the highest order.

5. Open Question on True Moat: Is the moat based on proprietary data, unique architecture, or simply first-mover advantage and capital? Chinese internet giants (Alibaba, Tencent) possess vastly more user data and distribution channels. If they decide to compete aggressively in AI, they could leverage these assets to challenge both Zhipu and MiniMax.

AINews Verdict & Predictions

Our editorial judgment is that the $300B valuation is a bold, forward-looking bet on the transformative potential of AI in China's economy, but it incorporates a significant premium for execution risk. It is not merely a bubble but a pricing-in of a dominant market share in the world's second-largest economy. However, the current valuation assumes near-flawless execution of both formulas.

Specific Predictions:

1. Within 12 months: One of the two companies will launch a "foundational model as a service" specifically for a vertical industry (e.g., pharmaceuticals or materials science), moving beyond generic models to domain-specific ones that command higher prices and create stronger lock-in. We predict Zhipu will be first, leveraging its academic networks.

2. By 2026: The technical scaling race will visibly plateau for models above 10 trillion parameters due to diminishing returns and unsustainable cost. The focus will shift decisively to algorithmic efficiency, with both companies publishing breakthroughs in model compression and training techniques. The open-source `FlagAI` project will see a major version focused on inference optimization.

3. Commercial Consolidation: MiniMax will either acquire or deeply merge with a major gaming studio or short-video platform to secure an uncontested distribution channel for its generative technology. Zhipu will form a strategic joint venture with a top-tier cloud provider (likely not Alibaba, perhaps Tencent Cloud) to bundle its models with infrastructure.

4. Valuation Reality Check: Within 18-24 months, as both companies approach potential IPOs, the market will apply stricter scrutiny on unit economics and gross margins. The valuation may contract unless they can demonstrate a clear path to profitability, not just revenue growth. The companies that successfully bundle their AI into mission-critical, ROI-positive workflows (Zhipu's likely path) will fare better than those reliant on discretionary consumer spending (MiniMax's risk).

Final Verdict: The dual-formula bet is the correct strategic choice, but the difficulty is vastly underestimated. Zhipu AI holds a marginally stronger position due to the more predictable and policy-supported nature of enterprise and government demand. MiniMax's path, while potentially more viral and culturally impactful, is riskier. The true test will be their ability to innovate not just in AI models, but in the less glamorous domains of sales, integration, and business model design. The next phase is not about building bigger models, but about building indispensable businesses around them.

常见问题

这次公司发布“Inside China's $300B AI Valuation: Zhipu AI and MiniMax's Dual-Formula Strategy”主要讲了什么？

The Chinese AI landscape is witnessing an unprecedented valuation event, with Zhipu AI and MiniMax collectively commanding a market capitalization nearing $300 billion. This figure…

从“Zhipu AI GLM-4 enterprise pricing vs OpenAI”看，这家公司的这次发布为什么值得关注？

The core of Zhipu and MiniMax's technical ambition is a rigorous adherence to and expansion of the scaling laws first popularized by researchers like OpenAI. However, their implementations reveal distinct architectural p…

围绕“MiniMax Vidu video generation API cost per second”，这次发布可能带来哪些后续影响？

后续通常要继续观察用户增长、产品渗透率、生态合作、竞品应对以及资本市场和开发者社区的反馈。