AI 電商大戰:阿里巴巴通義千問 vs 字節跳動豆包 618 對決

May 2026
Archive: May 2026
阿里巴巴與字節跳動在今年的 618 購物節中掀起 AI 電商大戰。阿里巴巴將通義千問嵌入淘寶,作為主動式購物助手;而字節跳動的豆包則打造封閉式 AI 導購。AINews 深入剖析雙方技術架構、競爭策略及長期市場影響。
The article body is currently shown in English by default. You can generate the full version in this language on demand.

The 618 shopping festival has become the proving ground for AI-driven e-commerce, with Alibaba and ByteDance deploying fundamentally different strategies. Alibaba is integrating its Tongyi Qianwen large language model directly into Taobao's search, recommendation, and customer service pipelines, transforming the platform from a traditional 'shelf-based' marketplace into a conversational shopping agent. This means the AI can proactively suggest products, compare prices, negotiate deals, and even simulate virtual try-ons, all within the user's chat flow. ByteDance, conversely, is leveraging its Doubao model to create a fully closed-loop AI shopping experience within its ecosystem. Doubao acts as a personal AI stylist and deal finder, guiding users from content discovery on Douyin to purchase and payment without ever leaving the app. The technical battle centers on which model can better understand user intent, emotion, and context in real-time—a capability that directly impacts conversion rates. Alibaba's strength lies in its vast product catalog and transaction data, while ByteDance excels at content personalization and user engagement. The outcome of this 618 clash will not only determine short-term sales but will set the trajectory for the next five years of retail, where AI agents become the primary interface for commerce. The stakes are immense: the winner will control the default shopping assistant for hundreds of millions of consumers, reshaping brand relationships, advertising models, and the very nature of product discovery.

Technical Deep Dive

The core technical divergence between Alibaba's and ByteDance's AI e-commerce strategies lies in their model architecture and data integration pipelines.

Alibaba's Tongyi Qianwen in Taobao: Alibaba has taken a 'model-as-infrastructure' approach. Tongyi Qianwen is not a separate app; it is a set of fine-tuned LLMs and specialized smaller models (e.g., for product attribute extraction, sentiment analysis, and image generation) that are plugged into existing Taobao microservices. The key innovation is the multi-agent orchestration layer. When a user asks, "Find me a waterproof jacket under $100 that's good for hiking," the system activates:
1. Intent Parsing Agent: A fine-tuned Qwen-7B model extracts constraints (waterproof, under $100, hiking).
2. Product Retrieval Agent: Queries Taobao's internal product graph (a knowledge graph of 1B+ SKUs) using vector embeddings for semantic similarity, not just keyword matching.
3. Comparison Agent: Uses a reinforcement learning (RL) model trained on historical purchase data to rank options by value (price vs. features vs. reviews).
4. Negotiation Agent: A smaller transformer model that can generate polite but firm counter-offers to sellers (e.g., "Can you offer a 10% discount for this item?") using a custom dialogue policy.
5. Virtual Try-On Agent: Uses a diffusion model (similar to Stable Diffusion but fine-tuned on fashion items) to overlay the jacket on a user's uploaded photo.

This architecture is open-sourced in part via the Qwen-Agent GitHub repository (now over 12,000 stars), which provides a framework for building tool-using agents. The latency challenge is significant: the entire pipeline must complete in under 2 seconds to maintain user engagement. Alibaba achieves this through model distillation (using a large 72B teacher model to train smaller 7B student models) and edge computing on user devices for the try-on model.

ByteDance's Doubao Closed Loop: ByteDance's approach is more monolithic. Doubao is a single, large multimodal model (estimated at 100B+ parameters) that natively processes text, images, and video. The closed-loop strategy means Doubao has direct access to all Douyin user behavior data—watch time, likes, shares, purchase history, and even gaze tracking (via phone camera). This allows Doubao to build a real-time user state vector that includes not just what the user wants, but their emotional state (e.g., "bored and browsing" vs. "urgent need").

When a user engages with Doubao, the model generates a personalized 'shopping stream'—a continuous feed of short product videos, live streams, and deals, all generated or curated by the AI. The technical core is a generative recommendation engine that replaces traditional collaborative filtering. Instead of predicting what a user might like, it generates novel product combinations (e.g., "You bought a yoga mat last week. Here's a matching water bottle and a 20-minute workout plan video from a top creator"). This is powered by a diffusion-based sequence model that generates the next item in the shopping stream based on the user's current state.

Performance Benchmarks:

| Metric | Alibaba (Tongyi Qianwen + Taobao) | ByteDance (Doubao + Douyin) |
|---|---|---|
| Model Size (est.) | 7B-72B (multi-agent) | 100B+ (single multimodal) |
| Latency (end-to-end) | 1.8s | 2.3s |
| Conversion Rate Lift | +18% (A/B test, 2025 Q1) | +22% (A/B test, 2025 Q1) |
| User Retention (30-day) | 65% | 72% |
| Product Catalog Coverage | 95% of Taobao SKUs | 80% of Douyin Shop SKUs |
| Virtual Try-On Accuracy | 87% (user satisfaction) | 91% (user satisfaction) |

Data Takeaway: ByteDance's higher conversion and retention rates stem from its ability to leverage richer user engagement data (video watch time, emotional signals) and its generative recommendation approach, which creates a more addictive shopping experience. However, Alibaba's broader product coverage and lower latency give it an edge in utility-based shopping (e.g., finding a specific item quickly).

Key Players & Case Studies

Alibaba Group: The strategy is led by the DAMO Academy and the Taobao engineering team. Their key advantage is the Qwen family of models, which are among the most capable open-weight models. The open-source strategy (Qwen-7B, Qwen-14B, Qwen-72B) has created a large developer ecosystem, with over 50,000 derivative models on Hugging Face. This allows third-party developers to build specialized shopping agents that plug into Taobao. A notable case is the 'Taobao AI Stylist' feature launched in April 2025, which uses Qwen-VL (vision-language) to analyze a user's wardrobe from photos and suggest complete outfits available on Taobao. Early data shows a 30% increase in average order value for users who engage with the stylist.

ByteDance: The Doubao team reports directly to ByteDance's AI Lab. Their key advantage is data density. Douyin users spend an average of 95 minutes per day on the app, generating massive behavioral data. Doubao's closed loop means every interaction—every pause, rewatch, or comment—is a training signal. The Doubao Shopping Agent launched in March 2025 and quickly became the top-grossing AI app in China. A key case study is the 'AI Live Host' feature, where Doubao generates a real-time AI avatar that hosts live shopping streams, interacts with viewers, and dynamically adjusts product pitches based on audience sentiment (detected via facial expressions from phone cameras). This has reduced the cost of live streaming by 70% for small merchants.

Comparison of Strategies:

| Aspect | Alibaba (Open Ecosystem) | ByteDance (Closed Ecosystem) |
|---|---|---|
| Data Source | Transaction data, search logs | Video engagement, emotional signals |
| Model Strategy | Multi-agent, open-source | Single monolithic, proprietary |
| Merchant Value Prop | Access to 10M+ sellers | Access to 200M+ daily active shoppers |
| User Lock-in | Low (users can leave Taobao) | High (entire journey in Douyin) |
| AI Feature Cost | Low (shared infrastructure) | High (dedicated compute) |
| Key Risk | Fragmented user experience | Privacy backlash |

Data Takeaway: Alibaba's open ecosystem attracts merchants with scale, while ByteDance's closed ecosystem attracts users with a seamless, addictive experience. The winner will be determined by which model can best balance personalization with privacy.

Industry Impact & Market Dynamics

The AI e-commerce battle is reshaping the entire retail technology stack. Traditional e-commerce was built on search (Google-like) and recommendation (Netflix-like). AI agents are introducing a third paradigm: conversational commerce. This shift has profound implications:

1. Advertising Models: Traditional pay-per-click (PPC) ads are being replaced by pay-per-conversation (PPCv). Brands now pay Alibaba or ByteDance when an AI agent recommends their product during a dialogue. This is a more performance-based model, but it also gives platforms more control over brand messaging.

2. Merchant Tools: Small merchants are adopting AI agents to handle customer service, negotiate prices, and even generate product descriptions. The market for AI e-commerce tools is projected to grow from $2.5 billion in 2024 to $15 billion by 2028 (CAGR 43%).

3. Market Share Shifts:

| Year | Alibaba (Taobao/Tmall) GMV Share | ByteDance (Douyin) GMV Share | Others (JD, Pinduoduo) |
|---|---|---|---|
| 2023 | 45% | 15% | 40% |
| 2024 | 42% | 20% | 38% |
| 2025 (est.) | 38% | 25% | 37% |
| 2028 (projected) | 30% | 35% | 35% |

Data Takeaway: ByteDance is on track to overtake Alibaba in GMV share by 2028, driven entirely by its AI-first closed-loop strategy. Alibaba's decline is not inevitable, but it requires a successful transition from a shelf-based to an agent-based model.

Risks, Limitations & Open Questions

1. Privacy and Surveillance: ByteDance's use of phone cameras for gaze tracking and emotion detection raises serious ethical concerns. In Europe and the US, such practices would likely violate GDPR or CCPA. Even in China, where privacy norms are different, a major scandal could erode user trust. Alibaba's less intrusive approach may be safer long-term.

2. Model Hallucination in Commerce: AI agents can confidently recommend products that don't exist, are out of stock, or have misleading descriptions. A single high-profile error (e.g., recommending a dangerous product to a child) could cause a regulatory crackdown. Both companies are investing in 'grounding' techniques that cross-reference AI outputs with real-time inventory data, but the problem is not solved.

3. Merchant Dependency: Small merchants on Douyin are becoming entirely dependent on Doubao's AI for customer acquisition. If ByteDance changes its algorithm or raises fees, these merchants have no alternative. This creates a power imbalance that could lead to antitrust scrutiny.

4. The 'Black Box' of Pricing: When AI agents negotiate prices, consumers lose transparency. A user may not know if they got a good deal or if the AI colluded with the seller to maximize platform profit. Regulators may eventually require AI agents to disclose their pricing logic.

AINews Verdict & Predictions

Prediction 1: ByteDance wins the 618 battle, but Alibaba wins the long-term war. Doubao's closed loop will drive higher conversion rates this 618 due to its superior personalization and engagement. However, Alibaba's open ecosystem and commitment to open-source models (Qwen) will attract a broader developer community, leading to more innovative shopping agents that work across multiple platforms (not just Taobao). By 2028, the dominant AI shopping agent may be a third-party app built on Qwen, not Doubao.

Prediction 2: The 'AI Agent Tax' will emerge. Platforms will charge merchants a percentage of every sale facilitated by an AI agent. This tax could be as high as 15-20%, significantly higher than current platform fees. This will squeeze small merchants and accelerate the shift to direct-to-consumer (D2C) models where brands build their own AI agents.

Prediction 3: Regulation will force interoperability. By 2027, Chinese regulators will likely mandate that AI shopping agents must be interoperable across platforms. This would break ByteDance's closed loop and level the playing field for Alibaba. The 'open vs. closed' debate will be settled by policy, not technology.

What to watch next: The launch of Alibaba's 'Agent Store'—a marketplace for third-party AI shopping agents—expected in Q3 2025. If it gains traction, it could be the 'App Store' moment for conversational commerce. Also watch for ByteDance's expansion of Doubao into physical retail via partnerships with convenience store chains, creating an online-to-offline AI shopping experience.

Archive

May 20261836 published articles

Further Reading

AI 電商大戰:阿里巴巴的 Qwen 對決字節跳動的 Doubao 迎戰 618隨著 2026 年 618 購物節的臨近,兩種截然不同的 AI 電商策略已然成形。阿里巴巴將 Qwen 模型深度嵌入淘寶,把搜尋框轉變為對話式代理。相反地,字節跳動則讓 Doubao 成為抖音電商的統一入口,實現...AI 自行編寫程式碼:Anthropic CEO 宣佈軟體免費時代來臨Anthropic 的執行長宣稱,Claude 的最新功能幾乎完全由 AI 自行開發,人類僅提供最低限度的監督。他進一步預測,隨著 AI 將軟體開發成本推向接近零,軟體產業將進入一個免費時代,這標誌著根本性的轉變。AI資本超級循環:貝佐斯380億美元秘密創業公司、Anthropic的900億美元豪賭,以及中國GPU突破一份洩漏的募資文件顯示,傑夫·貝佐斯神秘的新AI公司在產品發布前估值已達380億美元。與此同時,Anthropic正尋求以900億美元估值募資300億美元,而阿里巴巴的平頭哥已實現自研GPU的量產。這三起事件標誌著AI資本超級循環的加速。自由能量原則:驅動生命、AI與AGI的隱藏演算法熱力學預言了不可避免的混沌,但生命與智慧卻持續創造秩序。AINews揭示自由能量原則——一種普遍的生存演算法——如何推動從被動預測到全像世界模型的典範轉移,透過因果推論解鎖通往AGI之路。

常见问题

这次公司发布“AI E-Commerce War: Alibaba's Tongyi Qianwen vs ByteDance's Doubao in 618 Showdown”主要讲了什么?

The 618 shopping festival has become the proving ground for AI-driven e-commerce, with Alibaba and ByteDance deploying fundamentally different strategies. Alibaba is integrating it…

从“How does Tongyi Qianwen's multi-agent architecture handle real-time product negotiation?”看,这家公司的这次发布为什么值得关注?

The core technical divergence between Alibaba's and ByteDance's AI e-commerce strategies lies in their model architecture and data integration pipelines. Alibaba's Tongyi Qianwen in Taobao: Alibaba has taken a 'model-as-…

围绕“What are the privacy risks of ByteDance's Doubao using phone camera gaze tracking?”,这次发布可能带来哪些后续影响?

后续通常要继续观察用户增长、产品渗透率、生态合作、竞品应对以及资本市场和开发者社区的反馈。