酷家樂轉向空間智能:為物理世界構建AI基礎

April 2026
world modelembodied AIArchive: April 2026
作為中國「杭州六龍」中首家上市公司,酷家樂正將其核心戰略從設計軟體轉向空間智能基礎設施。公司計劃利用其旗艦平台酷家樂所擁有的大量結構化3D數據,為尚未...的領域構建基礎AI模型。
The article body is currently shown in English by default. You can generate the full version in this language on demand.

Koolab is undergoing a fundamental transformation that redefines its post-IPO trajectory. The company, best known for its Kujiale cloud-based interior design and rendering platform, is strategically repositioning its accumulated asset—billions of highly structured 3D spatial models of rooms, buildings, and furniture—as the training corpus for next-generation spatial AI models. This pivot moves the company beyond the Software-as-a-Service (SaaS) model for designers into the arena of foundational AI infrastructure. The core thesis is that while large language models (LLMs) have mastered text and generative video models like Sora are advancing visual synthesis, AI still lacks a robust, intuitive understanding of the three-dimensional physical world—its geometry, physics, semantics, and affordances. Koolab aims to fill this gap by building what it terms a 'Spatial Intelligence Base,' a platform that provides spatial reasoning, planning, and generative capabilities as a service. The potential applications are vast: enabling robots to navigate and manipulate in complex indoor environments, allowing architects and city planners to simulate real-world physics and human behavior, creating more persistent and interactive metaverse spaces, and powering next-gen smart home systems that understand context. This strategic bet recognizes that the path to more capable, general AI—and particularly embodied AI—requires models trained on the 'language' of space itself. Koolab's unique data moat, built over a decade in the home design vertical, gives it a distinct, though not uncontested, starting position in this emerging and critical frontier.

Technical Deep Dive

The technical ambition behind Koolab's spatial intelligence pivot is to construct a foundational model that translates raw 3D geometry into a rich, machine-understandable representation of space. This goes far beyond computer-aided design (CAD) or building information modeling (BIM). The goal is to create a Spatial World Model—a neural network that internalizes the rules, relationships, and possibilities inherent in physical environments.

Architecture & Data Pipeline: The process begins with Kujiale's proprietary data lake. Each user-created design is not just a collection of polygons; it's a structured graph of objects with attached metadata: dimensions, material properties, functional categories (e.g., 'dining chair,' 'kitchen sink'), manufacturer data, and even behavioral annotations (a sofa is for sitting, a door can be opened). This data is orders of magnitude richer and more semantically labeled than raw point clouds from LiDAR scans or photogrammetry. The pipeline involves:
1. Data Harmonization: Normalizing millions of designs into a consistent coordinate system and schema.
2. Graph Neural Network (GNN) Pre-training: Treating a room as a graph where nodes are objects (furniture, walls, appliances) and edges are spatial relationships ("on top of," "adjacent to," "facing"). Models like SceneGraphNet (an internal architecture likely inspired by open-source projects) learn to predict missing objects or flag physically impossible arrangements.
3. Physics-Informed Neural Networks (PINNs): Integrating basic laws of physics (gravity, collision, material stress) into the simulation layer, allowing the model to reason about stability, load-bearing, and human interaction.
4. Multimodal Fusion: Linking spatial graphs to textual descriptions (user design notes), images (rendered views), and potentially soon, video walkthroughs to ground language in 3D structure.

A key technical challenge is representation learning. How do you encode a 3D space for a transformer? Approaches include voxel grids, neural radiance fields (NeRF), and triplane representations (like those used by NVIDIA's GET3D). Koolab's early research papers suggest a hybrid approach, using a sparse voxel octree for efficient large-scale scene representation coupled with implicit neural fields for high-fidelity detail.

Relevant Open-Source Benchmarks & Repos: The field is rapidly evolving, with academia and big tech driving open-source innovation. Key repositories Koolab's engineers are certainly monitoring or contributing to include:
* ThreeDWorld (TDW): A high-performance, photorealistic simulation platform for interactive physical environments. It serves as a benchmark for embodied AI tasks.
* Habitat-Sim: Facebook AI Research's scalable 3D simulator for embodied agents, focused on navigation and interaction.
* OmniObject3D: A large-scale 3D object dataset with high-quality textured meshes, crucial for training detailed object recognition and manipulation models.

| Spatial Representation Method | Strengths | Weaknesses | Best For |
|---|---|---|---|
| Voxel Grid | Simple, easy for CNNs, explicit geometry | Memory intensive (cubic growth), low resolution | Early-stage scene parsing, collision detection |
| Point Cloud | Memory efficient, preserves exact geometry | Unordered, lacks topology, no surface information | Raw sensor data (LiDAR), registration |
| Mesh | Lightweight for rendering, explicit surfaces | Difficult to edit with AI, non-differentiable | Final asset output, game engines |
| Neural Radiance Field (NeRF) | Photorealistic view synthesis, continuous | Slow training/inference, no physical properties | Novel view synthesis, visualization |
| Implicit Neural Representation | Compact, continuous, high detail | Black-box, difficult to extract explicit rules | High-fidelity 3D reconstruction, generative AI |

Data Takeaway: The optimal technical stack for a spatial intelligence base is not a single representation but a layered system. Koolab likely employs meshes/voxels for symbolic reasoning and physics, and neural fields for generative tasks and realism, requiring significant engineering to bridge these representations.

Key Players & Case Studies

Koolab is not operating in a vacuum. The race to model the physical world is a central battleground for AI supremacy, attracting giants, well-funded startups, and academic consortia.

The Incumbent Titans:
* NVIDIA: The undisputed leader in the infrastructure layer with its Omniverse platform. Omniverse is a USD (Universal Scene Description)-based collaboration and simulation platform, positioning itself as the 'operating system' for 3D workflows. NVIDIA's strength is in the full-stack hardware (GPUs)-to-software pipeline and its focus on industrial digital twins. Its generative AI tools like GET3D and Magic3D directly compete with the generative aspect of Koolab's vision.
* Google DeepMind & Alphabet: With projects like RT-1 and RT-2 for robotics, and its vast SayCan project linking LLMs to physical skills, DeepMind's embodied AI research is foundational. Its access to data from everyday robotics (Everyday Robots) and its work on MuJoCo and other simulators make it a formidable force in defining what spatial understanding means for AI.
* Meta (Facebook): Heavily invested in the metaverse as a spatial computing future. Its Habitat and AI Habitat simulators, along with massive datasets of 3D-scanned interiors (Replica, ScanNet), provide a strong research foundation. Its Ego4D project, focused on first-person video understanding, is another angle on spatial AI.
* Apple: With the Vision Pro, Apple has bet its future on spatial computing. Its decades of expertise in ARKit, providing robust SLAM (Simultaneous Localization and Mapping) on mobile devices, gives it unparalleled real-world spatial data from millions of iPhones and iPads.

Startups & Vertical Challengers:
* Matterport: The closest public company comparable in mission. Matterport specializes in 3D capture of real spaces using specialized cameras, creating 'digital twins' primarily for real estate and tourism. Its data is photorealistic but less semantically structured than Kujiale's design-native data. Its recent AI features, like Dimensions AI for automatic measurement, show a parallel evolution.
* Unity & Unreal Engine (Epic Games): These game engine giants are the de facto standard for creating interactive 3D experiences. They are aggressively integrating AI tools (Unity Sentis, Unreal's AI frameworks) to generate assets and behaviors. Their strength is in real-time rendering and interactivity, a gap in Koolab's current toolchain.

| Company/Platform | Core Data Source | Primary Approach | Key Advantage | Weakness vs. Koolab |
|---|---|---|---|---|
| Koolab (Kujiale) | Human-designed 3D models | Structured graph + generative AI | Rich semantics, functional intent, clean data | Less photorealistic/real-world noisy data |
| Matterport | Photogrammetry/LiDAR scans | Volumetric capture + CV | Ground-truth realism, scale of real buildings | Lack of deep semantics, unstructured point clouds |
| NVIDIA Omniverse | Multi-source USD files | Simulation & collaboration platform | Physics fidelity, industry adoption, full stack | Less focus on high-level semantic reasoning |
| Meta Habitat | 3D Scans (ScanNet) | Embodied AI simulation | Focus on agent training, academic benchmark | Not a commercial platform, limited generative focus |

Data Takeaway: Koolab's competitive edge lies in the *intentionality* and *structure* of its data. While others capture *what is*, Kujiale's data contains *what could be* and *what was intended*, encoding human design logic and functional relationships that are absent from raw scans.

Industry Impact & Market Dynamics

This strategic pivot has ripple effects across multiple trillion-dollar industries.

1. Redefining the AI Stack: The dominant AI stack today is built on text (LLMs) and 2D images. Koolab's bet asserts that a Spatial Foundation Model layer is missing and will be equally critical. This creates a new platform opportunity between raw infrastructure (cloud GPUs) and vertical applications (robotics software). If successful, Koolab transitions from a B2B SaaS company to a B2B2X AI platform, licensing its spatial understanding API to others.

2. Accelerating Embodied AI and Robotics: The 'sim-to-real' transfer problem—training a robot in simulation and deploying it in reality—is massively hampered by unrealistic simulators. A high-fidelity spatial intelligence base that understands clutter, material properties, and everyday object interaction could drastically cut down real-world training time and cost for companies like Boston Dynamics, Figure AI, and countless logistics robotics firms.

3. Transforming Real Estate, Construction, and Facilities Management: The AEC (Architecture, Engineering, Construction) industry is ripe for AI disruption. A model that can generate compliant building layouts, simulate traffic flow, predict maintenance issues, or plan renovations automatically moves from drawing tool to co-pilot. This expands Koolab's TAM (Total Addressable Market) far beyond home decor.

Market Data & Projections:

| Spatial AI & Digital Twin Market Segment | 2024 Estimated Size (USD) | Projected CAGR (2024-2030) | Key Drivers |
|---|---|---|---|
| 3D Mapping & Modeling | $6.5 Billion | 18.5% | Urban planning, autonomous vehicles |
| Digital Twin (Overall) | $73 Billion | 39.5% | Manufacturing, smart cities, IoT integration |
| Metaverse (Infrastructure) | $28 Billion | 34.5% | Virtual events, spatial computing |
| AI in AEC | $2.1 Billion | 26.8% | Labor shortages, sustainability mandates |
| Embodied AI Software | $0.8 Billion | 52.3% (est.) | Advancements in robotics, cheaper sensors |

Data Takeaway: The markets Koolab is targeting are not just large, but are among the fastest-growing in tech. The explosive CAGR for Embodied AI Software, though from a smaller base, indicates where the most intense innovation and venture capital will flow, validating the strategic timing of Koolab's pivot.

Risks, Limitations & Open Questions

Despite the compelling vision, the path is fraught with technical, commercial, and ethical challenges.

Technical Risks:
* Data Bias: Kujiale's data is overwhelmingly residential and from a specific cultural (primarily Chinese) and socioeconomic context. A model trained solely on aspirational home designs may fail to understand factories, hospitals, public schools, or informal settlements, leading to serious performance gaps in broader applications.
* The Reality Gap: Designed spaces are idealized. They lack the wear, tear, clutter, and unpredictable chaos of real inhabited spaces. Bridging this 'intent-reality' gap is as hard as the sim-to-real gap.
* Scale of Computation: Training world models is arguably more complex than training LLMs, as it must integrate geometry, physics, and semantics. The computational cost may be prohibitive without a breakthrough in efficiency.

Commercial & Strategic Risks:
* Platform Lock-In Failure: The shift from SaaS to Platform is historically difficult. It requires convincing former customers (designers) to contribute to a data moat that will ultimately serve their potential competitors (AI automation that could replace some design tasks).
* Competition from Integrated Giants: NVIDIA, Google, and Apple can subsidize their spatial AI platforms with profits from other divisions. Koolab, as a pure-play, must achieve rapid commercialization and scale to compete.
* Defensibility: While the data is unique, it is not perfectly exclusive. Competitors could generate synthetic design data or form partnerships with other large 3D content repositories (e.g., SketchUp's warehouse, gaming asset stores).

Ethical & Societal Questions:
* Surveillance & Control: A perfect model of built environments is a powerful tool for surveillance, predictive policing, and social control. The ethics of who can access such a model and for what purpose must be addressed proactively.
* Labor Displacement: Automating spatial reasoning and design has profound implications for architects, interior designers, urban planners, and related trades.
* Reality Fidelity & Responsibility: If a robot trained in a Koolab-powered simulator causes damage in the real world due to a simulation flaw, where does liability lie?

AINews Verdict & Predictions

Koolab's pivot is a bold, necessary, and strategically astute gamble that correctly identifies a critical bottleneck in AI's evolution. However, success is far from guaranteed.

Verdict: This is a high-risk, high-reward move that transforms Koolab from a solid, growing SaaS company into a speculative, frontier-tech platform contender. The technical vision is coherent and addresses a genuine need in the AI ecosystem. Their unique data asset provides a credible, though temporary, moat. The primary risk is not the vision itself, but execution against capital-rich, full-stack competitors.

Predictions:
1. Within 18 months, Koolab will release a limited beta of a spatial reasoning API, initially targeted at academic robotics labs and boutique metaverse developers, focusing on tasks like furniture arrangement generation and simple navigation planning.
2. By 2026, we will see the first major strategic partnership or joint venture between Koolab and a Chinese robotics or autonomous vehicle company (e.g., DJI, Siasun) to jointly develop simulation environments, validating the platform's industrial utility.
3. The key acquisition target for Koolab will not be another software firm, but a company specializing in real-world 3D capture or computer vision to blend its intentional design data with real-world scan data, mitigating the reality gap. A company like **** (a Chinese Matterport equivalent) could be a logical target.
4. The most likely outcome in 5 years is not that Koolab becomes *the* spatial OS, but that it becomes a *critical specialized layer* within broader ecosystems. We predict it will be a preferred provider of high-quality, semantically rich 3D training data and specialized foundation models for the Chinese domestic market's embodied AI and metaverse efforts, potentially becoming an acquisition target itself for a larger player (e.g., Tencent, Baidu) seeking to bolster its AI infrastructure.

What to Watch Next: Monitor Koolab's research paper output at conferences like CVPR, NeurIPS, and ICRA. Listen for announcements of partnerships with robotics firms. Most critically, watch its R&D and capital expenditure spending post-pivot—a sustained increase will signal true commitment, while a pullback would indicate a retreat to its SaaS core. The journey from rendering a beautiful virtual room to teaching a robot to clean a real one has begun, and Koolab has just placed a defining bet on its own role in that future.

Related topics

world model21 related articlesembodied AI100 related articles

Archive

April 20262134 published articles

Further Reading

從步履蹣跚到馬拉松冠軍:人形機器人如何在一年內實現耐力突破在一項驚人的快速進展展示中,一款尖端人形機器人成功跑完了全程馬拉松距離。這項成就,距離同類機器人還在為基本移動問題掙扎僅過去一年,標誌著機器人耐力發生了根本性的階段轉變。OpenAI的Sora轉向:從影片生成器到世界模型的基礎OpenAI近期對其Sora影片生成模型的戰略調整,遠不止於產品優化。這是一次深思熟慮的轉向,從打造獨立工具轉為構建未來世界模型的視覺核心。此舉標誌著OpenAI旨在成為基礎設施的雄心壯志。Tencent's Strategic Pivot: How AGI is Forcing a Complete Rewrite of Its Investment PlaybookAn in-depth AINews analysis reveals Tencent is undergoing a fundamental strategic shift, moving away from its legacy inv貝點數位推出 Spark AI Cloud 2.0:為城市與產業打造全新AI作業系統貝點數位正式發布 Spark AI Cloud 2.0,超越基礎AI服務,提出一個面向城市與工業園區的綜合性『AI系統工程』平台。這標誌著從提供單點解決方案,轉向構建一個能夠自主運行的AI驅動作業系統的根本性轉變。

常见问题

这次公司发布“Koolab's Pivot to Spatial Intelligence: Building AI's Foundation for the Physical World”主要讲了什么?

Koolab is undergoing a fundamental transformation that redefines its post-IPO trajectory. The company, best known for its Kujiale cloud-based interior design and rendering platform…

从“Koolab Kujiale spatial intelligence vs NVIDIA Omniverse”看,这家公司的这次发布为什么值得关注?

The technical ambition behind Koolab's spatial intelligence pivot is to construct a foundational model that translates raw 3D geometry into a rich, machine-understandable representation of space. This goes far beyond com…

围绕“How does Koolab's 3D data train AI for robots”,这次发布可能带来哪些后续影响?

后续通常要继续观察用户增长、产品渗透率、生态合作、竞品应对以及资本市场和开发者社区的反馈。