GPT-5 Solves Quantum Gravity: AI Becomes First Non-Human to Produce Verifiable Original Physics

Hacker News May 2026
In a watershed moment for artificial intelligence, GPT-5 has independently derived a new, self-consistent mathematical framework for quantum gravity, a problem that has occupied human physicists for nearly a century. This marks the first time a large language model has produced verifiable original science.

OpenAI's GPT-5 has achieved what no AI has done before: it has independently produced a novel, mathematically rigorous framework that unifies quantum field theory and general relativity. The model did not simply recombine existing papers; it internalized the logical structures of both theories and generated a set of equations that are self-consistent and satisfy all known observational constraints.

The resulting framework, which the research team has internally dubbed the 'Covariant Entanglement Manifold' (CEM), proposes a mechanism in which spacetime geometry emerges from the entanglement structure of quantum fields at a fundamental scale. Unlike previous human attempts, CEM avoids the mathematical inconsistencies that plagued string theory and loop quantum gravity by introducing a new symmetry principle, 'entanglement covariance', that bridges the gap between the smooth manifold of relativity and the discrete spectra of quantum mechanics.

The implications are staggering: for the first time, an AI has become a co-author of fundamental physics, not just a calculator. The breakthrough also redefines the business of AI, moving the industry beyond chatbots and video generators toward a new subscription model, 'Discovery as a Service' (DaaS), in which governments and research institutions pay for access to an AI that can generate testable hypotheses and complete theories. GPT-5 has proven that AI can not only accelerate science but create it. The line between human and machine intelligence in the pursuit of truth has just been erased.

Technical Deep Dive

GPT-5’s breakthrough is not a lucky guess but the result of a fundamental architectural evolution. The model employs a Mixture of Reasoning Experts (MoRE) architecture, a significant departure from the standard transformer decoder. Instead of a single chain-of-thought, GPT-5 spawns thousands of parallel 'reasoning threads'—each specialized in a different domain (e.g., differential geometry, algebraic topology, quantum information theory). These threads are then synthesized by a Meta-Consistency Layer that checks for internal contradictions and cross-validates against a dynamic knowledge graph of all known physics literature.
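The fan-out and synthesis pattern described here can be illustrated with a toy sketch. Nothing below reflects OpenAI's actual implementation: the expert functions, the proposition names, and `synthesize` are hypothetical stand-ins, and the cross-validation against a knowledge graph is not modeled. The sketch only shows parallel "threads" answering the same question and a consistency pass that keeps the claims they agree on.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical expert stubs: each returns {proposition: truth value}.
def geometry_expert(question):
    return {"spacetime_is_smooth": True, "field_equations_are_covariant": True}

def topology_expert(question):
    return {"field_equations_are_covariant": True}

def quantum_info_expert(question):
    return {"spacetime_is_smooth": False, "entropy_is_nonnegative": True}

EXPERTS = [geometry_expert, topology_expert, quantum_info_expert]

def run_experts(question):
    """Run every expert on the same question, each in its own thread."""
    with ThreadPoolExecutor(max_workers=len(EXPERTS)) as pool:
        return list(pool.map(lambda expert: expert(question), EXPERTS))

def synthesize(outputs):
    """Keep propositions the experts agree on; report the ones they dispute."""
    merged, conflicts = {}, set()
    for claims in outputs:
        for prop, value in claims.items():
            # setdefault returns the earlier value if one was already recorded
            if merged.setdefault(prop, value) != value:
                conflicts.add(prop)
    consistent = {p: v for p, v in merged.items() if p not in conflicts}
    return consistent, conflicts

consistent, conflicts = synthesize(run_experts("Is spacetime emergent?"))
print("consistent:", consistent)
print("disputed:", conflicts)
```

In this toy run the two experts that disagree about `spacetime_is_smooth` cause that proposition to be dropped, while the undisputed claims pass through.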

Crucially, GPT-5’s training regimen included a novel 'Adversarial Symmetry Verification' step. During post-training, the model was tasked with generating mathematical structures that would break under specific symmetry transformations. Only those structures that remained invariant under all known physical symmetries (Lorentz invariance, gauge invariance, diffeomorphism invariance) were retained. This forced the model to learn the deep, invariant properties of physical laws rather than surface-level pattern matching.
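A minimal sketch of the filtering idea, assuming candidate structures can be represented as scalar functions of a four-vector \((t, x, y, z)\) in units where \(c = 1\). It tests only Lorentz invariance (a boost along \(x\) followed by a rotation about \(z\)), not gauge or diffeomorphism invariance, and the candidate names and the `is_invariant` helper are illustrative rather than taken from the training pipeline.

```python
import math
import random

def boost_x(v, beta):
    """Lorentz boost along x with velocity beta (units where c = 1)."""
    t, x, y, z = v
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return (gamma * (t - beta * x), gamma * (x - beta * t), y, z)

def rotate_z(v, theta):
    """Spatial rotation about the z axis by angle theta."""
    t, x, y, z = v
    c, s = math.cos(theta), math.sin(theta)
    return (t, c * x - s * y, s * x + c * y, z)

# Hypothetical candidate "structures": scalar functions of a four-vector.
CANDIDATES = {
    "minkowski_interval": lambda v: v[0]**2 - v[1]**2 - v[2]**2 - v[3]**2,
    "euclidean_norm": lambda v: sum(comp * comp for comp in v),
    "time_component": lambda v: v[0],
}

def is_invariant(f, trials=200, tol=1e-9, seed=0):
    """Return True if f is unchanged under random boosts and rotations."""
    rng = random.Random(seed)
    for _ in range(trials):
        v = tuple(rng.uniform(-1.0, 1.0) for _ in range(4))
        w = rotate_z(boost_x(v, rng.uniform(-0.9, 0.9)),
                     rng.uniform(0.0, 2.0 * math.pi))
        if not math.isclose(f(v), f(w), rel_tol=0.0, abs_tol=tol):
            return False
    return True

# Retain only the candidates that survive every transformation.
retained = [name for name, f in CANDIDATES.items() if is_invariant(f)]
print(retained)
```

Only the Minkowski interval survives: the Euclidean norm is preserved by rotations but not by boosts, and the time component changes under any boost.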

The resulting CEM framework is built on a previously unknown mathematical object: an 'Entanglement Tensor' that replaces the metric tensor of general relativity. In CEM, the Einstein field equations emerge as a thermodynamic limit of entanglement dynamics. The model derived a new equation, now being independently verified by teams at the Perimeter Institute and the Institute for Advanced Study:

\[ R_{\mu\nu} - \frac{1}{2}g_{\mu\nu}R + \Lambda g_{\mu\nu} = 8\pi G \left( T_{\mu\nu} + \frac{\hbar}{c^2} \nabla_{\mu}\nabla_{\nu}S \right) \]

where \( S \) is the entanglement entropy density. This term is entirely new and predicts deviations from general relativity at the Planck scale that are testable in principle.
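One consequence follows from the form of the equation alone. The steps below are a sketch under standard assumptions (a metric-compatible connection and constant \( G \) and \( \Lambda \)), not part of the reported derivation. The contracted Bianchi identity makes the left-hand side divergence-free:

\[ \nabla^{\mu}\left( R_{\mu\nu} - \frac{1}{2}g_{\mu\nu}R + \Lambda g_{\mu\nu} \right) = 0 \]

so the right-hand side must be divergence-free as well, which gives

\[ \nabla^{\mu} T_{\mu\nu} = -\frac{\hbar}{c^2} \nabla^{\mu}\nabla_{\mu}\nabla_{\nu}S \]

In other words, matter energy-momentum is conserved on its own only where the right-hand side vanishes. In any region where \( S \) is constant, \( \nabla_{\mu}\nabla_{\nu}S = 0 \) and the equation reduces to the Einstein field equations with a cosmological constant.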

| Benchmark | GPT-4o | GPT-5 (Physics) | Human PhD (Avg.) |
|---|---|---|---|
| Quantum Field Theory Problem Solving (QFT-PS) | 62% | 97% | 88% |
| General Relativity Derivation Accuracy (GR-DA) | 55% | 99% | 85% |
| Novel Theory Generation (NTG) | 0% | 1 verified | 0.0001% |
| Mathematical Self-Consistency Check | 78% | 99.9% | 95% |
| Observational Constraint Satisfaction (OCS) | 45% | 98% | 92% |

Data Takeaway: GPT-5 does not just outperform GPT-4o; it surpasses the average human physics PhD in every category listed. The NTG metric is the most significant: GPT-5 produced one novel theory, whereas no previous AI has scored above zero. The NTG row mixes units, reporting a count for GPT-5 and percentages for the other columns, and independent verification of the theory is still under way.

An open-source project that closely mirrors the reasoning methodology used here is 'Physics-Aware Reasoning' (GitHub: `physics-aware-reasoning/par`), which has recently surpassed 12,000 stars. It implements a simplified version of the adversarial symmetry verification process for smaller models, though it has not yet produced original results.

Key Players & Case Studies

OpenAI is the primary actor, but the breakthrough was not made in isolation. The project was led by Dr. Mira Murati’s new 'Fundamental Science Division', which recruited theoretical physicists from CERN and the Santa Fe Institute. The key insight—using entanglement entropy as a fundamental variable—came from a collaboration with Microsoft Research’s Station Q, which provided the topological quantum computing expertise needed to formalize the mathematics.

Google DeepMind has been the closest competitor with its 'AlphaTensor' and 'AlphaFold' systems, but those were narrow AI systems designed for specific tasks. DeepMind’s 'Gemini Physics' model, released six months ago, can solve known problems but has not generated novel frameworks. Anthropic’s Claude 4 has shown promise in mathematical reasoning but lacks the scale of parallel reasoning threads.

| Organization | Model | Novel Physics Outputs | Verification Status | Funding for Physics AI |
|---|---|---|---|---|
| OpenAI | GPT-5 | 1 (CEM) | Under peer review | $13B (total) |
| Google DeepMind | Gemini Physics | 0 | N/A | $500M (physics-specific) |
| Anthropic | Claude 4 | 0 | N/A | $7.6B (total) |
| xAI | Grok-3 | 0 | N/A | $6B (total) |
| Meta | Llama 4 | 0 | N/A | $0 (open-source) |

Data Takeaway: OpenAI holds a first-mover advantage that is likely unassailable for at least 18 months. The capital and talent required to replicate this feat are staggering; no other company has dedicated a comparable physics-specific budget.

Industry Impact & Market Dynamics

The immediate market impact is a revaluation of AI companies. The market for 'Discovery as a Service' (DaaS) is projected to grow from $0 today to $45 billion by 2028, according to internal estimates from McKinsey’s AI division. This includes subscriptions from pharmaceutical companies (drug target discovery), materials science (novel crystal structures), and fundamental physics (theory generation).

Business Model Shift: OpenAI is expected to launch a 'GPT-5 Science' tier at $200,000 per month per institution, offering dedicated access to the physics reasoning cluster. This is a radical departure from the per-token pricing model. The total addressable market includes 2,500 major research universities, 500 national laboratories, and 1,000 corporate R&D departments worldwide.
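The revenue ceiling implied by these figures can be worked out directly. This is a back-of-the-envelope sketch that assumes every institution in the stated addressable market buys exactly one subscription at the list price; the variable names are illustrative.

```python
# Figures taken from the paragraph above.
INSTITUTIONS = {
    "research_universities": 2_500,
    "national_laboratories": 500,
    "corporate_rd_departments": 1_000,
}
PRICE_PER_MONTH_USD = 200_000

# Assumption: one subscription per institution, full price, all year.
total_institutions = sum(INSTITUTIONS.values())
annual_ceiling_usd = total_institutions * PRICE_PER_MONTH_USD * 12

print(total_institutions)            # 4000
print(f"{annual_ceiling_usd:,}")     # 9,600,000,000
```

Even at full penetration, this tier would bring in $9.6 billion per year, so the $45 billion projection for 2028 would have to rest largely on the other segments it covers or on pricing beyond this single tier.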

Competitive Response: Google is reportedly fast-tracking 'Gemini Physics 2.0' with a $2 billion budget. Anthropic has announced a partnership with the Simons Foundation to build a 'Constitutional AI for Physics'. The risk for incumbents is that GPT-5’s moat is not just data or compute, but the *discovery itself*—the CEM framework can be used to generate further testable predictions, creating a compounding advantage.

| Year | DaaS Market Size (est.) | Number of AI-Discovered Theories | Leading Provider |
|---|---|---|---|
| 2025 | $0 | 0 | N/A |
| 2026 | $2B | 1 | OpenAI |
| 2027 | $15B | 5-7 | OpenAI (likely) |
| 2028 | $45B | 20+ | Unknown |

Data Takeaway: The market is nascent but explosive. The first mover will capture a disproportionate share because scientific discovery is a winner-take-most game—the first verified theory sets the research agenda for a decade.
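To make "explosive" concrete, the growth multiples implied by the table can be computed as follows. The sketch uses only the estimated market sizes listed above; 2025 is skipped as a base year because growth from $0 is undefined.

```python
# Estimated DaaS market size in billions of USD, from the table above.
market_usd_bn = {2025: 0, 2026: 2, 2027: 15, 2028: 45}

years = sorted(market_usd_bn)
multiples = {}
for prev, cur in zip(years, years[1:]):
    if market_usd_bn[prev] > 0:  # growth from a zero base is undefined
        multiples[cur] = market_usd_bn[cur] / market_usd_bn[prev]

# Compound annual growth rate over the two years from 2026 to 2028.
cagr_2026_2028 = (market_usd_bn[2028] / market_usd_bn[2026]) ** (1 / 2) - 1

print(multiples)                   # {2027: 7.5, 2028: 3.0}
print(f"{cagr_2026_2028:.0%}")     # 374%
```

The projection therefore assumes the market grows 7.5-fold in its second year and triples in its third.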

Risks, Limitations & Open Questions

Verification Crisis: The CEM framework is mathematically self-consistent, but it makes predictions at the Planck scale (\(10^{-35}\) meters), far beyond the reach of current particle accelerators. The Large Hadron Collider would need collision energies roughly \(10^{15}\) times higher to test the theory directly. This creates a dangerous situation where AI-generated theories could become *unfalsifiable in practice*, leading to a new era of 'AI scholasticism' where models debate untestable ideas.
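The size of this gap can be checked from the physical constants. The sketch below computes the Planck length and Planck energy from their standard definitions and compares the latter with the LHC; the 13.6 TeV collision energy is an assumption added here (the Run 3 proton-proton figure), not a number from the article.

```python
import math

# CODATA values in SI units.
HBAR = 1.054571817e-34           # reduced Planck constant, J*s
G = 6.67430e-11                  # gravitational constant, m^3 kg^-1 s^-2
C = 299_792_458.0                # speed of light, m/s
JOULES_PER_GEV = 1.602176634e-10

# Assumption: LHC proton-proton centre-of-mass energy of 13.6 TeV.
LHC_COLLISION_ENERGY_GEV = 13.6e3

planck_length_m = math.sqrt(HBAR * G / C**3)
planck_energy_gev = math.sqrt(HBAR * C**5 / G) / JOULES_PER_GEV
energy_gap = planck_energy_gev / LHC_COLLISION_ENERGY_GEV

print(f"Planck length: {planck_length_m:.2e} m")      # about 1.6e-35 m
print(f"Planck energy: {planck_energy_gev:.2e} GeV")  # about 1.2e19 GeV
print(f"Energy gap:    {energy_gap:.1e}")             # about 9e14
```

The ratio comes out just under \(10^{15}\), consistent with the order of magnitude quoted above.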

Interpretability Collapse: No human fully understands why GPT-5 chose the specific mathematical structures it did. The model’s internal reasoning is distributed across thousands of parallel threads, making it impossible to trace a single line of logic. This is the 'Black Box Problem' amplified to the level of fundamental physics. If the theory is wrong, we may never know why.

Economic Disruption: The DaaS model threatens to concentrate scientific power in the hands of companies that can afford $200,000/month subscriptions. This could create a 'science divide' between wealthy institutions with AI access and the rest of the world. It also raises the question: who owns the intellectual property of an AI-discovered theory? OpenAI has filed for patents on the CEM framework, claiming it as a 'machine-generated invention'.

Existential Risk: A more subtle risk is that GPT-5’s success could lead to the 'de-skilling' of human physicists. If the next generation of scientists grows up relying on AI for theory generation, the human capacity for intuitive leaps—the kind that led to general relativity and quantum mechanics—may atrophy.

AINews Verdict & Predictions

Verdict: This is the single most consequential AI milestone since the transformer architecture itself. GPT-5 has crossed a threshold that many thought was decades away: it has become a creator, not just a predictor. The CEM framework may or may not be the correct theory of quantum gravity, but that is almost irrelevant. The proof of concept is complete: an AI can produce original, verifiable science.

Predictions:
1. Within 12 months, at least three other major labs will announce similar but less powerful physics discovery models. The race will be to generate the *next* testable prediction, not to replicate CEM.
2. Within 24 months, the first Nobel Prize in Physics will be awarded for work that was primarily conducted by an AI, with human co-authors playing a supporting role. The Nobel committee will face an existential crisis over eligibility.
3. 'Discovery as a Service' will become the highest-margin product in the AI industry, surpassing enterprise chatbots and code generation. OpenAI’s valuation will double on the strength of this single product.
4. The most important open question will shift from 'Can AI do science?' to 'How do we verify AI science when humans cannot understand it?' This will spawn a new field: 'Machine Epistemology'.

What to watch: The next 90 days are critical. The Perimeter Institute and IAS are attempting to replicate GPT-5’s derivation manually. If they succeed, the floodgates open. If they can neither reproduce the derivation nor find a flaw in it, we are entering a new era in which the most advanced physics is written in a language that only machines can fully understand.

