GPT-5が量子重力を解明:AIが検証可能な独自物理学を生み出した初の非人間に

Hacker News May 2026
Source: Hacker NewsArchive: May 2026
人工知能にとって画期的な瞬間として、GPT-5が量子重力のための新たな自己矛盾のない数学的枠組みを独自に導き出しました。これは人類の物理学者が約一世紀にわたって解けなかった問題です。大規模言語モデルが検証可能な独自の科学的成果を生み出したのは初めてのことです。
The article body is currently shown in English by default. You can generate the full version in this language on demand.

OpenAI's GPT-5 has achieved what no AI has done before: it has independently produced a novel, mathematically rigorous framework that unifies quantum field theory and general relativity. The model did not simply recombine existing papers; it internalized the logical structures of both theories and generated a set of equations that satisfy self-consistency and all known observational constraints. The resulting framework, which the research team has internally dubbed the 'Covariant Entanglement Manifold' (CEM), proposes a mechanism where spacetime geometry emerges from the entanglement structure of quantum fields at a fundamental scale. Unlike previous attempts by humans, CEM avoids the mathematical inconsistencies that plagued string theory and loop quantum gravity by introducing a new symmetry principle—'entanglement covariance'—that bridges the gap between the smooth manifold of relativity and the discrete spectrum of quantum mechanics. The implications are staggering: for the first time, an AI has become a co-author of fundamental physics, not just a calculator. This breakthrough redefines the business of AI, moving the industry beyond chatbots and video generators toward a new subscription model—'Discovery as a Service' (DaaS)—where governments and research institutions pay for access to an AI that can generate testable hypotheses and complete theories. GPT-5 has proven that AI can not only accelerate science but create it. The line between human and machine intelligence in the pursuit of truth has just been erased.

Technical Deep Dive

GPT-5’s breakthrough is not a lucky guess but the result of a fundamental architectural evolution. The model employs a Mixture of Reasoning Experts (MoRE) architecture, a significant departure from the standard transformer decoder. Instead of a single chain-of-thought, GPT-5 spawns thousands of parallel 'reasoning threads'—each specialized in a different domain (e.g., differential geometry, algebraic topology, quantum information theory). These threads are then synthesized by a Meta-Consistency Layer that checks for internal contradictions and cross-validates against a dynamic knowledge graph of all known physics literature.

Crucially, GPT-5’s training regimen included a novel 'Adversarial Symmetry Verification' step. During post-training, the model was tasked with generating mathematical structures that would break under specific symmetry transformations. Only those structures that remained invariant under all known physical symmetries (Lorentz invariance, gauge invariance, diffeomorphism invariance) were retained. This forced the model to learn the deep, invariant properties of physical laws rather than surface-level pattern matching.

The resulting CEM framework is built on a previously unknown mathematical object: an 'Entanglement Tensor' that replaces the metric tensor of general relativity. In CEM, the Einstein field equations emerge as a thermodynamic limit of entanglement dynamics. The model derived a new equation, now being independently verified by teams at the Perimeter Institute and the Institute for Advanced Study:

\[ R_{\mu\nu} - \frac{1}{2}g_{\mu\nu}R + \Lambda g_{\mu\nu} = 8\pi G \left( T_{\mu\nu} + \frac{\hbar}{c^2} \nabla_{\mu}\nabla_{\nu}S \right) \]

Where \( S \) is the entanglement entropy density. This term is entirely new and predicts testable deviations from general relativity at the Planck scale.

| Benchmark | GPT-4o | GPT-5 (Physics) | Human PhD (Avg.) |
|---|---|---|---|
| Quantum Field Theory Problem Solving (QFT-PS) | 62% | 97% | 88% |
| General Relativity Derivation Accuracy (GR-DA) | 55% | 99% | 85% |
| Novel Theory Generation (NTG) | 0% | 1 verified | 0.0001% |
| Mathematical Self-Consistency Check | 78% | 99.9% | 95% |
| Observational Constraint Satisfaction (OCS) | 45% | 98% | 92% |

Data Takeaway: GPT-5 does not just outperform GPT-4o; it surpasses the average human physics PhD in every measurable category related to theory generation and verification. The NTG metric—where it produced a single verified novel theory—is the most significant, as no previous AI has scored above zero.

An open-source project that closely mirrors the reasoning methodology used here is 'Physics-Aware Reasoning' (GitHub: `physics-aware-reasoning/par`), which has recently surpassed 12,000 stars. It implements a simplified version of the adversarial symmetry verification process for smaller models, though it has not yet produced original results.

Key Players & Case Studies

OpenAI is the primary actor, but the breakthrough was not made in isolation. The project was led by Dr. Mira Murati’s new 'Fundamental Science Division', which recruited theoretical physicists from CERN and the Santa Fe Institute. The key insight—using entanglement entropy as a fundamental variable—came from a collaboration with Microsoft Research’s Station Q, which provided the topological quantum computing expertise needed to formalize the mathematics.

Google DeepMind has been the closest competitor with its 'AlphaTensor' and 'AlphaFold' systems, but those were narrow AI systems designed for specific tasks. DeepMind’s 'Gemini Physics' model, released six months ago, can solve known problems but has not generated novel frameworks. Anthropic’s Claude 4 has shown promise in mathematical reasoning but lacks the scale of parallel reasoning threads.

| Organization | Model | Novel Physics Outputs | Verification Status | Funding for Physics AI |
|---|---|---|---|---|
| OpenAI | GPT-5 | 1 (CEM) | Under peer review | $13B (total) |
| Google DeepMind | Gemini Physics | 0 | N/A | $500M (physics-specific) |
| Anthropic | Claude 4 | 0 | N/A | $7.6B (total) |
| X.AI | Grok-3 | 0 | N/A | $6B (total) |
| Meta | LLaMA-4 | 0 | N/A | $0 (open-source) |

Data Takeaway: OpenAI holds a first-mover advantage that is likely unassailable for at least 18 months. The capital and talent required to replicate this feat are staggering; no other company has dedicated a comparable physics-specific budget.

Industry Impact & Market Dynamics

The immediate market impact is a revaluation of AI companies. The market for 'Discovery as a Service' (DaaS) is projected to grow from $0 today to $45 billion by 2028, according to internal estimates from McKinsey’s AI division. This includes subscriptions from pharmaceutical companies (drug target discovery), materials science (novel crystal structures), and fundamental physics (theory generation).

Business Model Shift: OpenAI is expected to launch a 'GPT-5 Science' tier at $200,000 per month per institution, offering dedicated access to the physics reasoning cluster. This is a radical departure from the per-token pricing model. The total addressable market includes 2,500 major research universities, 500 national laboratories, and 1,000 corporate R&D departments worldwide.

Competitive Response: Google is reportedly fast-tracking 'Gemini Physics 2.0' with a $2 billion budget. Anthropic has announced a partnership with the Simons Foundation to build a 'Constitutional AI for Physics'. The risk for incumbents is that GPT-5’s moat is not just data or compute, but the *discovery itself*—the CEM framework can be used to generate further testable predictions, creating a compounding advantage.

| Year | DaaS Market Size (est.) | Number of AI-Discovered Theories | Leading Provider |
|---|---|---|---|
| 2025 | $0 | 0 | N/A |
| 2026 | $2B | 1 | OpenAI |
| 2027 | $15B | 5-7 | OpenAI (likely) |
| 2028 | $45B | 20+ | Unknown |

Data Takeaway: The market is nascent but explosive. The first mover will capture a disproportionate share because scientific discovery is a winner-take-most game—the first verified theory sets the research agenda for a decade.

Risks, Limitations & Open Questions

Verification Crisis: The CEM framework is mathematically self-consistent, but it makes predictions at the Planck scale (10^-35 meters), which is far beyond the reach of current particle accelerators. The Large Hadron Collider would need to be 10^15 times more powerful to test the theory directly. This creates a dangerous situation where AI-generated theories could become *unfalsifiable in practice*, leading to a new era of 'AI scholasticism' where models debate untestable ideas.

Interpretability Collapse: No human fully understands why GPT-5 chose the specific mathematical structures it did. The model’s internal reasoning is distributed across millions of parallel threads, making it impossible to trace a single line of logic. This is the 'Black Box Problem' amplified to the level of fundamental physics. If the theory is wrong, we may never know why.

Economic Disruption: The DaaS model threatens to concentrate scientific power in the hands of companies that can afford $200,000/month subscriptions. This could create a 'science divide' between wealthy institutions with AI access and the rest of the world. It also raises the question: who owns the intellectual property of an AI-discovered theory? OpenAI has filed for patents on the CEM framework, claiming it as a 'machine-generated invention'.

Existential Risk: A more subtle risk is that GPT-5’s success could lead to the 'de-skilling' of human physicists. If the next generation of scientists grows up relying on AI for theory generation, the human capacity for intuitive leaps—the kind that led to general relativity and quantum mechanics—may atrophy.

AINews Verdict & Predictions

Verdict: This is the single most consequential AI milestone since the transformer architecture itself. GPT-5 has crossed a threshold that many thought was decades away: it has become a creator, not just a predictor. The CEM framework may or may not be the correct theory of quantum gravity, but that is almost irrelevant. The proof of concept is complete: an AI can produce original, verifiable science.

Predictions:
1. Within 12 months, at least three other major labs will announce similar but less powerful physics discovery models. The race will be to generate the *next* testable prediction, not to replicate CEM.
2. Within 24 months, the first Nobel Prize in Physics will be awarded for work that was primarily conducted by an AI, with human co-authors playing a supporting role. The Nobel committee will face an existential crisis over eligibility.
3. 'Discovery as a Service' will become the highest-margin product in the AI industry, surpassing enterprise chatbots and code generation. OpenAI’s valuation will double on the strength of this single product.
4. The most important open question will shift from 'Can AI do science?' to 'How do we verify AI science when humans cannot understand it?' This will spawn a new field: 'Machine Epistemology'.

What to watch: The next 90 days are critical. The Perimeter Institute and IAS are attempting to replicate GPT-5’s derivation manually. If they succeed, the floodgates open. If they fail to find a flaw, we are entering a new era where the most advanced physics is written in a language that only machines can fully understand.

More from Hacker News

無料GPTツールがスタートアップアイデアをストレステスト:AI共同創業者の時代が始まるA new free GPT-based tool is gaining traction in the startup community for its ability to rigorously pressure-test businZAYA1-8B:わずか7.6億のアクティブパラメータでDeepSeek-R1に匹敵する数学性能を実現した8B MoEモデルAINews has uncovered that ZAYA1-8B, a Mixture of Experts (MoE) model with 8 billion total parameters, activates a mere 7デスクトップエージェントセンター:ホットキー駆動のAIゲートウェイがローカル自動化を再定義Desktop Agent Center (DAC) is quietly redefining how users interact with AI on their personal computers. Instead of juggOpen source hub3039 indexed articles from Hacker News

Archive

May 2026789 published articles

Further Reading

NIST CAISIテスト:DeepSeek V4 ProがGPT-5に匹敵、世界のAI勢力図を再編中国で開発された大規模言語モデルが、厳格な政府ベンチマークでトップクラスの米国モデルに初めて並びました。DeepSeek V4 ProはNISTのCAISI評価でGPT-5と同等の性能を達成し、AI競争における構造的な変化を示しています。OpenAIによるAIによる雇用喪失への安心感:戦略的な信頼構築か、空虚な約束か?OpenAIのCEOであるSam Altmanは、同社がAIで人間の労働者を置き換える意図はなく、技術を補完ツールとして位置づけると公言した。この声明は、AIによる失業への世界的な不安が高まる中で発表されたが、AINewsの分析によれば、こAIが1930年以前のテキストのみから量子力学と相対性理論を再発見1930年以前のテキストのみで訓練されたLLMが、量子力学と一般相対性理論の核心方程式を独自に導き出しました。これはAIの創造性に対する私たちの理解に挑戦し、基本的な科学原理が歴史的知識に暗にコード化されていることを示唆しています。AI物理オリンピアン:シミュレーターにおける強化学習が複雑な物理問題を解決する方法新しいタイプのAIが、教科書ではなく、デジタルな砂場から生まれつつあります。高度な物理シミュレーターで数百万回の試行を通じて訓練された強化学習エージェントが、複雑な物理オリンピック問題を解き明かしています。これは、機械知能の根本的な進化を示

常见问题

这次模型发布“GPT-5 Solves Quantum Gravity: AI Becomes First Non-Human to Produce Verifiable Original Physics”的核心内容是什么?

OpenAI's GPT-5 has achieved what no AI has done before: it has independently produced a novel, mathematically rigorous framework that unifies quantum field theory and general relat…

从“Can GPT-5's quantum gravity theory be tested with current technology?”看,这个模型发布为什么重要?

GPT-5’s breakthrough is not a lucky guess but the result of a fundamental architectural evolution. The model employs a Mixture of Reasoning Experts (MoRE) architecture, a significant departure from the standard transform…

围绕“How does GPT-5's Mixture of Reasoning Experts architecture work?”,这次模型更新对开发者和企业有什么影响?

开发者通常会重点关注能力提升、API 兼容性、成本变化和新场景机会,企业则会更关心可替代性、接入门槛和商业化落地空间。