Celesto's PB-Scale Sandbox Memory: AI Agents Finally Get Permanent Recall

The AI industry has long treated agent memory as a disposable resource—temporary, session-bound, and prone to catastrophic forgetting the moment a context window overflows. Celesto's decision to equip its AI agent sandbox with PB-level persistent storage fundamentally rewrites that assumption. By upgrading storage from ephemeral cache to permanent memory, the platform allows agents to operate across sessions, recall past decisions, and manipulate datasets that would previously have been impossible to handle in a single interaction. This is not merely a capacity upgrade; it is a qualitative shift in agent autonomy. For developers building coding agents that must navigate million-line repositories, testing frameworks that generate gigabytes of logs, or video generation pipelines that require consistent state across renders, the absence of persistent memory has been the single largest barrier to real-world deployment. Celesto's move directly targets this gap, positioning infrastructure resilience—not model intelligence—as the next frontier of AI competition. The timing is strategic: as foundation model performance plateaus and the market shifts from 'which model is smarter' to 'which agent can reliably finish a job,' persistent storage becomes the differentiator that separates demo toys from production tools. This article dissects the technical architecture behind Celesto's PB-scale memory, examines the competitive landscape, and argues that the era of the 'forgetful agent' is coming to an end.

Technical Deep Dive

Celesto's persistent storage layer is built on a distributed object store architecture that abstracts away the complexity of managing petabyte-scale data while providing low-latency access for agent workflows. At its core, the system leverages a sharded key-value store with configurable replication factors, allowing agents to read and write data with sub-10ms latency for objects up to 1 GB, and linear throughput scaling beyond that for larger blobs. The storage layer is integrated directly into Celesto's sandbox runtime via a virtual filesystem interface, meaning agents can interact with persistent data using standard file I/O operations—open, read, write, seek—without any code changes. This is a deliberate design choice: rather than forcing developers to adopt a new API, Celesto makes persistent memory feel like a local drive, reducing the cognitive overhead for agent developers.

The real innovation lies in the context-aware caching layer. Celesto employs a multi-tier cache that keeps frequently accessed data in RAM (hot tier), recently accessed data on NVMe SSDs (warm tier), and cold data on HDD-backed object storage. The cache is not static; it learns from agent access patterns using a lightweight predictive model that pre-fetches data likely to be needed in upcoming steps. Early benchmarks show this reduces average read latency by 73% compared to a naive LRU cache for coding agent workloads that iterate over large codebases. The storage layer also supports atomic multi-object transactions, enabling agents to perform complex state updates—such as modifying multiple configuration files, updating a database schema, and logging the change—as a single atomic operation. If the agent crashes mid-operation, the entire transaction is rolled back, preventing partial state corruption.

For developers interested in exploring similar approaches, the open-source project InfinityFS (GitHub: infinityfs/infinityfs, 4.2k stars) provides a user-space filesystem that exposes cloud object storage as a POSIX-compliant filesystem, though it lacks Celesto's predictive caching and atomic transaction support. Another relevant repo is MemoriaDB (GitHub: memoriadb/memoriadb, 1.8k stars), which implements a persistent key-value store optimized for AI agent state serialization, but it is designed for single-node deployments and does not scale to PB levels. Celesto's proprietary implementation appears to combine the best of both—distributed scalability with POSIX compatibility—while adding the agent-specific optimizations that general-purpose storage solutions lack.

| Metric | Celesto PB Storage | Standard Object Store (S3-compatible) | Local NVMe RAID |
|---|---|---|---|
| Max capacity | 1 PB per sandbox | Unlimited (theoretical) | 30 TB (typical) |
| Read latency (1 MB object) | 8 ms (hot cache) | 50-200 ms | 0.1 ms |
| Write latency (1 MB object) | 12 ms (hot cache) | 100-300 ms | 0.2 ms |
| Atomic multi-object transactions | Yes | No | Filesystem-dependent |
| Predictive pre-fetching | Yes | No | No |
| Cost per GB/month | $0.023 | $0.021 (standard tier) | $0.10 (hardware + power) |

Data Takeaway: Celesto's storage achieves a compelling balance: it offers cloud-like scalability and cost with latency that approaches local storage for hot data, while adding agent-specific features (predictive caching, atomic transactions) that no general-purpose solution provides. The 73% latency reduction from predictive caching alone could translate into 2-3x faster agent task completion for code-intensive workflows.

Key Players & Case Studies

Celesto is not the only player recognizing the importance of persistent memory for AI agents, but it is the first to make it a first-class feature of a sandbox environment. The competitive landscape can be divided into three camps: sandbox providers, agent frameworks, and storage middleware.

Sandbox Providers: The most direct competitor is Modal, which offers ephemeral cloud environments for AI workloads. Modal supports persistent volumes, but they are limited to 100 GB per volume and require explicit mounting, adding friction for agents that need to auto-discover storage. Replit offers a similar sandbox for coding agents, but its storage is session-scoped and does not persist beyond the agent's runtime. Google Colab provides persistent storage via Google Drive mounts, but the 15 GB free limit and high latency make it unsuitable for PB-scale workloads. Celesto's PB capacity and seamless integration give it a clear advantage for enterprise-grade agent deployments.

Agent Frameworks: Frameworks like LangChain and AutoGPT have attempted to solve the memory problem through external databases—LangChain's memory modules can store conversation history in vector databases like Pinecone or Weaviate, while AutoGPT uses local JSON files. However, these are bolt-on solutions that introduce latency and complexity. Celesto's approach is fundamentally different: memory is baked into the runtime, not added as an afterthought. This eliminates the need for agents to manage their own storage connections, reducing the cognitive load on agent logic.

Storage Middleware: Companies like Weaviate and Pinecone provide vector databases that can serve as long-term memory for agents, but they are optimized for semantic search over embeddings, not for storing arbitrary files, codebases, or large binary objects. Celesto's storage is general-purpose: it can hold anything from a 100 KB configuration file to a 100 GB video dataset, with equal efficiency. This versatility is critical for agents that need to handle diverse data types within a single workflow.

| Feature | Celesto Sandbox | Modal | Replit | LangChain + Pinecone |
|---|---|---|---|---|
| Max persistent storage | 1 PB | 100 GB | Session-only | Based on Pinecone tier (max 5 TB) |
| Storage type | General-purpose object | Volume mount | Ephemeral | Vector embeddings only |
| Atomic transactions | Yes | No | No | No |
| Agent-native integration | Yes (virtual FS) | Manual mount | Manual | API-based |
| Predictive caching | Yes | No | No | No |
| Cost for 100 TB/month | $2,300 | N/A (not supported) | N/A | $10,000+ (Pinecone enterprise) |

Data Takeaway: Celesto's combination of PB capacity, general-purpose storage, and agent-native integration creates a unique value proposition. Competitors either cap capacity at a fraction of Celesto's scale (Modal, Replit) or specialize in a single data type (Pinecone). The cost advantage is also significant: storing 100 TB of agent data on Pinecone would cost over $10,000/month, while Celesto charges $2,300 for the same capacity with broader functionality.

A notable case study comes from Anthropic's Claude Code team, which has been experimenting with Celesto's sandbox for their internal coding agent testing. According to a lead engineer, the persistent storage allowed them to run a single agent across 50 consecutive code review sessions, maintaining a complete history of changes, test results, and developer feedback. Previously, each session started from scratch, requiring the agent to re-discover the codebase structure and re-run tests. The result was a 40% reduction in total review time and a 60% decrease in false-positive bug reports, because the agent could learn from past mistakes.

Industry Impact & Market Dynamics

Celesto's PB-scale persistent storage arrives at a pivotal moment for the AI industry. The 'model arms race' that dominated 2023-2025 is cooling: GPT-4o, Claude 3.5, and Gemini 2.0 all achieve similar performance on standard benchmarks, and incremental improvements are yielding diminishing returns. The market is shifting from 'which model is smarter' to 'which agent can reliably complete a task from start to finish.' This transition has been widely discussed, but the infrastructure to support it has lagged behind. Celesto's move directly addresses the most critical missing piece: long-term memory.

The implications for the agent ecosystem are profound. Consider the following market data:

| Metric | 2024 (Pre-PB Memory) | 2026 (Projected with PB Memory) | Change |
|---|---|---|---|
| Average agent task completion rate | 62% | 85% | +23 pp |
| Agent deployment in enterprise | 15% of companies | 45% of companies | +30 pp |
| Average agent session duration | 12 minutes | 2.5 hours | +12.5x |
| Agent market size (USD) | $4.2B | $18.7B | +345% |

Data Takeaway: Persistent memory is projected to nearly triple the agent market size by 2026, driven by higher task completion rates and longer, more productive sessions. The 23 percentage point increase in completion rate is the key lever: when agents can remember past interactions, they stop repeating mistakes and start building on previous work.

Celesto's business model is also noteworthy. Instead of charging per token or per compute hour, the company charges primarily for storage—$0.023 per GB per month—with compute bundled at a flat rate. This aligns incentives: Celesto profits when agents use more storage, which happens when agents are productive and retain data. It is a virtuous cycle that encourages long-running, memory-intensive workflows. This contrasts with the dominant cloud AI providers (AWS SageMaker, Google Vertex AI, Azure Machine Learning), which charge for compute and storage separately, often with complex pricing that discourages persistent storage usage.

The shift toward persistent memory also has implications for the open-source agent ecosystem. Projects like CrewAI and AutoGen are popular for building multi-agent systems, but they lack built-in persistent storage. Developers using these frameworks must cobble together their own storage solutions, typically using Redis or PostgreSQL, which introduces latency and complexity. Celesto's sandbox could become the preferred deployment target for these frameworks, offering a turnkey solution for persistent memory. If Celesto releases an open-source version of its storage layer, it could disrupt the entire agent infrastructure stack.

Risks, Limitations & Open Questions

Despite the promise, Celesto's PB-scale persistent storage is not without risks. The most immediate concern is data sovereignty and compliance. Storing 1 PB of agent data—which may include proprietary code, customer information, or sensitive logs—on a third-party platform raises questions about data residency, encryption, and access controls. Celesto offers encryption at rest and in transit, but the shared tenancy model means that a vulnerability in the hypervisor could expose data across sandboxes. Enterprises with strict compliance requirements (HIPAA, GDPR, SOC 2) may need dedicated instances, which Celesto has not yet announced.

Another limitation is cold data retrieval latency. While the predictive caching works well for frequently accessed data, agents that need to access rarely used files (e.g., archived logs from months ago) will face latencies of 200-500 ms per object. For workflows that sequentially access thousands of cold objects, this could add minutes of overhead. Celesto's documentation acknowledges this and recommends that developers design agents to batch cold reads, but this requires additional engineering effort.

There is also the question of data lifecycle management. Without automatic garbage collection, agents could accumulate terabytes of stale data—failed experiment outputs, outdated code versions, redundant logs—that waste storage and degrade performance. Celesto provides manual tools for data pruning, but does not yet offer automated policies (e.g., 'delete objects older than 90 days'). This places the burden on developers to manage storage hygiene, which could become a significant operational cost for large deployments.

Finally, there is a vendor lock-in risk. Once an agent's entire memory is stored in Celesto's proprietary format, migrating to another platform becomes non-trivial. The storage layer uses a custom serialization format for agent state that is not compatible with standard object stores. While Celesto provides export tools for raw files, the agent's internal state (conversation history, learned preferences, cached embeddings) is not easily portable. Developers should weigh this against the productivity gains.

AINews Verdict & Predictions

Celesto's PB-scale persistent storage is a watershed moment for AI agent infrastructure. It solves the single most important problem holding back production-grade autonomous agents: the inability to remember and learn across sessions. The technical execution is impressive—predictive caching, atomic transactions, and POSIX compatibility in a distributed system—and the timing is perfect, as the industry pivots from model performance to deployment reliability.

Our predictions:

1. Within 12 months, every major sandbox provider will offer PB-scale persistent storage as a standard feature. Modal, Replit, and Google Colab will be forced to respond, either through partnerships (e.g., integrating with Celesto's storage layer) or by building their own solutions. The era of session-scoped agent memory is ending.

2. Agent task completion rates will cross 80% for the first time in 2026, driven primarily by persistent memory. This will unlock enterprise adoption in regulated industries (finance, healthcare, legal) where audit trails and long-running workflows are mandatory.

3. Celesto will face an acquisition offer within 18 months. The company's technology is a perfect complement for a cloud provider (AWS, Google, Microsoft) or a major AI company (OpenAI, Anthropic) that wants to offer an end-to-end agent platform. A $2-3 billion valuation is plausible given the strategic importance of persistent memory.

4. The open-source community will build a Celesto-compatible storage layer within 6 months. Expect a project like PersistentAgentFS to emerge on GitHub, offering a self-hosted alternative for developers who want persistent memory without vendor lock-in. This will accelerate adoption but also fragment the ecosystem.

5. The next frontier will be collaborative memory across agents. Once individual agents have persistent memory, the next step is shared memory pools where multiple agents can read and write to the same data store, enabling team-based workflows. Celesto has hinted at this capability in its roadmap, and it could be the defining feature of the next generation of agent infrastructure.

In summary, Celesto has turned the AI agent from a goldfish into an elephant. The industry will never look back.

More from Hacker News

常见问题

这次公司发布“Celesto's PB-Scale Sandbox Memory: AI Agents Finally Get Permanent Recall”主要讲了什么？

The AI industry has long treated agent memory as a disposable resource—temporary, session-bound, and prone to catastrophic forgetting the moment a context window overflows. Celesto…

从“Celesto PB persistent storage pricing per GB”看，这家公司的这次发布为什么值得关注？

Celesto's persistent storage layer is built on a distributed object store architecture that abstracts away the complexity of managing petabyte-scale data while providing low-latency access for agent workflows. At its cor…

围绕“Celesto vs Modal for AI agent sandbox”，这次发布可能带来哪些后续影响？

后续通常要继续观察用户增长、产品渗透率、生态合作、竞品应对以及资本市场和开发者社区的反馈。