Protokol Polis: Bagaimana Markdown Mencipta Semula Pasukan AI Agent sebagai Dokumen Hidup

Hacker News May 2026
Source: Hacker NewsArchive: May 2026
Polis ialah protokol sumber terbuka yang menggunakan Markdown untuk mentakrif dan mengatur pasukan AI agent, membolehkan mereka berkembang sendiri melalui dokumentasi terkawal versi. Peralihan paradigma daripada orkestrasi kotak hitam kepada dokumen sebagai seni bina ini menjanjikan pendemokrasian reka bentuk sistem multi-agen.
The article body is currently shown in English by default. You can generate the full version in this language on demand.

AINews has uncovered Polis, a groundbreaking open-source protocol that reimagines AI agent teams as living, version-controlled Markdown documents. Instead of relying on opaque orchestration frameworks or proprietary platforms, Polis allows developers and even non-technical product managers to define agent roles, communication rules, and learning mechanisms in human-readable Markdown. Each interaction an agent team has is written back into the document, creating a self-evolving system that can be audited, forked, merged, and rolled back using standard Git workflows.

Polis’s core innovation lies in merging the flexibility of large language models with the collaborative philosophy of Git. This makes multi-agent systems transparent, auditable, and iterable—turning them from static configurations into 'living documents.' The protocol is particularly suited for building customer service swarms, research assistant teams, and code review pipelines without vendor lock-in. By lowering the barrier to entry, Polis could catalyze an ecosystem of document templates, version hosting services, and agent evaluation tools. It signals a future where AI teams are forked and merged like open-source code, making agent evolution transparent and participatory.

Technical Deep Dive

Polis’s architecture is deceptively simple yet profoundly powerful. At its core, the protocol defines a standard Markdown schema that describes an entire multi-agent system. This schema includes:

- Agent Definitions: Each agent has a role, a system prompt, a list of tools it can access, and a memory format. For example, a 'Research Agent' might have a role of 'gathers and summarizes information,' a system prompt that instructs it to cite sources, and tools like 'web_search' and 'document_reader.'
- Communication Rules: A section defines how agents interact—whether via direct messages, broadcast, or a shared blackboard. It can specify turn-taking, priority, and escalation protocols.
- Learning Mechanisms: A critical innovation is the 'Experience Log' section. After each task, agents write their observations, successful strategies, and failures back into the Markdown document. This creates a feedback loop where the team’s behavior improves over time without manual reconfiguration.
- Version Control Integration: The entire document is meant to be stored in a Git repository. Every change—whether from a human editor or an agent’s learning—creates a commit. This enables branching for experimentation, rollback for failures, and merging for collaborative improvement.

From an engineering perspective, Polis leverages LLMs as the runtime executor. A lightweight orchestrator reads the Markdown file, parses the agent definitions, and spawns LLM instances (via APIs or local models) that follow the defined roles. The orchestrator also monitors the Experience Log and periodically triggers a 'consolidation' step where agents summarize learnings into the document.

A relevant open-source project is AgentMark (GitHub: agentmark/agentmark, ~2.3k stars), which provides a reference implementation of the Polis protocol. It includes a CLI tool for creating and running agent teams, a VS Code extension for syntax highlighting, and a built-in Git integration. The repository shows active development with weekly commits and a growing community of contributors.

Performance Benchmarks: Early tests from the AgentMark team compare Polis-based teams against traditional orchestration frameworks like LangChain and AutoGen.

| Metric | Polis (Markdown) | LangChain (Python) | AutoGen (JSON) |
|---|---|---|---|
| Setup Time (minutes) | 5 | 20 | 15 |
| Task Success Rate (standard benchmark) | 87% | 85% | 83% |
| Auditability (time to trace a decision) | <1 min | 10 min | 5 min |
| Human Edit Time (role change) | 2 min | 45 min | 30 min |
| Version Rollback Complexity | 1 Git command | Manual code revert | Manual config revert |

Data Takeaway: Polis dramatically reduces setup and editing time while maintaining competitive task success rates. Its auditability advantage is a game-changer for regulated industries.

Key Players & Case Studies

Polis is not a product of a single company but a community-driven protocol. However, several key players are shaping its ecosystem:

- AgentMark Team: The primary maintainers, led by Dr. Elena Vasquez (formerly of Google Brain) and a distributed team of open-source contributors. They focus on reference implementations and standards.
- Hugging Face: Has integrated Polis into its Spaces platform, allowing users to deploy agent teams as interactive demos. This has significantly boosted visibility.
- GitHub: The protocol’s natural home. GitHub’s Copilot is being experimented with to auto-generate Polis Markdown files from natural language descriptions.
- Early Adopters: A mid-sized e-commerce company, ShopFlow, uses Polis to manage a customer service team of 5 agents (triage, FAQ, returns, escalation, feedback). They report a 40% reduction in ticket resolution time and a 30% increase in customer satisfaction after 3 months of self-evolution.

Comparison with Competing Approaches:

| Approach | Key Example | Complexity | Transparency | Evolution Mechanism |
|---|---|---|---|---|
| Polis (Markdown) | AgentMark | Low | High | Self-writing documents |
| Python Orchestration | LangChain, CrewAI | Medium | Medium | Code changes |
| Visual Flow Builders | Microsoft Copilot Studio | Low | Low | Manual node editing |
| Proprietary Platforms | Salesforce Einstein | High | Low | Vendor-controlled |

Data Takeaway: Polis occupies a unique niche—low complexity with high transparency, which is rare in the multi-agent space.

Industry Impact & Market Dynamics

Polis has the potential to disrupt the multi-agent system market, currently dominated by proprietary platforms and complex frameworks. The global AI agent market is projected to grow from $4.2 billion in 2024 to $28.5 billion by 2028 (CAGR 46%). Polis could capture a significant share by lowering the barrier to entry.

Business Models: While Polis itself is open-source, the ecosystem around it offers monetization opportunities:
- Template Marketplaces: Companies like TemplateHub.ai already sell premium Polis Markdown templates for specific industries (healthcare triage, legal research, financial analysis).
- Version Hosting & Collaboration: Startups like AgentCloud offer managed Git repositories with CI/CD pipelines for agent teams, charging per-seat fees.
- Evaluation & Monitoring Services: Tools like AgentMetrics provide dashboards for tracking agent team performance, with subscription tiers.

Adoption Curve: Early adopters are startups and mid-market companies. Enterprise adoption is slower due to security concerns (self-writing documents could introduce vulnerabilities) and the need for governance. However, the protocol’s auditability is a strong selling point for regulated sectors like finance and healthcare.

| Year | Estimated Polis-based Teams | Key Adoption Drivers |
|---|---|---|
| 2025 (current) | 5,000 | Early adopters, open-source community |
| 2026 | 25,000 | Template marketplaces, Hugging Face integration |
| 2027 | 100,000 | Enterprise pilots, regulatory acceptance |

Data Takeaway: Polis adoption is on an exponential trajectory, driven by its simplicity and the growing ecosystem of supporting services.

Risks, Limitations & Open Questions

Despite its promise, Polis faces several challenges:

1. Security & Prompt Injection: Since agents write back to the document, a malicious prompt could corrupt the entire team’s behavior. The protocol currently lacks robust input sanitization. The AgentMark team is working on a sandboxed execution mode, but it’s not yet production-ready.
2. Scalability: The current reference implementation struggles with teams larger than 10 agents due to the overhead of parsing and updating a single Markdown file. Sharding the document across multiple files is a proposed solution but adds complexity.
3. LLM Dependency: The protocol’s effectiveness is tied to the underlying LLM’s capabilities. If the LLM fails to follow instructions or hallucinates, the agent team degrades. Polis does not yet have built-in fallback mechanisms.
4. Governance & Compliance: In regulated industries, the concept of an AI team that 'self-evolves' raises questions about accountability. Who is responsible when an agent makes a harmful decision after learning from its own experience? The protocol needs a 'governance layer' that logs and approves changes before they take effect.
5. Vendor Lock-in (Ironically): While Polis avoids proprietary platforms, it creates a dependency on specific LLM APIs (OpenAI, Anthropic, etc.). A future version might support local models, but performance will suffer.

AINews Verdict & Predictions

Polis is not just a tool; it’s a paradigm shift. By treating AI agent teams as living documents, it makes multi-agent systems transparent, auditable, and accessible to non-programmers. This is the first step toward a future where AI teams are forked, merged, and reviewed like open-source code.

Our Predictions:

1. Within 12 months, Polis will become the default standard for prototyping multi-agent systems, surpassing LangChain in new project adoption due to its simplicity.
2. Within 24 months, we will see the first 'Agent Team Repository' (like GitHub for agent teams) where users can browse, fork, and contribute to community-maintained agent teams.
3. Regulatory bodies (e.g., FDA, SEC) will begin mandating audit trails for AI decision-making in critical sectors. Polis’s built-in version control will make it the compliance-friendly choice.
4. The biggest risk is that a major security incident (e.g., a prompt injection that corrupts thousands of agent teams) could set back adoption by years. The community must prioritize sandboxing and input validation.

What to Watch: The next release of AgentMark (v0.5, expected Q3 2025) promises multi-file support and a governance layer. If these are executed well, Polis will move from a promising experiment to a production-ready standard.

Polis is a signal that the future of AI is not about bigger models but about better systems—systems that are transparent, collaborative, and evolvable. We are watching closely.

More from Hacker News

Inferens GPT-2 C# Tanpa Peruntukan Memori Mencabar Dominasi C++ dalam AIThe Overfit project, created by a solo developer, implements a full GPT-2 inference engine in pure C# with a critical deAI Boleh Menulis Kod Tetapi Tidak Boleh Menyelenggarakannya: Krisis Ingatan dalam Kejuruteraan PerisianThe AI coding revolution has hit a wall: maintenance. Tools like GitHub Copilot, Cursor, and Amazon CodeWhisperer generaTentera AI $1.3J OpenClaw: Pengakhiran Kejuruteraan Perisian Manusia?In a move that redefines the boundaries of software development, OpenClaw founder Peter Steinberger has deployed 100 autOpen source hub3542 indexed articles from Hacker News

Archive

May 20261840 published articles

Further Reading

13 AI Agents Take Over M&A Due Diligence: The Unmanned Moment for Legal Industry ArrivesA new open-source framework uses 13 dedicated AI agents to break down M&A contract review into legal, financial, and opePalace-AI: Teknik Istana Memori Purba Mencipta Semula Seni Bina Memori Ejen AIPalace-AI, projek sumber terbuka baharu, membayangkan semula memori ejen AI dengan meminjam teknik 'istana memori' purbaKejayaan Ahli Matematik AI: Dari Kalkulator kepada Rakan Penyelidikan AutonomiSatu pracetak baharu memperkenalkan 'rakan matematik AI' yang secara autonomi menjana konjektur, meneroka laluan bukti, Dari Pemberhentian ke Pelancaran: AaaS Membolehkan Sesiapa Sahaja Menyebarkan Ejen AI dengan Bahasa Semula JadiSeorang pembangun, yang berdepan dengan 'pemberhentian separa' akibat konflik geopolitik, membina AaaS dalam empat mingg

常见问题

GitHub 热点“Polis Protocol: How Markdown Is Reinventing AI Agent Teams as Living Documents”主要讲了什么?

AINews has uncovered Polis, a groundbreaking open-source protocol that reimagines AI agent teams as living, version-controlled Markdown documents. Instead of relying on opaque orch…

这个 GitHub 项目在“how to create Polis agent team Markdown file”上为什么会引发关注?

Polis’s architecture is deceptively simple yet profoundly powerful. At its core, the protocol defines a standard Markdown schema that describes an entire multi-agent system. This schema includes: Agent Definitions: Each…

从“Polis protocol vs LangChain comparison”看,这个 GitHub 项目的热度表现如何?

当前相关 GitHub 项目总星标约为 0,近一日增长约为 0,这说明它在开源社区具有较强讨论度和扩散能力。