Tend的注意力協議：人機協作的新基礎設施

2026年4月10日下午12:26 AINews HN AI/ML

隨著AI代理的普及，它們可能成為新的數位干擾源，反而破壞了其承諾的協作。Tend正在構建一個新穎的基礎設施層——注意力協議，旨在協調人與機器之間的專注力。這代表著從單純的指令執行到協調式關注的根本性轉變。

The article body is currently shown in English by default. You can generate the full version in this language on demand.

The vision of seamless human-AI collaboration is colliding with a harsh reality: cognitive overload. Each well-intentioned AI assistant—from scheduling bots to research agents—generates its own stream of interruptions, creating a cacophony that fragments user attention. Tend, emerging from stealth, proposes a radical solution: not another notification manager, but a foundational protocol layer that treats human attention as a first-class, programmable resource. The core concept is an 'attention plane'—a lightweight, open specification that allows applications, operating systems, and AI agents to declare their intent to engage, understand a user's current cognitive context (e.g., 'deep work', 'meeting', 'available'), and negotiate the timing and modality of interactions. This moves beyond simple 'Do Not Disturb' modes into a dynamic, context-aware system where a user's calendar, task priorities, and real-time focus state are synthesized into a machine-readable signal that AI can respect. The significance lies in its infrastructural ambition. Tend is not building an end-user app, but the plumbing for a new class of 'attention-aware' software. If successful, it could become as critical to human-AI interaction as TCP/IP is to networking, enabling scalable cooperation by preventing the chaos of uncoordinated agentic interruptions. The company's early technical papers outline a decentralized architecture where user devices host a local 'Attention Engine' that brokers requests, preserving privacy while enabling coordination. The ultimate bet is that for AI to become truly helpful, it must learn not just what to do, but when and how to interrupt—a problem that requires systemic, not application-specific, solutions.

Technical Deep Dive

Tend's proposed architecture rests on three core pillars: the Attention State Model, the Negotiation Protocol, and the Local Orchestration Engine. Unlike centralized notification services (Apple's Focus, Google's Digital Wellbeing), Tend envisions a decentralized protocol where the user's device acts as the sovereign broker of their attention.

The Attention State Model is a machine-readable representation of a user's cognitive context. It goes beyond binary 'busy/available' to include dimensions like:
- Cognitive Load: Estimated mental bandwidth, potentially inferred from application usage patterns, calendar density, or even biometric data (with explicit consent).
- Task Context: The active project or goal, often pulled from active documents, communication threads, or task manager APIs.
- Interruptibility Score: A dynamic value determining the cost of an interruption, influenced by the above and historical user feedback.

This state is computed locally by the Attention Engine, a lightweight daemon. The engine ingests signals from the OS, applications, and connected devices to maintain a real-time model. A key technical challenge is creating a unified, cross-application context model. Tend's whitepaper suggests leveraging natural language processing on window titles, document content, and meeting transcripts (processed locally) to generate embeddings that represent the current 'topic' of focus.

The Negotiation Protocol is the communication layer. When an AI agent or application needs user attention, it sends a structured Attention Request to the local engine. This request includes metadata: Intent (e.g., 'inform', 'query', 'require-action'), Estimated Engagement Time (how long the interaction will take), Priority (declared by the requesting entity), and Optimal Modality (e.g., banner notification, audio cue, full-screen takeover). The local engine evaluates this request against the current Attention State, applies user-configured rules, and returns a Response: 'granted', 'denied', 'scheduled-for-later', or 'request-more-info'. For complex scenarios, a brief back-and-forth negotiation can occur.

A relevant open-source project exploring similar territory is `calm-notifications` (GitHub: calm-tech/calm-notifications), a framework for building less intrusive software. While not a protocol, it embodies the philosophy of 'calm technology' that informs Tend's approach. Another is `activitywatch` (GitHub: ActivityWatch/activitywatch), an open-source time-tracker that automatically logs computer usage. Tend's context modeling could integrate such tools to infer cognitive state from application switching patterns.

| Protocol Layer | Function | Key Innovation |
|---|---|---|
| State Model | Encodes user's cognitive context & interruptibility | Multi-dimensional, real-time, locally computed |
| Negotiation Protocol | Standardized API for attention requests/responses | Enables machine-to-machine negotiation over timing & modality |
| Orchestration Engine | Local broker that applies rules & policies | User sovereignty; privacy-preserving; integrates OS signals |

Data Takeaway: The technical blueprint reveals a shift from OS-level, rule-based filtering to a dynamic, semantically-aware brokerage system. The local-first architecture is a critical design choice for privacy and responsiveness, but it places a significant burden on the engine's ability to accurately model complex human states from noisy data.

Key Players & Case Studies

The problem space of attention management is crowded, but Tend's infrastructural approach is distinct. Key players fall into several categories:

1. OS-Level Focus Modes: Apple's Focus, Google's Digital Wellbeing, and Microsoft's Focus Sessions are the incumbent solutions. They are rule-based (time/location/application) and user-configured, but lack dynamic context awareness and cannot negotiate with AI agents. They treat all interruptions equally within a blocked category.
2. Enterprise Workflow Orchestrators: Tools like Zapier, Make, and n8n automate workflows between apps but operate at the data layer, not the attention layer. They generate tasks that ultimately become notifications in other apps, exacerbating the problem Tend aims to solve.
3. AI Agent Platforms: Cognition Labs (Devon), OpenAI (GPTs with actions), and xAI (Grok) are building increasingly autonomous agents. Currently, these agents interact with users through chat interfaces or generate notifications via connected apps. They have no inherent understanding of a user's broader attention state. A partnership or integration with a protocol like Tend would be a logical evolution.
4. Research & Academia: Researchers like Gloria Mark (UC Irvine), author of *Attention Span*, have empirically documented the high cost of task switching. The MIT Media Lab's 'Rainbow' project explored emotion-aware notifications. This body of work provides the scientific foundation for systems like Tend, but prior attempts have largely remained academic.

A revealing case study is the evolution of Slack. It began as a 'notification hub' but has struggled with channel overload. Its 'Do Not Disturb' and scheduled notification features are crude tools. The company's investment in Slack AI for channel summarization is an attempt to reduce the *volume* of required attention, not to manage its *timing*. This highlights the gap Tend aims to fill.

| Solution Type | Example | Approach to Attention | Limitation vs. Tend's Vision |
|---|---|---|---|
| OS/App Filters | Apple Focus | Static, user-set rules | No context awareness, no agent negotiation |
| AI Summarization | Slack AI, Gmail Priority | Reduce information volume | Doesn't solve timing of interruptions |
| Agent Platforms | OpenAI GPTs | Generate actions/notifications | Operate in isolation, unaware of other agents or user state |
| Protocol Layer | Tend | Dynamic, negotiated brokerage | Requires ecosystem buy-in; unproven at scale |

Data Takeaway: The competitive landscape is fragmented, with incumbents solving slices of the problem (volume or simple filtering). Tend's unique position is its horizontal, protocol-level ambition, but this also makes it dependent on the very platforms (OS, agent frameworks) it seeks to augment.

Industry Impact & Market Dynamics

If Tend gains traction, its impact would ripple across multiple layers of the tech stack.

For Operating Systems, it could become a critical differentiator. Imagine 'Windows 13 with Tend Protocol' marketed as the first OS truly designed for the AI era, where Copilot intelligently schedules its suggestions. Apple could integrate it deeply into Continuity, allowing a seamless attention state to flow between iPhone, Mac, and Vision Pro. The OS that best manages the cognitive toll of AI will have a significant user experience advantage.

For Enterprise Software, the value proposition is direct productivity gains. Studies by Gloria Mark and others suggest it takes an average of 23 minutes to fully refocus after a significant interruption. An enterprise version of Tend that integrates with Microsoft Teams, Salesforce, and SAP could coordinate notifications from all business systems, potentially reclaiming hours of focused work per employee per week. The market for enterprise productivity software is projected to exceed $102 billion by 2026 (Gartner), and a solution proving double-digit percentage gains in focused work time would command a premium.

For AI Developers, the protocol lowers a major adoption barrier: user annoyance. By providing a standard way for agents to be 'polite' and context-aware, it reduces the risk of agent rejection. This could accelerate the deployment of background AI agents in personal and professional life. We predict the emergence of 'Tend-compatible' as a selling point for AI tools.

The funding landscape reflects growing interest in 'AI-human interface' infrastructure. While Tend's funding is not fully public, it fits a trend. Related companies in human-AI interaction have seen significant venture capital:

| Company | Core Focus | Recent Funding | Investor Interest Signal |
|---|---|---|---|
| Tend | Attention Protocol | Undisclosed Seed (est. $3-5M) | Founders from OS/UX backgrounds |
| Rewind AI | Personal AI memory | $10M Series A (2023) | Demand for context-aware AI |
| Aomni | AI sales agent | $6.8M Seed (2023) | Autonomous agent space heating up |
| Cognition Labs | AI software engineer | $175M+ (Series B, 2024) | Massive bets on agent capability |

Data Takeaway: Investor capital is flowing aggressively into AI agents and the infrastructure to support them. Tend operates at the crucial but less glamorous intersection of capability and usability. Its success depends on proving that managing attention friction is not a feature, but a foundational service that unlocks the next wave of agent adoption.

Risks, Limitations & Open Questions

The vision is compelling, but the path is fraught with challenges.

1. The Cold Start & Ecosystem Problem: A protocol is worthless without widespread adoption. Tend must convince major OS vendors, AI platform providers, and popular applications to implement its spec. This is a classic chicken-and-egg dilemma. Their likely strategy is to first release an SDK and a compelling reference implementation (e.g., a Tend-powered smart calendar that brilliantly manages meetings and agent interactions) to demonstrate value and attract developers.

2. The Context Modeling Accuracy Problem: Can an algorithm truly infer a human's cognitive state reliably? Misclassification could be disastrous—blocking an urgent notification or allowing an interruption during a crucial moment. Early versions will likely be conservative and heavily reliant on explicit user calibration, limiting initial 'wow' factor.

3. Privacy and Security Paradox: The local-first model protects privacy but limits power. The most accurate attention state might require cloud-based analysis of emails, messages, and documents—a non-starter for many. Furthermore, the Attention Engine itself becomes a high-value target for malware seeking to understand user behavior or spoof attention states.

4. User Behavior and Over-Automation: Will users trust an algorithm to manage their most precious resource? There's a risk of creating a 'black box' for focus, where users feel controlled rather than assisted. Additionally, over-reliance on the system could atrophy personal attention management skills.

5. Economic and Attention Inequality: A premium, effective attention management system could become another tool that enhances the productivity of knowledge workers, widening the performance gap. It risks becoming a luxury good that further divides those who can afford focus and those bombarded by uncoordinated digital demands.

Open Questions: Will this remain a niche tool for AI power users, or can it achieve mainstream integration? Can it handle the multi-modal future of AR/VR, where interruptions are even more jarring? Who owns the rules and the model—the user, the OS vendor, or Tend?

AINews Verdict & Predictions

Tend's proposal is one of the most architecturally significant responses to the looming crisis of AI-driven distraction. It correctly identifies that the solution to managing multiple intelligences is not human willpower, but better machine-to-machine coordination. The protocol approach is bold and correct in its ambition.

Our Predictions:

1. Partial Adoption Path: Tend will not replace OS focus modes within 5 years. Instead, we predict it will succeed first as an enterprise middleware solution. A company will license Tend's technology to build 'Cognitive Workflow Hub' for large corporations, integrating with their existing SaaS stack. This provides a clear ROI (reduced context-switching costs) and a controlled environment for refinement.
2. Open-Source Fork: Within 18 months, if commercial adoption is slow, core components of the protocol will be open-sourced by the community or by Tend itself to spur developer innovation. A GitHub repo named `open-attention-protocol` will emerge, gaining rapid traction among indie developers and researchers.
3. Acquisition Target: Apple or Google will acquire a company in this space within 3 years. Their OS teams are acutely aware of the notification overload problem, and as they bake more AI into their systems, the need for a sophisticated attention layer becomes existential. Tend, or a competitor that emerges, is a prime acquisition target for its talent and IP.
4. The New Benchmark: Within 2 years, 'Tend Compliance' or a similar metric will become a talking point for new AI agent products, much like 'Energy Star' certification for appliances. It will signal that an agent is designed for harmonious collaboration.

Final Verdict: The 'attention protocol' concept is inevitable. The explosion of AI agents will force its creation. Tend is an early and thoughtful contender, but the winner may not bear its name. The company's ultimate impact may be in crystallizing the problem and proposing a viable architecture, thereby setting the agenda for the next generation of human-computer interaction. The race to build the attention infrastructure for the 'human-machine dance' has begun, and it will be one of the defining software challenges of this decade. Ignoring it will leave future AI ecosystems useful in theory but unbearable in practice.

常见问题

这次公司发布“Tend's Attention Protocol: The New Infrastructure for Human-AI Collaboration”主要讲了什么？

The vision of seamless human-AI collaboration is colliding with a harsh reality: cognitive overload. Each well-intentioned AI assistant—from scheduling bots to research agents—gene…

从“Tend attention protocol vs Apple Focus modes”看，这家公司的这次发布为什么值得关注？

Tend's proposed architecture rests on three core pillars: the Attention State Model, the Negotiation Protocol, and the Local Orchestration Engine. Unlike centralized notification services (Apple's Focus, Google's Digital…

围绕“how does Tend attention engine work technically”，这次发布可能带来哪些后续影响？

后续通常要继续观察用户增长、产品渗透率、生态合作、竞品应对以及资本市场和开发者社区的反馈。