Main Agent Supervisor

This skill is for a supervisor layer over a main agent, not a generic task tracker.

Goal

Prevent the main agent from getting stuck on obvious decisions while still preserving real human control for risky or ambiguous actions.

Core design

Use a four-part model:

1. Classifier

- Decide whether a pending ask/action is: - AUTO - CONFIRM - INLINECODE2

2. Pre-send gate

- Before the main agent sends a user-visible reply, ask: - Is this asking for an obvious decision? - Is there a safe default? - Is the agent permission-looping? - If yes, suppress the question and continue execution.

3. Triage / watchdog

- Borrowing from claude-code-supervisor, classify agent state into: - FINE - NEEDS_NUDGE - STUCK - DONE - ESCALATE - Use a lightweight pre-filter for obvious cases before invoking heavier review.

4. Task-state tracking for large tasks

- Borrowing from task-supervisor, keep simple checkpoint files for long tasks. - Track: - started time - status - completed steps - last updated - current blocker / next step

Use this policy

AUTO

Proceed without bothering the user when all are true:

- internal / local action
reversible or low-risk
no external send/publish
no payment / secret / production change
user intent is already clear
there is one reasonable default

CONFIRM

Ask the user when any are true:

- external send/publish
destructive / irreversible action
money / orders / account changes
production/live-system changes
privacy / compliance / legal sensitivity

ESCALATE

Ask only when blocked after reasonable retries or when multiple materially different paths exist.

Reply-shaping rules

When the main agent drafts a question, rewrite it if:

- it is merely asking permission for an AUTO action
it asks for a trivial preference that has a safe default
it proposes extra scope that is obviously worth trying and reversible

Preferred rewrite:

- state the chosen default
continue execution
mention assumptions briefly if needed

For larger tasks, pair this with a task-state file instead of ad-hoc check-in messages. That preserves progress visibility without interrupting the user for obvious decisions.

Best current pattern

For this workspace, the best practical setup is:

- escalation classifier as the core policy
pre-send gate as enforcement
triage/watchdog for stuck detection
task-state files for large tasks
passive reviewer/audit log for tuning

References

Read these when needed:

- references/design.md — recommended architecture and message flow
INLINECODE11 — what existing public skills cover vs what they miss
INLINECODE12 — workspace-specific OpenClaw implementation plan

主智能体主管

此技能适用于主智能体之上的监督层，而非通用任务追踪器。

目标

防止主智能体在明显决策上陷入僵局，同时保留对高风险或模糊操作的真实人工控制。

核心设计

采用四部分模型：

1. 分类器

- 判断待处理的请求/操作属于： - 自动执行 - 需确认 - 需升级

2. 发送前门控

- 在主智能体发送用户可见回复前，询问： - 是否在请求明显决策？ - 是否存在安全默认值？ - 智能体是否陷入权限循环？ - 若答案为是，则抑制该问题并继续执行。

3. 分诊/看门狗

- 借鉴claude-code-supervisor，将智能体状态分类为： - 正常 - 需提示 - 卡住 - 完成 - 需升级 - 在调用更严格的审查前，对明显情况使用轻量级预过滤器。

4. 大型任务的状态追踪

- 借鉴task-supervisor，为长任务维护简单的检查点文件。 - 追踪： - 开始时间 - 状态 - 已完成步骤 - 最后更新时间 - 当前阻塞项/下一步操作

使用此策略

自动执行

当满足所有条件时，无需打扰用户直接执行：

- 内部/本地操作
可逆或低风险
无外部发送/发布
无支付/机密/生产变更
用户意图已明确
存在一个合理的默认选项

需确认

当满足任一条件时，询问用户：

- 外部发送/发布
破坏性/不可逆操作
涉及金钱/订单/账户变更
生产/线上系统变更
隐私/合规/法律敏感性

需升级

仅在合理重试后仍被阻塞，或存在多个实质性不同路径时，才进行升级。

回复塑造规则

当主智能体起草问题时，在以下情况下重写：

- 仅为自动执行操作请求许可
询问存在安全默认值的琐碎偏好
提出明显值得尝试且可逆的额外范围

推荐重写方式：

- 说明所选默认值
继续执行
必要时简要提及假设

对于大型任务，使用任务状态文件而非临时检查消息。这样既能保持进度可见性，又不会因明显决策而中断用户。

最佳当前模式

对于此工作空间，最佳实践配置为：

- 升级分类器作为核心策略
发送前门控作为执行机制
分诊/看门狗用于卡住检测
任务状态文件用于大型任务
被动审查/审计日志用于调优

参考资料

需要时阅读以下内容：

- references/design.md — 推荐架构和消息流程
references/comparison.md — 现有公开技能的覆盖范围与缺失内容
references/implementation.md — 工作空间特定的OpenClaw实施计划

main-agent-supervisor主代理监督