Main Agent Supervisor
This skill is for a supervisor layer over a main agent, not a generic task tracker.
Goal
Prevent the main agent from getting stuck on obvious decisions while still preserving real human control for risky or ambiguous actions.
Core design
Use a four-part model:
- 1. Classifier
- Decide whether a pending ask/action is:
-
AUTO
-
CONFIRM
- INLINECODE2
- 2. Pre-send gate
- Before the main agent sends a user-visible reply, ask:
- Is this asking for an obvious decision?
- Is there a safe default?
- Is the agent permission-looping?
- If yes, suppress the question and continue execution.
- 3. Triage / watchdog
- Borrowing from
claude-code-supervisor, classify agent state into:
-
FINE
-
NEEDS_NUDGE
-
STUCK
-
DONE
-
ESCALATE
- Use a lightweight pre-filter for obvious cases before invoking heavier review.
- 4. Task-state tracking for large tasks
- Borrowing from
task-supervisor, keep simple checkpoint files for long tasks.
- Track:
- started time
- status
- completed steps
- last updated
- current blocker / next step
Use this policy
AUTO
Proceed without bothering the user when all are true:
- - internal / local action
- reversible or low-risk
- no external send/publish
- no payment / secret / production change
- user intent is already clear
- there is one reasonable default
CONFIRM
Ask the user when any are true:
- - external send/publish
- destructive / irreversible action
- money / orders / account changes
- production/live-system changes
- privacy / compliance / legal sensitivity
ESCALATE
Ask only when blocked after reasonable retries or when multiple materially different paths exist.
Reply-shaping rules
When the main agent drafts a question, rewrite it if:
- - it is merely asking permission for an AUTO action
- it asks for a trivial preference that has a safe default
- it proposes extra scope that is obviously worth trying and reversible
Preferred rewrite:
- - state the chosen default
- continue execution
- mention assumptions briefly if needed
For larger tasks, pair this with a task-state file instead of ad-hoc check-in messages. That preserves progress visibility without interrupting the user for obvious decisions.
Best current pattern
For this workspace, the best practical setup is:
- - escalation classifier as the core policy
- pre-send gate as enforcement
- triage/watchdog for stuck detection
- task-state files for large tasks
- passive reviewer/audit log for tuning
References
Read these when needed:
- -
references/design.md — recommended architecture and message flow - INLINECODE11 — what existing public skills cover vs what they miss
- INLINECODE12 — workspace-specific OpenClaw implementation plan
主智能体主管
此技能适用于主智能体之上的监督层,而非通用任务追踪器。
目标
防止主智能体在明显决策上陷入僵局,同时保留对高风险或模糊操作的真实人工控制。
核心设计
采用四部分模型:
- 1. 分类器
- 判断待处理的请求/操作属于:
- 自动执行
- 需确认
- 需升级
- 2. 发送前门控
- 在主智能体发送用户可见回复前,询问:
- 是否在请求明显决策?
- 是否存在安全默认值?
- 智能体是否陷入权限循环?
- 若答案为是,则抑制该问题并继续执行。
- 3. 分诊/看门狗
- 借鉴claude-code-supervisor,将智能体状态分类为:
- 正常
- 需提示
- 卡住
- 完成
- 需升级
- 在调用更严格的审查前,对明显情况使用轻量级预过滤器。
- 4. 大型任务的状态追踪
- 借鉴task-supervisor,为长任务维护简单的检查点文件。
- 追踪:
- 开始时间
- 状态
- 已完成步骤
- 最后更新时间
- 当前阻塞项/下一步操作
使用此策略
自动执行
当满足所有条件时,无需打扰用户直接执行:
- - 内部/本地操作
- 可逆或低风险
- 无外部发送/发布
- 无支付/机密/生产变更
- 用户意图已明确
- 存在一个合理的默认选项
需确认
当满足任一条件时,询问用户:
- - 外部发送/发布
- 破坏性/不可逆操作
- 涉及金钱/订单/账户变更
- 生产/线上系统变更
- 隐私/合规/法律敏感性
需升级
仅在合理重试后仍被阻塞,或存在多个实质性不同路径时,才进行升级。
回复塑造规则
当主智能体起草问题时,在以下情况下重写:
- - 仅为自动执行操作请求许可
- 询问存在安全默认值的琐碎偏好
- 提出明显值得尝试且可逆的额外范围
推荐重写方式:
对于大型任务,使用任务状态文件而非临时检查消息。这样既能保持进度可见性,又不会因明显决策而中断用户。
最佳当前模式
对于此工作空间,最佳实践配置为:
- - 升级分类器作为核心策略
- 发送前门控作为执行机制
- 分诊/看门狗用于卡住检测
- 任务状态文件用于大型任务
- 被动审查/审计日志用于调优
参考资料
需要时阅读以下内容:
- - references/design.md — 推荐架构和消息流程
- references/comparison.md — 现有公开技能的覆盖范围与缺失内容
- references/implementation.md — 工作空间特定的OpenClaw实施计划