AI OS Blueprint
Framework: The OS Stack Scorecard
Worth $500/hr consultant time. Yours for $39.
What This Skill Does
Takes you from "I use Claude like a chatbot" to "Claude is my operating system." Covers persistent memory, skill architecture, context hooks, and sub-agent routing. Includes a full audit of your current setup with a scored gap analysis and rebuild plan.
Problem it solves: The jump from ChatGPT user to AI-native operator is a mindset shift, not just a tooling upgrade. Most people add one automation at a time and wonder why nothing compounds. This blueprint gives you the architecture that makes everything stack.
The OS Stack Scorecard
A diagnostic framework that audits your current AI setup across 5 layers and produces a prioritized rebuild plan. Score your stack, find the bottleneck, fix it first.
The 5 Layers of an AI OS
CODEBLOCK0
Fix Layer 1 before Layer 2. Fix Layer 2 before Layer 3. Etc.
Skipping layers is why most AI setups plateau.
OS Stack Scorecard: Full Audit
Layer 1 — Foundation Audit
Score each item 0-2:
| Component | 0 (Missing) | 1 (Partial) | 2 (Done) | Your Score |
|---|
| AI client installed (Claude Desktop / OpenClaw) | None | Old version | Current | |
| MCP servers connected |
0 tools | 1-2 tools | 3+ tools | |
| API keys stored securely | In chat history | In .env file | In secrets manager |
_ |
| Config file backed up | Never | Occasionally | Version controlled |
_ |
| Restart/reload procedure known | Unknown | Sometimes works | Documented |
_ |
Layer 1 Score: _ / 10
If Layer 1 < 6: Stop. Fix foundation before anything else.
→ Install MCP Server Setup Kit (sold separately) to fix this layer.
Layer 2 — Context & Memory Audit
Score each item 0-2:
| Component | 0 (Missing) | 1 (Partial) | 2 (Done) | Your Score |
|---|
| Persistent memory file exists | None | Informal notes | Structured MEMORY.md | |
| Agent knows your role/context |
Starts fresh each session | Some in system prompt | Full context file | |
| Daily memory updated regularly | Never | Inconsistently | Every session |
_ |
| Long-term decisions captured | Lost | In chat history | In searchable file |
_ |
| Context files are version controlled | No | Sometimes | Yes, always |
_ |
Layer 2 Score: _ / 10
If Layer 2 < 6: Agent has amnesia. Everything you teach it, it forgets.
→ See Memory Architecture section below.
Layer 3 — Skill Architecture Audit
Score each item 0-2:
| Component | 0 (Missing) | 1 (Partial) | 2 (Done) | Your Score |
|---|
| Skills installed for your use cases | Using base model only | 1-3 skills | 5+ targeted skills | |
| Skills have named frameworks |
Generic prompts | Some structure | Proprietary frameworks | |
| Skills are version controlled | No | Sometimes | Yes |
_ |
| Skills are tested before use | Never | Ad hoc | Formal test per skill |
_ |
| Skill catalog documented | No | Mental map only | Written index |
_ |
Layer 3 Score: _ / 10
If Layer 3 < 6: You're reinventing the wheel every session.
→ See Skill Architecture section below.
Layer 4 — Agent Routing Audit
Score each item 0-2:
| Component | 0 (Missing) | 1 (Partial) | 2 (Done) | Your Score |
|---|
| You know when to use sub-agents | Never use | Occasionally | Systematically | |
| Tasks are routed by type |
One agent does everything | Some routing | Model × task matrix | |
| Expensive model use is justified | Always default | Sometimes cheap | Cheapest model that works |
_ |
| Parallel work happens | Always sequential | Sometimes parallel | Default parallel |
_ |
| Agent output is reviewed before action | Never | Sometimes | Always for risky ops |
_ |
Layer 4 Score: _ / 10
If Layer 4 < 6: You're using a supercomputer like a calculator.
→ See Agent Routing section below.
Layer 5 — Outputs & Integrations Audit
Score each item 0-2:
| Component | 0 (Missing) | 1 (Partial) | 2 (Done) | Your Score |
|---|
| Agent outputs land in the right tools | Copy-paste manually | Partially automated | Fully integrated | |
| Slack notifications configured |
None | Manual | Automated via loop | |
| Notion/Linear/GitHub updated automatically | Never | Sometimes | Systematically |
_ |
| Outputs are searchable/retrievable | Lost after session | In notes | In indexed system |
_ |
| Feedback loops exist (agent learns from output) | None | Informal | Structured |
_ |
Layer 5 Score: _ / 10
Total OS Stack Score: _ / 50
Scorecard Interpretation
| Total Score | Status | Priority Action |
|---|
| 45-50 | AI-Native Operator | Optimize costs + scale loops |
| 35-44 |
Advanced User | Fix weakest layer, add routing |
| 25-34 | Intermediate | Memory + skill architecture gaps |
| 15-24 | Beginner | Start with Layer 1-2 rebuild |
| 0-14 | Day 1 | Follow the full OS Build Order below |
The OS Build Order
Build exactly in this order. Do not skip.
Phase 1: Foundation (Week 1, ~4 hours)
Goal: Agent is connected to your tools and can receive commands.
CODEBLOCK1
Done when: You can ask "list my GitHub issues" and get real data.
Phase 2: Memory Architecture (Week 1-2, ~3 hours)
Goal: Agent remembers who you are, what you're building, and what matters.
Core memory files to create:
CODEBLOCK2
SOUL.md template:
CODEBLOCK3
MEMORY.md template:
CODEBLOCK4
Done when: You start a session and the agent knows your name, mission, and current priority without you explaining it.
Phase 3: Skill Architecture (Week 2, ~4 hours)
Goal: Agent has 5+ targeted capabilities installed as named skills.
Skill selection decision tree:
CODEBLOCK5
Skill quality checklist (for each skill you install):
CODEBLOCK6
Skill catalog file (create at project root):
CODEBLOCK7
Done when: You can say "use [skill]" and the agent knows exactly what framework to apply.
Phase 4: Agent Routing (Week 2-3, ~3 hours)
Goal: Right model for right task. Parallel when possible.
Model Routing Matrix:
| Task Type | Use | Cost Level |
|---|
| File reads, status checks, simple lookups | Haiku / cheap model | $ |
| Writing, content, standard analysis |
Sonnet / mid model | $$ |
| Architecture decisions, complex strategy | Opus / best model | $$$ |
| Sub-agents doing simple tasks | Haiku | $ |
| Sub-agents doing writing | Sonnet | $$ |
Sub-agent trigger rules:
CODEBLOCK8
Done when: You're running parallel sub-agents on independent tasks and using Haiku for simple work automatically.
Phase 5: Output Integration (Week 3, ~3 hours)
Goal: Agent outputs land in the right place automatically.
Integration checklist:
CODEBLOCK9
Done when: You get a Slack notification from your agent without asking for it.
Architecture Reference Diagrams
Minimal Viable AI OS (Good)
CODEBLOCK10
Standard AI OS (Better)
CODEBLOCK11
Full AI OS (Best)
CODEBLOCK12
Common Anti-Patterns (and fixes)
| Anti-Pattern | Symptom | Fix |
|---|
| Context amnesia | Re-explaining yourself every session | Build Layer 2 memory stack |
| One-size-fits-all model |
Burning credits on simple tasks | Apply Model Routing Matrix |
| Serial agent work | Tasks take 10x longer than needed | Enable sub-agent parallelism |
| Prompt not skill | Same prompts repeated every session | Package into SKILL.md |
| No approval gate | Agent sends external messages unexpectedly | Add gate to all write ops |
| Config not backed up | One system wipe loses everything | Git-commit your config |
Example Session
User prompt:
"Audit my current AI setup and tell me what to build next."
Agent response using this skill:
- 1. Asks user to self-score each layer of the OS Stack Scorecard
- Calculates total score and identifies weakest layer
- Generates prioritized build order based on scores
- Provides specific implementation checklist for the weakest layer
- Estimates time investment per phase
- Books follow-up check: "In 1 week, re-score Layer [X]"
Bundle Note
This skill is part of the AI Setup & Productivity Pack ($79 bundle):
- - MCP Server Setup Kit ($19)
- Agentic Loop Designer ($29)
- AI OS Blueprint ($39) — you are here
- Context Budget Optimizer ($19)
- Non-Technical Agent Quickstart ($9)
Save $36 with the full bundle. Built by @RemyClaw.
AI OS 蓝图
框架:操作系统堆栈评分卡
价值500美元/小时的咨询时间。仅需39美元。
本技能的作用
带你从我把Claude当聊天机器人用进阶到Claude就是我的操作系统。涵盖持久记忆、技能架构、上下文钩子和子代理路由。包含对你当前设置的全面审计,附带评分差距分析和重建计划。
解决的问题: 从ChatGPT用户到AI原生操作者的转变是思维模式的转变,而不仅仅是工具升级。大多数人一次只添加一个自动化,却不知道为什么没有产生叠加效应。这份蓝图为你提供了让一切都能堆叠起来的架构。
操作系统堆栈评分卡
一个诊断框架,在5个层级上审计你当前的AI设置,并生成一个优先重建计划。给你的堆栈打分,找到瓶颈,优先修复。
AI操作系统的5个层级
┌──────────────────────────────────────────────┐
│ 第5层:输出与集成 │
│ (工作成果落在哪里?) │
├──────────────────────────────────────────────┤
│ 第4层:代理路由 │
│ (谁做什么工作?) │
├──────────────────────────────────────────────┤
│ 第3层:技能架构 │
│ (安装了哪些能力?) │
├──────────────────────────────────────────────┤
│ 第2层:上下文与记忆 │
│ (代理知道什么?) │
├──────────────────────────────────────────────┤
│ 第1层:基础 │
│ (连接了哪些工具?) │
└──────────────────────────────────────────────┘
先修复第1层,再修复第2层。先修复第2层,再修复第3层。以此类推。
跳过层级是大多数AI设置陷入瓶颈的原因。
操作系统堆栈评分卡:完整审计
第1层 — 基础审计
每项评分0-2分:
| 组件 | 0 (缺失) | 1 (部分) | 2 (完成) | 你的分数 |
|---|
| 已安装AI客户端 (Claude Desktop / OpenClaw) | 无 | 旧版本 | 最新版本 | |
| 已连接MCP服务器 |
0个工具 | 1-2个工具 | 3个以上工具 | |
| API密钥安全存储 | 在聊天历史中 | 在.env文件中 | 在密钥管理器中 |
_ |
| 配置文件已备份 | 从未 | 偶尔 | 版本控制 |
_ |
| 重启/重新加载流程已知 | 不知道 | 有时有效 | 已记录 |
_ |
第1层得分:_ / 10
如果第1层 < 6: 停止。先修复基础,再做其他事。
→ 安装MCP服务器设置工具包(单独出售)来修复这一层。
第2层 — 上下文与记忆审计
每项评分0-2分:
| 组件 | 0 (缺失) | 1 (部分) | 2 (完成) | 你的分数 |
|---|
| 存在持久记忆文件 | 无 | 非正式笔记 | 结构化的MEMORY.md | |
| 代理知道你的角色/背景 |
每次会话从零开始 | 系统提示中有部分 | 完整的上下文文件 | |
| 每日记忆定期更新 | 从不 | 不规律 | 每次会话 |
_ |
| 长期决策被记录 | 丢失 | 在聊天历史中 | 在可搜索文件中 |
_ |
| 上下文文件已版本控制 | 否 | 有时 | 是,总是 |
_ |
第2层得分:_ / 10
如果第2层 < 6: 代理有健忘症。你教给它的一切,它都会忘记。
→ 参见下面的记忆架构部分。
第3层 — 技能架构审计
每项评分0-2分:
| 组件 | 0 (缺失) | 1 (部分) | 2 (完成) | 你的分数 |
|---|
| 为你的用例安装了技能 | 仅使用基础模型 | 1-3个技能 | 5个以上针对性技能 | |
| 技能有命名框架 |
通用提示 | 一些结构 | 专有框架 | |
| 技能已版本控制 | 否 | 有时 | 是 |
_ |
| 技能在使用前经过测试 | 从不 | 临时测试 | 每个技能正式测试 |
_ |
| 技能目录已记录 | 否 | 仅心理地图 | 书面索引 |
_ |
第3层得分:_ / 10
如果第3层 < 6: 你每次会话都在重复造轮子。
→ 参见下面的技能架构部分。
第4层 — 代理路由审计
每项评分0-2分:
| 组件 | 0 (缺失) | 1 (部分) | 2 (完成) | 你的分数 |
|---|
| 你知道何时使用子代理 | 从不使用 | 偶尔 | 系统性地 | |
| 任务按类型路由 |
一个代理做所有事 | 一些路由 | 模型×任务矩阵 | |
| 昂贵模型的使用有正当理由 | 总是默认 | 有时用便宜的 | 能工作的最便宜模型 |
_ |
| 并行工作发生 | 总是串行 | 有时并行 | 默认并行 |
_ |
| 代理输出在行动前被审查 | 从不 | 有时 | 高风险操作总是 |
_ |
第4层得分:_ / 10
如果第4层 < 6: 你在把超级计算机当计算器用。
→ 参见下面的代理路由部分。
第5层 — 输出与集成审计
每项评分0-2分:
| 组件 | 0 (缺失) | 1 (部分) | 2 (完成) | 你的分数 |
|---|
| 代理输出落在正确的工具中 | 手动复制粘贴 | 部分自动化 | 完全集成 | |
| Slack通知已配置 |
无 | 手动 | 通过循环自动化 | |
| Notion/Linear/GitHub自动更新 | 从不 | 有时 | 系统性地 |
_ |
| 输出可搜索/可检索 | 会话后丢失 | 在笔记中 | 在索引系统中 |
_ |
| 存在反馈循环 (代理从输出中学习) | 无 | 非正式 | 结构化 |
_ |
第5层得分:_ / 10
操作系统堆栈总分:_ / 50
评分卡解读
| 总分 | 状态 | 优先行动 |
|---|
| 45-50 | AI原生操作者 | 优化成本 + 扩展循环 |
| 35-44 |
高级用户 | 修复最弱层级,添加路由 |
| 25-34 | 中级 | 记忆 + 技能架构差距 |
| 15-24 | 初级 | 从第1-2层重建开始 |
| 0-14 | 第一天 | 按照下面的完整操作系统构建顺序 |
操作系统构建顺序
严格按照此顺序构建。不要跳过。
阶段1:基础 (第1周,约4小时)
目标: 代理连接到你的工具并能接收命令。
检查清单:
□ 安装OpenClaw或Claude Desktop (最新版本)
□ 连接3个以上MCP服务器 (至少GitHub、Notion、Slack)
□ 将所有API密钥存储在密钥管理器中 (不要用.env文件)
□ 将配置备份到git仓库
□ 测试:Claude能读取和写入每个连接的工具
完成标志: 你可以问列出我的GitHub问题并获得真实数据。
阶段2:记忆架构 (第1-2周,约3小时)
目标: 代理记住你是谁、你在构建什么、以及什么重要。
要创建的核心记忆文件:
SOUL.md — 这个代理是谁?使命是什么?
MEMORY.md — 长期决策、偏好、上下文
memory/YYYY-MM-DD.md — 每日会话笔记 (每次会话追加)
FOCUS.md — 当前第一优先级 (每次会话更新)
SOUL.md模板:
markdown
代理身份
名称:[代理名称]
使命:[一句话 — 我们在构建/实现什么?]
所有者:[你的名字]
语气:[语调:直接/温暖/分析型等]
操作原则
- 1. [核心行为规则]
- [核心行为规则]
- [核心行为规则]
关键绩效指标
MEMORY.md模板:
markdown
长期记忆
已做决策
已学偏好
活跃项目