Parameter	Description	Required
INLINECODE0	Path to documentation directory (for existing analyses)	Optional
INLINECODE1

Level	Criteria
Strong	All LLM outputs use Pydantic/dataclass schemas with validators
Partial

Level	Criteria
Strong	Prompts in separate files, templated (Jinja2), versioned
Partial

Level	Criteria
Strong	Custom context format, token optimization, typed events, compaction
Partial

Level	Criteria
Strong	All tool outputs schema-validated, handlers type-safe
Partial

Level	Criteria
Strong	Single serializable state object with all execution metadata
Partial

Level	Criteria
Strong	REST API + webhook resume, pause at any point including mid-tool
Partial

Level	Criteria
Strong	INLINECODE31 tool with question/options/urgency/format
Partial

Level	Criteria
Strong	Custom routing functions, conditional edges, execution mode control
Partial

Level	Criteria
Strong	Errors in context, retry with threshold, automatic escalation
Partial

Level	Criteria
Strong	3+ specialized agents, each with single responsibility, step limits
Partial

Level	Criteria
Strong	CLI + REST + WebSocket + webhooks + chat integrations
Partial

Level	Criteria
Strong	Immutable state updates, side effects isolated to tools/handlers
Partial

Level	Criteria
Strong	Automatic pre-fetch of related tests, files, docs before planning
Partial

Score	Meaning	Action
Strong	Fully implements principle	Maintain, minor optimizations
Partial

12-Factor Agents 合规性分析

参考：12-Factor Agents

输入参数

参数	描述	必填
docspath	文档目录路径（用于已有分析）	可选
codebasepath

待分析代码库的根路径 | 必填 |

分析框架

因子1：自然语言到工具调用

原则： 使用模式验证的输出，将自然语言输入转换为结构化、确定性的工具调用。

搜索模式：
bash

查找 Pydantic 模式

grep -r class.BaseModel --include=.py
grep -r TaskDAG\|TaskResponse\|ToolCall --include=*.py

查找 JSON 模式生成

grep -r modeljsonschema\|json_schema --include=*.py

查找结构化输出生成

grep -r outputtype\|responsemodel --include=*.py

文件模式： /agents/.py，/schemas/.py，/models/*.py

合规标准：

级别	标准
强	所有 LLM 输出使用带验证器的 Pydantic/dataclass 模式
部分

部分输出有类型，但存在字典返回或未验证的字符串 |
| 弱 | LLM 返回原始字符串，手动或用正则解析 |

反模式：

- 无模式验证的 json.loads(llm_response)
对 LLM 响应使用 output.split() 或正则解析
代理返回 dict[str, Any] 类型
LLM 输出与处理器执行之间无验证

因子2：拥有你的提示词

原则： 将提示词视为你控制、版本管理和迭代的一等代码。

搜索模式：
bash

查找内嵌提示词

grep -r SYSTEMPROMPT\|systemprompt --include=*.py
grep -r .You are --include=.py

查找模板系统

grep -r jinja\|Jinja\|render_template --include=*.py find . -name .jinja2 -o -name .j2

查找提示词目录

find . -type d -name prompts

文件模式： /prompts/，/templates/，/agents/*.py

合规标准：

级别	标准
强	提示词在单独文件中，使用模板（Jinja2），有版本管理
部分

提示词作为模块常量，有一定参数化 |
| 弱 | 提示词硬编码在函数内联中，仅使用 f-string |

反模式：

- 在代理方法中内联使用 fYou are a {role}...
提示词与业务逻辑混合
无法在不修改代码的情况下迭代提示词
没有提示词版本管理或 A/B 测试能力

因子3：拥有你的上下文窗口

原则： 控制历史记录、状态和工具结果如何格式化为 LLM 输入。

搜索模式：
bash

查找上下文/消息管理

grep -r AgentMessage\|ChatMessage\|messages --include=*.py
grep -r contextwindow\|contextcompiler --include=*.py

查找自定义序列化

grep -r toxml\|tocontext\|serialize --include=*.py

查找令牌管理

grep -r tokencount\|maxtokens\|truncate --include=*.py

文件模式： /context/.py，/state/.py，/core/*.py

合规标准：

级别	标准
强	自定义上下文格式、令牌优化、类型化事件、压缩
部分

基本消息历史，有一定结构 |
| 弱 | 原始消息累积，仅使用标准 OpenAI 格式 |

反模式：

- 无限制的消息累积
内嵌大型工件（差异、文件）
无代理特定的上下文过滤
所有代理类型使用相同上下文

因子4：工具是结构化输出

原则： 工具产生模式验证的 JSON，触发确定性代码，而非魔法函数调用。

搜索模式：
bash

查找工具/响应模式

grep -r class.Response.BaseModel --include=*.py
grep -r ToolResult\|ToolOutput --include=*.py

查找确定性处理器

grep -r def handle\|def execute --include=*.py

查找验证层

grep -r modelvalidate\|parseobj --include=*.py

文件模式： /tools/.py，/handlers/.py，/agents/*.py

合规标准：

级别	标准
强	所有工具输出经过模式验证，处理器类型安全
部分

大多数工具有类型，部分存在松散字典返回 |
| 弱 | 工具返回任意字典，无验证层 |

反模式：

- 直接执行 LLM 输出的工具处理器
对 LLM 生成的代码使用 eval() 或 exec()
决策（LLM）与执行（代码）之间无分离
基于字符串匹配的魔法方法分发

因子5：统一执行状态

原则： 将执行状态（步骤、重试）与业务状态（消息、结果）合并。

搜索模式：
bash

查找状态模型

grep -r ExecutionState\|WorkflowState\|Thread --include=*.py

查找双状态系统

grep -r checkpoint\|MemorySaver --include=*.py grep -r sqlite\|database\|repository --include=*.py

查找状态重建

grep -r load_state\|restore\|reconstruct --include=*.py

文件模式： /state/.py，/models/.py，/database/*.py

合规标准：

级别	标准
强	单一可序列化状态对象，包含所有执行元数据
部分

状态存在但跨系统拆分（内存 + 数据库） |
| 弱 | 执行状态分散，需要多次查询才能重建 |

反模式：

- 重试计数与任务状态分开存储
错误历史在日志中但不在状态中
LangGraph 检查点 + 独立数据库存储
无统一事件线程

因子6：启动/暂停/恢复

原则： 代理支持简单的 API 用于启动、在任何点暂停和恢复。

搜索模式：
bash

查找 REST 端点

grep -r @router.post\|@app.post --include=*.py
grep -r start_workflow\|pause\|resume --include=*.py

查找中断机制

grep -r interruptbefore\|interruptafter --include=*.py

查找 Webhook 处理器

grep -r webhook\|callback --include=*.py

文件模式： /routes/.py，/api/.py，/orchestrator/*.py

合规标准：

级别	标准
强	REST API + Webhook 恢复，可在任何点暂停，包括工具执行中
部分

存在启动/暂停/恢复，但仅在粗粒度点 |
| 弱 | 仅 CLI 启动，无暂停/恢复能力 |

反模式：

- 阻塞的 input() 或 confirm() 调用
进程重启后无法恢复
审批仅在计划级别，而非每个工具
无基于 Webhook 的外部系统恢复

因子7：通过工具联系人类

原则： 人类联系是一个带有问题、选项和紧急程度的工具调用。

搜索模式：
bash

查找人类输入机制

grep -r typer.confirm\|input(\|prompt( --include=*.py
grep -r requesthumaninput\|human_contact --include=*.py

查找审批模式

grep -r approval\|approve\|reject --include=*.py

agent-architecture-analysis智能体架构分析

agent-architecture-analysis

12-Factor Agents Compliance Analysis

Input Parameters

Analysis Framework

Factor 1: Natural Language to Tool Calls

Factor 2: Own Your Prompts

Factor 3: Own Your Context Window

Factor 4: Tools Are Structured Outputs

Factor 5: Unify Execution State

Factor 6: Launch/Pause/Resume

Factor 7: Contact Humans with Tools

Factor 8: Own Your Control Flow

Factor 9: Compact Errors into Context

Factor 10: Small, Focused Agents

Factor 11: Trigger from Anywhere

Factor 12: Stateless Reducer

Factor 13: Pre-fetch Context

Output Format

Executive Summary Table

Per-Factor Analysis

Analysis Workflow

Quick Reference: Compliance Scoring

When to Use This Skill

12-Factor Agents 合规性分析

输入参数

分析框架

因子1：自然语言到工具调用

查找 Pydantic 模式

查找 JSON 模式生成

查找结构化输出生成

因子2：拥有你的提示词

查找内嵌提示词

查找模板系统

查找提示词目录

因子3：拥有你的上下文窗口

查找上下文/消息管理

查找自定义序列化

查找令牌管理

因子4：工具是结构化输出

查找工具/响应模式

查找确定性处理器

查找验证层

因子5：统一执行状态

查找状态模型

查找双状态系统

查找状态重建

因子6：启动/暂停/恢复

查找 REST 端点

查找中断机制

查找 Webhook 处理器

因子7：通过工具联系人类

查找人类输入机制

查找审批模式

查找结构化问题

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement