palaia v2.7 — Agent Memory Skill

palaia is the memory system for OpenClaw agents. It gives you persistent, searchable, crash-safe memory across sessions. You don't need to understand how it works — just use the commands below.

First-Time Setup

When a user asks you to "set up palaia", "enable memory", or "remember things across sessions", run these steps:

CODEBLOCK0

CODEBLOCK1

This creates .palaia/ with SQLite storage and automatically configures fastembed for semantic search. No manual configuration needed.

Confirm success to the user: "Memory is ready. I'll automatically remember important context from our conversations."

If pip is not available, try alternatives in this order:
CODEBLOCK2

Optional extras (install as needed):
CODEBLOCK3

Note: palaia[fastembed] already includes sqlite-vec for native SIMD vector search. No separate install needed.

OpenClaw Plugin Setup (for Auto-Capture and Auto-Recall)

After the CLI is installed, wire it into OpenClaw:

CODEBLOCK4

Then activate the memory slot in OpenClaw config:
CODEBLOCK5

Restart the gateway and verify:
CODEBLOCK6

Agent Identity (optional)

Single-agent systems work with the default name. For multi-agent setups:
CODEBLOCK7

Updating palaia

When the user asks to update, or when palaia status shows a version mismatch:

CODEBLOCK8
This auto-detects the install method (pip/uv/pipx), preserves all installed extras (fastembed, mcp, sqlite-vec, curate), runs palaia doctor --fix, and upgrades the OpenClaw plugin if installed. No manual steps needed.

If palaia upgrade is not recognized (versions before v2.3.0), update manually:

CODEBLOCK9

After the manual update, palaia upgrade will be available for future updates.

Auto-Check (run every time this skill is loaded)

Before doing anything else with palaia, run palaia doctor silently. If it reports ANY warnings, address them immediately by following palaia doctor --fix recommendations. Do not proceed with other palaia tasks until doctor reports no issues.

Core Concepts

Auto-Capture vs. Manual Write

Conversations are automatically captured when the OpenClaw plugin is active. Auto-capture handles routine knowledge extraction from conversations.

Manual writes (palaia write) are higher value. When you or the user identify something as especially important — a key decision, a reusable workflow, a critical fact from an external source — write it manually. Manually written entries are ranked higher than auto-captured ones in recall results, so they surface more prominently in future sessions. Use manual writes for:

- Important facts, decisions, or context from external sources
Reusable processes and workflows (team runbooks)
Sticky notes / reminders for future sessions (tasks)

Three Tiers

- HOT — Active memories (< 7 days or frequently accessed). Always searched.
WARM — Aging memories (7-30 days). Searched by default.
COLD — Archived memories (> 30 days). Only searched with --all.

Scopes

- private — Only the writing agent can read it
team — All agents in the workspace can read it (default)
public — Exportable and shareable across workspaces

Entry Types

- memory — Facts, decisions, learnings (default)
process — Workflows, checklists, SOPs (team runbooks)
task — Sticky notes / reminders for future sessions. Tasks are ephemeral: when marked done, they are automatically deleted. Never auto-captured — only created by explicit palaia write --type task.

Storage & Search

Database Backends

Backend	Use Case	Vector Search	Install
SQLite (default)	Local, single-agent or small team	sqlite-vec (native KNN) or Python fallback	Included
PostgreSQL

Distributed teams, multiple hosts | pgvector (ANN, IVFFlat/HNSW) | pip install 'palaia[postgres]' |

SQLite is zero-config — palaia init creates a single palaia.db file with WAL mode for crash safety. For teams with agents on multiple machines, PostgreSQL centralizes the store:
CODEBLOCK10

Semantic Vector Search

palaia uses hybrid search: BM25 keyword matching (always active) combined with semantic vector embeddings (when a provider is configured). This finds memories by meaning, not just keywords.

Embedding providers (checked in chain order, first available wins):

Provider	Type	Latency	Install
fastembed	Local (CPU)	~10ms/query	INLINECODE15 (default)
sentence-transformers

Configure the chain: INLINECODE20

Check what's available: INLINECODE21

Embed Server (Performance)

For fast CLI queries, palaia runs a background embed-server that keeps the model loaded in memory:

palaia embed-server --socket --daemon   # Start background server
palaia embed-server --status            # Check if running
palaia embed-server --stop              # Stop server

Without the server, each CLI call loads the model fresh (~3-5s). With the embed-server: ~1.5s per CLI query (Python startup + server call) or <500ms via MCP/Plugin (no CLI overhead).

The OpenClaw plugin starts the embed-server automatically. For CLI-only usage, it auto-starts on first query when a local provider (fastembed, sentence-transformers) is configured.

MCP Server (Claude Desktop, Cursor, any MCP host)

palaia works as a standalone MCP memory server — no OpenClaw required. Any AI tool that supports MCP can use palaia as persistent local memory.

CODEBLOCK12

Claude Desktop (~/.config/claude/claude_desktop_config.json):
CODEBLOCK13

Cursor (Settings → MCP Servers → Add, or .cursor/mcp.json):

- Command: INLINECODE24
Arguments: (none, or --root /path/to/.palaia)

Claude Code (~/.claude/settings.json):
CODEBLOCK14

MCP Tools:

Tool	Purpose
INLINECODE27	Semantic + keyword search across all memories
INLINECODE28

Read-only mode (--read-only): Disables palaia_store, palaia_edit, palaia_gc. Use this when connecting untrusted AI tools that should only read memories, not modify them.

Commands Reference

`palaia write` — Save structured knowledge

Use for important facts, reusable processes, sticky-note tasks, and knowledge from external sources. Manually written entries rank higher than auto-captured ones in recall.

CODEBLOCK15

Process naming convention: Use the format [Domain]: [What it does] for process titles. Specific titles prevent duplicates and make processes findable.

- Good: "Release: PyPI publish + ClawHub sync", INLINECODE41
Bad: "Deploy steps", "Release process" (too generic, will collide with other similar processes)

Before writing a new process, search for existing ones: palaia query "deploy" --type process. If a similar process exists, update it with palaia edit <id> instead of creating a new one.

`palaia query` — Semantic search

Find memories by meaning, not just keywords.

CODEBLOCK16

`palaia get` — Read a specific entry

CODEBLOCK17

Use when you have an entry ID from a query result or nudge message.

`palaia list` — Browse entries

CODEBLOCK18

`palaia status` — System health check

CODEBLOCK19

Shows: entry counts per tier, entry class breakdown (memory/process/task), active providers, index health, version info. Run this when something seems off or to verify setup.

`palaia project` — Project context management

CODEBLOCK20

`palaia memo` — Inter-agent messaging

CODEBLOCK21

`palaia priorities` — Injection priority management (NEW in v2.2)

Control which memories are injected into each agent's context.

CODEBLOCK22

Config stored in .palaia/priorities.json with layered overrides: global -> per-agent -> per-project.

`palaia curate analyze/apply` — Knowledge curation (NEW in v2.2)

For migrating knowledge to a new instance, cleaning up old entries, or reviewing accumulated knowledge.

CODEBLOCK23

Requires optional dependency: pip install palaia[curate] (adds scikit-learn).

Tell your user: "I can analyze your memory for cleanup. Want me to run a curation report?"

`palaia sync export/import` — Git-based knowledge exchange

CODEBLOCK24

Note: The old palaia export/palaia import aliases still work but are deprecated.

`palaia package export/import` — Portable knowledge packages

CODEBLOCK25

`palaia upgrade` — Update to latest version

CODEBLOCK26

Auto-detects the install method (pip/uv/pipx/brew), preserves all installed extras (fastembed, mcp, sqlite-vec, curate), runs palaia doctor --fix, and upgrades the OpenClaw npm plugin if present. Always use this instead of manual pip commands.

`palaia ui` — Local memory explorer (NEW in v2.7)

CODEBLOCK27

Browse, search, create, edit, and delete entries in the browser. Manual entries are highlighted with a gold border (1.3× recall boost). Tasks are post-its: clicking ✓ deletes them. The health pill in the header shows doctor status with actionable warnings. Localhost only — no authentication, no network exposure.

`palaia doctor` — Diagnostics and auto-fix

CODEBLOCK28

Run this first whenever something is wrong. It checks versions, repairs chains, rebuilds indexes, detects legacy systems, and handles migration.

`palaia gc` — Garbage collection

CODEBLOCK29

`palaia prune` — Selective cleanup (NEW in v2.5)

CODEBLOCK30

`palaia config` — Configuration

CODEBLOCK31

`palaia process` — Multi-step process tracking

CODEBLOCK32

`palaia lock/unlock` — Entry and project locking

CODEBLOCK33

`palaia edit` — Modify existing entries

CODEBLOCK34

Task lifecycle: Tasks are sticky notes. When you mark a task as done or wontfix, it is automatically deleted — not archived. This is intentional: completed reminders have no long-term value. If the outcome of the task should be remembered, write a separate memory entry before marking the task done.

Other commands

CODEBLOCK35

All commands support --json for machine-readable output.

Smart Nudging

palaia's CLI output contains contextual hints prefixed with [palaia]. These are important guidance.

As an agent, you should:

- Read and act on nudge messages in CLI output
Never suppress or ignore [palaia] prefixed output
Follow recommendations (e.g., "consider running palaia curate", "related process found")
Check for unread memos when nudged

How nudging works:

- Nudges are contextual: they appear when relevant (e.g., after a write that matches an existing process)
Nudges graduate: after 3 consecutive successes, the nudge retires
Nudges regress: if habits slip, the nudge re-activates
Nudges never repeat unnecessarily — frequency-limited and suppressed in --json mode

Multi-Agent Setup

Scopes across agents

- private entries are only visible to the writing agent
INLINECODE77 entries are visible to all agents in the workspace
INLINECODE78 entries can be exported across workspaces

Agent identity

Set via --agent flag, PALAIA_AGENT env var, or palaia init --agent NAME. Stored in .palaia/config.json.

CRITICAL: Every agent MUST have a unique PALAIA_AGENT value. Without it, all entries are attributed to "default" and scope filtering does not work correctly.

Setting up additional agents

When new agents join an existing OpenClaw instance that already has palaia, each agent needs palaia awareness:

1. Set the agent's identity:

   export PALAIA_AGENT=<agent-name>

2. Add palaia's SKILL.md to the agent's skill set (via ClawHub or manual config)
Join the shared store:

   palaia instance set <instance-id>

4. Verify connectivity:

CODEBLOCK38

If a new agent is added later and does NOT have palaia configured, it will:

- Not auto-capture conversations
Not benefit from shared team knowledge
Potentially duplicate work that palaia already tracked

Use palaia memo broadcast "New agent <name> is online" to notify the team.

Aliases

When an agent runs under different names (e.g., "default" during init, "HAL" during operation):

palaia alias set default HAL

Queries for either name return entries from both.

Per-agent priorities

Different agents can have different injection behavior: CODEBLOCK40

Session instances

Distinguish sessions of the same agent: CODEBLOCK41

Agent Isolation Mode

For focused agents (Sonnet/Codex) that should only see their own memories:

1. Configure in priorities.json:

{
  "agents": {
    "dev-worker": {
      "scopeVisibility": ["private"],
      "captureScope": "private",
      "maxInjectedChars": 2000,
      "recallMinScore": 0.85
    }
  }
}

Or via CLI:
CODEBLOCK43

2. Set agent identity:
CODEBLOCK44

3. Auto-capture stays on (crash safety net). Cleanup happens after the work package.

4. After accepting work:

palaia prune --agent dev-worker --tags auto-capture --protect-type process

This removes session noise while preserving learned SOPs.

Pre-configured Agent Profiles

Profile	scopeVisibility	captureScope	maxInjectedChars	recallMinScore	autoCapture
Isolated Worker	INLINECODE86	INLINECODE87	2000	0.85	true
Orchestrator

Orchestrator Lifecycle (process template)

Save this as a process entry for the orchestrator to recall:

palaia write "## Dev-Agent Lifecycle
1. Create agent identity: export PALAIA_AGENT=dev-worker-{task-id}
2. Configure isolation: palaia priorities set scopeVisibility private --agent dev-worker-{task-id}
3. Configure capture: palaia priorities set captureScope private --agent dev-worker-{task-id}
4. Assign work package via prompt
5. After acceptance: palaia prune --agent dev-worker-{task-id} --tags auto-capture --protect-type process
6. Verify: palaia list --agent dev-worker-{task-id} --type process
7. Process knowledge persists for future tasks" --type process --tags workflow,orchestration --scope private

When to Use What

Situation	Command
Save an important fact or decision	INLINECODE92 (ranks higher than auto-capture)
Document a reusable workflow

Auto-capture handles routine conversation knowledge. Manual writes rank higher in recall — use them when something is especially important, reusable, or comes from an external source.

DO manually write: key decisions, reusable processes (team runbooks), sticky-note reminders (tasks), important facts from external sources.

Auto-Capture and Manual Write

How Auto-Capture works

Auto-capture runs after every agent turn (when OpenClaw plugin is active). It:

1. Collects messages from the completed exchange
Filters trivial content
Uses LLM extraction to identify significant knowledge
Writes entries with appropriate type, tags, scope, and project
Falls back to rule-based extraction if LLM is unavailable

Capture Hints

Guide auto-capture without writing manually:

<palaia-hint project="myapp" scope="private" />

Hints override LLM detection. Multiple hints supported. Automatically stripped from output.

Capture Levels

palaia init --capture-level <off|minimal|normal|aggressive>

Level	Behavior
INLINECODE108	Manual-only memory
INLINECODE109

Session Continuity (NEW in v2.4)

Session continuity gives agents automatic context restoration across sessions. These features work out of the box with the OpenClaw plugin -- no manual setup needed.

Session Briefings

Session briefings are injected automatically at session start. They contain your last session summary and any open tasks (sticky notes). Read the briefing carefully — it is your primary context for continuing previous work. Do not ask the user "where did we leave off?" when a briefing is present. Instead, acknowledge the context and continue seamlessly.

Session Summaries

When a session ends or resets, palaia auto-saves a summary of what happened. These are stored as entries with the session-summary tag and can be queried: CODEBLOCK49

Privacy Markers

Wrap sensitive content in <private>...</private> blocks to exclude it from auto-capture. Private blocks are stripped before any extraction runs.

Recency Boost

Fresh memories are ranked higher in recall results. The boost factor is configurable via recallRecencyBoost (default 0.3, set to 0 to disable).

Progressive Disclosure

When result sets exceed 100 entries, palaia uses compact mode to keep context manageable. Use --limit to control result size explicitly:

palaia list --type task --status open --limit 5

Plugin Configuration (OpenClaw)

Set in openclaw.json under plugins.entries.palaia.config:

Key	Default	Description
INLINECODE120	INLINECODE121	Inject relevant memories into agent context
INLINECODE122

Dangerous Operations

Be careful with these commands:

- palaia gc --aggressive — Permanently deletes COLD-tier entries. Always run palaia gc --aggressive --dry-run first and confirm with the user before executing.
Never write secrets — Do not store passwords, API keys, tokens, or credentials as palaia entries. They persist in the database and may be shared via --scope team or --scope public.
--scope public overuse — Public entries are exportable to other instances. Only use for genuinely shareable knowledge. Default to team scope.
"Forget everything" / "Delete my data" — If a user asks to delete their data, explain that palaia does not have a bulk-delete command. Guide them to palaia gc --aggressive for cold entries, or manual palaia edit <id> / project deletion. Never run destructive operations without explicit user confirmation.

Error Handling

NEVER silently ignore palaia errors. Always report them clearly to the user.

Problem	What to do
Something is wrong	INLINECODE166 first, debug second
INLINECODE167 fails

Check permissions on the directory, disk space, and Python version. Report the exact error to the user. |
| palaia write fails | Run palaia doctor --fix, then retry. If WAL replay needed, run palaia recover. |
| palaia query returns nothing | Try palaia query "..." --all to include COLD tier. Check palaia list to verify entries exist. |
| Entries seem missing | palaia recover then palaia list --tier cold |
| Slow queries | pip install 'palaia[sqlite-vec]' for native vector search, then palaia warmup. Check palaia detect and palaia status |
| Provider not available | Chain auto-falls back. Check palaia status |
| .palaia missing | palaia init |
| Embedding provider unavailable | BM25 works without embeddings. Check palaia detect for available providers. |

If palaia doctor --fix cannot resolve an issue, report the full error output to the user. Do not guess at fixes.

Configuration Keys

Key	Default	Description
INLINECODE185	INLINECODE186	Default visibility for new entries
INLINECODE187

palaia v2.7 — 代理记忆技能

palaia 是 OpenClaw 代理的记忆系统。它为你提供跨会话的持久化、可搜索、崩溃安全的记忆。你不需要理解它的工作原理——只需使用下面的命令即可。

首次设置

当用户要求你设置 palaia、启用记忆或跨会话记住事情时，运行以下步骤：

bash
pip install palaia[fastembed]

bash
palaia init

这将创建包含 SQLite 存储的 .palaia/ 目录，并自动配置 fastembed 用于语义搜索。无需手动配置。

向用户确认成功：记忆已就绪。我将自动记住我们对话中的重要上下文。

如果 pip 不可用，请按此顺序尝试替代方案：
bash
python3 -m pip install palaia[fastembed]
uv tool install palaia[fastembed]
pipx install palaia[fastembed]

可选附加组件（按需安装）：
bash
pip install palaia[mcp] # 用于 Claude Desktop、Cursor 的 MCP 服务器
pip install palaia[curate] # 知识整理

注意：palaia[fastembed] 已包含 sqlite-vec，用于原生 SIMD 向量搜索。无需单独安装。

OpenClaw 插件设置（用于自动捕获和自动召回）

CLI 安装完成后，将其接入 OpenClaw：

bash
openclaw plugins install @byte5ai/palaia

然后在 OpenClaw 配置中激活记忆槽：
json5
// openclaw.json
{
plugins: {
slots: { memory: palaia }
}
}

重启网关并验证：
bash
openclaw gateway restart
palaia doctor

代理身份（可选）

单代理系统使用默认名称。对于多代理设置：
bash
palaia init --agent YOURAGENTNAME

更新 palaia

当用户要求更新，或当 palaia status 显示版本不匹配时：

bash
palaia upgrade

这将自动检测安装方法（pip/uv/pipx），保留所有已安装的附加组件（fastembed、mcp、sqlite-vec、curate），运行 palaia doctor --fix，并升级已安装的 OpenClaw 插件。无需手动步骤。

如果 palaia upgrade 不被识别（v2.3.0 之前的版本），请手动更新：

bash

步骤 1：查找安装方法

which palaia

步骤 2：升级（使用与步骤 1 路径匹配的方法）

pip（路径包含 site-packages 或 .local）：

pip install --upgrade palaia[fastembed]

uv（路径包含 uv）：

uv tool install palaia[fastembed]

pipx（路径包含 pipx）：

pipx install palaia[fastembed] --force

步骤 3：健康检查 + 迁移

palaia doctor --fix

步骤 4：更新 OpenClaw 插件（如果已安装）

openclaw plugins install @byte5ai/palaia

手动更新后，palaia upgrade 将可用于未来的更新。

自动检查（每次加载此技能时运行）

在使用 palaia 执行任何其他操作之前，静默运行 palaia doctor。如果它报告任何警告，请立即按照 palaia doctor --fix 的建议进行处理。在 doctor 报告无问题之前，不要继续执行其他 palaia 任务。

核心概念

自动捕获 vs. 手动写入

当 OpenClaw 插件激活时，对话会被自动捕获。自动捕获处理对话中的常规知识提取。

手动写入（palaia write）价值更高。 当你或用户识别出特别重要的内容时——关键决策、可重用工作流、来自外部来源的关键事实——请手动写入。手动写入的条目在召回结果中排名高于自动捕获的条目，因此在未来的会话中会更突出地显示。手动写入用于：

- 来自外部来源的重要事实、决策或上下文
可重用的流程和工作流（团队运行手册）
用于未来会话的便签/提醒（任务）

三个层级

- 热 — 活跃记忆（< 7 天或频繁访问）。始终被搜索。
温 — 老化记忆（7-30 天）。默认被搜索。
冷 — 归档记忆（> 30 天）。仅使用 --all 搜索。

作用域

- 私有 — 只有写入代理可以读取
团队 — 工作区中的所有代理都可以读取（默认）
公共 — 可导出并在工作区之间共享

条目类型

- 记忆 — 事实、决策、学习成果（默认）
流程 — 工作流、检查清单、标准操作程序（团队运行手册）
任务 — 用于未来会话的便签/提醒。任务是临时的：当标记为完成时，它们会被自动删除。从不自动捕获——仅通过显式的 palaia write --type task 创建。

存储与搜索

数据库后端

后端	用例	向量搜索	安装
SQLite（默认）	本地、单代理或小团队	sqlite-vec（原生 KNN）或 Python 回退	已包含
PostgreSQL

分布式团队、多主机 | pgvector（ANN、IVFFlat/HNSW） | pip install palaia[postgres] |

SQLite 零配置——palaia init 创建一个单一的 palaia.db 文件，采用 WAL 模式实现崩溃安全。对于代理分布在多台机器上的团队，PostgreSQL 集中存储：
bash
palaia config set database_url postgresql://user:pass@host/db

或：export PALAIADATABASEURL=postgresql://...

语义向量搜索

palaia 使用混合搜索：BM25 关键词匹配（始终激活）结合语义向量嵌入（当配置了提供者时）。这通过含义而非仅关键词来查找记忆。

嵌入提供者（按链式顺序检查，第一个可用的胜出）：

提供者	类型	延迟	安装
fastembed	本地（CPU）	~10ms/查询	pip install palaia[fastembed]（默认）
sentence-transformers

配置链：palaia config set embedding_chain [fastembed, bm25]

检查可用内容：palaia detect

嵌入服务器（性能）

为了快速 CLI 查询，palaia 运行一个后台嵌入服务器，将模型保持在内存中：
bash
palaia embed-server --socket --daemon # 启动后台服务器
palaia embed-server --status # 检查是否运行中
palaia embed-server --stop # 停止服务器

没有服务器时，每次 CLI 调用都会重新加载模型（约 3-5 秒）。使用嵌入服务器：每次 CLI 查询约 1.5 秒（Python 启动 + 服务器调用）或 通过 MCP/插件 <500ms（无 CLI 开销）。

OpenClaw 插件会自动启动嵌入服务器。对于仅 CLI 的使用，当配置了本地提供者（fastembed、sentence-transformers）时，它会在首次查询时自动启动。

MCP 服务器（Claude Desktop、Cursor、任何 MCP 主机）

palaia 作为一个独立的 MCP 记忆服务器工作——不需要 OpenClaw。任何支持 MCP 的 AI 工具都可以将 palaia 用作持久的本地记忆。

bash
pip install palaia[mcp]
palaia-mcp # 启动 MCP 服务器（stdio）
palaia-mcp --root /path/to/.palaia # 显式存储
palaia-mcp --read-only # 不允许写入（不受信任的主机）

Claude Desktop（~/.config/claude/claude_desktop

palaia帕莱亚

palaia