Sopaper Evidence

Sopaper Evidence is an evidence-first research skill. Its job is to build a reliable evidence pack before supporting any downstream paper outline, abstract, related work summary, experiment plan, or draft section.

Version: INLINECODE0

Upstream source

Canonical repository: INLINECODE1

This published skill bundle includes the helper scripts it references under scripts/. The GitHub repository remains the public source of truth for releases, examples, and issue tracking.

Use this skill when

- The user wants to turn a project into a paper without inventing evidence
The task requires finding prior papers, datasets, benchmarks, baselines, or case studies
The task requires mapping claims to verified sources
The task requires identifying evidence gaps before writing
The user wants related work or experiment planning grounded in real sources

Hard rules

- Do not fabricate papers, authors, venues, dates, citations, datasets, benchmarks, experiments, or numerical results
Prefer primary sources over summaries, reposts, or blog interpretations
Separate verified facts from inference and open questions
If evidence is missing, say it is missing and recommend what to collect next
Do not state that the user's method outperforms baselines unless there is explicit evidence
Every writing-oriented output must be traceable to evidence items

Source priority

Use the highest-quality source available for each claim.

1. User-provided project artifacts: experiment logs, tables, code, configs, internal notes
Primary external sources: papers, official docs, benchmark leaderboards, dataset pages, project repos
Secondary summaries: blogs, news posts, third-party explainers

Read references/source-priority.md when source quality or conflicts matter.
Read references/input-schemas.md when stronger input structure is needed before running the workflow.

Core workflow

1. Scope the task

Collect or infer:

- Project name
Research topic
Core problem
Method summary
Existing evidence and file paths
Target venue or paper style if known

If the project scope is unclear, produce a short working scope and label assumptions.

2. Search for evidence

Search for:

- Prior work
Benchmarks and datasets
Baseline methods
Comparable case studies
Official metrics definitions
Relevant project artifacts in the local repository

For each source, capture the title, URL or path, source type, and why it matters.

Use references/prior-work-search-playbook.md for a repeatable search process.
For OpenClaw-specific work, use references/openclaw-evidence-playbook.md.

3. Verify and classify

For each evidence item, classify it as:

- INLINECODE3
INLINECODE4
INLINECODE5
INLINECODE6

Do not merge these labels. If a statement depends on inference, say so explicitly.

4. Extract structured evidence

Use the schema in references/evidence-schema.md.

At minimum, extract:

- Claim or observation
Source
Evidence type
Scope and limitations
Relevance to the user's paper

5. Build the evidence map

Organize findings into:

- INLINECODE7
INLINECODE8
INLINECODE9
INLINECODE10
INLINECODE11
INLINECODE12
INLINECODE13

Use assets/claim-evidence-map-template.md when the user needs a reusable deliverable.
Use assets/related-work-matrix-template.md when comparing papers, baselines, and benchmark coverage.
Use assets/experiment-gap-report-template.md when the task requires prioritizing missing experiments before drafting.
Use bundled scripts/build_evidence_ledger.py when the user already has markdown notes or source lists and needs a first-pass evidence ledger.
Use bundled scripts/generate_search_plan.py when the user starts only with a topic and needs a first-pass evidence search plan.
Use bundled scripts/generate_topic_claims.py when the user starts only with a topic and needs a cautious structured claims draft.
Use bundled scripts/search_external_sources.py when the user needs a first-pass source list from a topic or search plan.
Use bundled scripts/fetch_external_sources.py when raw URLs should be converted into structured source-note drafts before review.
Use bundled scripts/verify_source_notes.py when fetched notes should be conservatively upgraded into page-level verified facts or reviewed primary-source summaries before entering the ledger.
Use bundled scripts/run_evidence_pipeline.py when the user already has source files, claims, and optional result artifacts and wants one end-to-end draft pack. Result artifacts may be structured markdown, .csv, .tsv, or .json, and multiple result artifacts can be fused into aggregate project evidence.
Use bundled scripts/bootstrap_claim_map.py when the user already has a claims list and a ledger draft and needs a first-pass claim map.
Use bundled scripts/triage_evidence_gaps.py when the user needs a first-pass blocker/major/minor gap report from the current claims and evidence ledger.
Use bundled scripts/review_comparison_fairness.py when the user needs a dedicated fairness check on comparative claims, baseline breadth, metric grounding, and scope alignment.
Use bundled scripts/run_topic_evidence_pipeline.py when the user wants the full topic-driven workflow from theme to search plan, source list, fetched notes, ledger, claim map, and gap report.
Use bundled scripts/validate_input_bundle.py when the user has partially structured inputs and needs a quick schema check before running the pipeline.

6. Support writing

Only after the evidence map is complete, support tasks such as:

- contribution candidates
related work summary
abstract support points
experiment plan
paper outline

Before writing, run the checks in references/claim-audit-rules.md.
Use assets/paper-outline-from-evidence-template.md when the user needs a draft-safe paper structure.

Output requirements

Unless the user asks for something else, default to this output shape:

1. INLINECODE29
INLINECODE30
INLINECODE31
INLINECODE32
INLINECODE33
INLINECODE34 when blocker gaps exist

See the example set in:

Writing constraints

When supporting downstream paper writing:

- Tie each major claim to one or more evidence items
Avoid precise quantitative wording unless the number is verified
Mark missing comparisons, missing ablations, and missing real-world validation
Prefer conservative wording over overstated conclusions

OpenClaw-specific guidance

When the user is working on OpenClaw or a similar embodied AI / robotics project, prioritize:

- manipulation benchmarks
long-horizon task evidence
policy or planner comparisons
real-world versus simulation evidence
ablations on perception, planning, or control components

Do not assume OpenClaw has capabilities, datasets, or benchmark wins unless they are present in project artifacts or verified sources.
Use references/benchmark-baseline-checklist.md before accepting benchmark-fit or baseline coverage claims.
Use references/evidence-gap-triage.md when deciding whether to keep drafting or stop and report blockers.

Sopaper Evidence

Sopaper Evidence 是一项以证据为先的研究技能。其职责是在支持任何下游的论文大纲、摘要、相关工作总结、实验计划或草稿章节之前，构建一个可靠的证据包。

版本：v1.0.0

上游来源

规范仓库：https://github.com/sheepxux/SoPaper-Evidence

此已发布的技能包包含其在 scripts/ 目录下引用的辅助脚本。GitHub 仓库仍然是发布版本、示例和问题追踪的公共信息来源。

何时使用此技能

- 用户希望将项目转化为论文，但无需凭空捏造证据
任务需要查找先前的论文、数据集、基准、基线方法或案例研究
任务需要将主张映射到经过验证的来源
任务需要在写作前识别证据缺口
用户希望相关工作或实验计划基于真实来源

硬性规则

- 不得捏造论文、作者、会议、日期、引用、数据集、基准、实验或数值结果
优先使用一手来源，而非摘要、转载或博客解读
将已验证的事实与推论和未解决问题区分开来
如果缺少证据，需明确指出缺失，并建议下一步应收集的内容
除非有明确证据，否则不得声称用户的方法优于基线方法
每个面向写作的输出都必须可追溯到证据条目

来源优先级

为每个主张使用可用的最高质量来源。

1. 用户提供的项目产物：实验日志、表格、代码、配置文件、内部笔记
外部一手来源：论文、官方文档、基准排行榜、数据集页面、项目仓库
二手摘要：博客、新闻文章、第三方解读

当来源质量或存在冲突时，请阅读 references/source-priority.md。
当在运行工作流之前需要更强的输入结构时，请阅读 references/input-schemas.md。

核心工作流

1. 界定任务范围

收集或推断以下内容：

- 项目名称
研究主题
核心问题
方法摘要
现有证据及文件路径
目标会议或论文风格（如已知）

如果项目范围不明确，则生成一个简短的工作范围并标注假设。

2. 搜索证据

搜索以下内容：

- 先前工作
基准和数据集
基线方法
可比较的案例研究
官方指标定义
本地仓库中的相关项目产物

对于每个来源，记录标题、URL 或路径、来源类型及其重要性。

使用 references/prior-work-search-playbook.md 进行可重复的搜索过程。
对于 OpenClaw 相关的工作，请使用 references/openclaw-evidence-playbook.md。

3. 验证与分类

对于每个证据条目，将其分类为：

- verifiedfact（已验证事实）
projectevidence（项目证据）
inference（推论）
unverified（未验证）

不要合并这些标签。如果某个陈述依赖于推论，需明确说明。

4. 提取结构化证据

使用 references/evidence-schema.md 中的模式。

至少提取以下内容：

- 主张或观察
来源
证据类型
范围与局限性
与用户论文的相关性

5. 构建证据图谱

将发现组织为：

- relatedwork（相关工作）
datasetsandbenchmarks（数据集与基准）
baselines（基线方法）
casestudies（案例研究）
projectresults（项目结果）
claimtoevidence（主张到证据的映射）
evidencegaps（证据缺口）

当用户需要可复用的交付物时，使用 assets/claim-evidence-map-template.md。
当需要比较论文、基线和基准覆盖范围时，使用 assets/related-work-matrix-template.md。
当任务需要在起草前优先处理缺失的实验时，使用 assets/experiment-gap-report-template.md。
当用户已有 Markdown 笔记或来源列表，需要初步的证据账本时，使用捆绑的 scripts/buildevidenceledger.py。
当用户仅从一个主题开始，需要初步的证据搜索计划时，使用捆绑的 scripts/generatesearchplan.py。
当用户仅从一个主题开始，需要一份谨慎的结构化主张草稿时，使用捆绑的 scripts/generatetopicclaims.py。
当用户需要从一个主题或搜索计划中获得初步的来源列表时，使用捆绑的 scripts/searchexternalsources.py。
当需要将原始 URL 转换为结构化的来源笔记草稿以供审查时，使用捆绑的 scripts/fetchexternalsources.py。
当获取的笔记需要被保守地升级为页面级别的已验证事实或经过审查的一手来源摘要，然后才能进入账本时，使用捆绑的 scripts/verifysourcenotes.py。
当用户已有源文件、主张和可选的结果产物，并希望获得一个端到端的草稿包时，使用捆绑的 scripts/runevidencepipeline.py。结果产物可以是结构化的 Markdown、.csv、.tsv 或 .json，多个结果产物可以融合为聚合的项目证据。
当用户已有主张列表和账本草稿，并需要初步的主张图谱时，使用捆绑的 scripts/bootstrapclaimmap.py。
当用户需要从当前主张和证据账本中获得初步的阻塞/主要/次要缺口报告时，使用捆绑的 scripts/triageevidencegaps.py。
当用户需要对比较性主张、基线广度、指标依据和范围对齐进行专门的公平性检查时，使用捆绑的 scripts/reviewcomparisonfairness.py。
当用户希望运行完整的主题驱动工作流（从主题到搜索计划、来源列表、获取的笔记、账本、主张图谱和缺口报告）时，使用捆绑的 scripts/runtopicevidence_pipeline.py。
当用户拥有部分结构化的输入，并需要在运行流水线前进行快速的模式检查时，使用捆绑的 scripts/validateinputbundle.py。

6. 支持写作

仅在证据图谱完成后，才支持以下任务：

- 贡献候选点
相关工作摘要
摘要支撑点
实验计划
论文大纲

在写作前，运行 references/claim-audit-rules.md 中的检查。
当用户需要一个安全的论文结构草稿时，使用 assets/paper-outline-from-evidence-template.md。

输出要求

除非用户另有要求，否则默认输出以下形式：

1. 证据简报
关键来源
主张到证据的映射
证据缺口
安全写作笔记
实验缺口报告（当存在阻塞性缺口时）

参见以下示例集：

写作约束

在支持下游论文写作时：

- 将每个主要主张与一个或多个证据条目关联
除非数字经过验证，否则避免使用精确的量化措辞
标记缺失的比较、缺失的消融实验和缺失的真实世界验证
优先使用保守措辞，而非夸大的结论

OpenClaw 特定指导

当用户正在处理 OpenClaw 或类似的具身 AI / 机器人项目时，优先考虑：

- 操作基准
长时域任务证据
策略或规划器比较
真实世界与仿真证据对比
感知、规划或控制组件的消融实验

sopaper-evidence证据优先研究