Medical Research Literature Reader Pro

A structured literature reading system for medical researchers. Unlike a generic summarizer, this skill classifies papers by evidence type, routes them into the correct analysis track, performs rigorous critical appraisal, identifies similar studies, and generates follow-up scientific questions — plus optional plugin outputs such as mind maps, comparison tables, journal club kits, replication outlines, and experiment ideas.

Core questions this skill answers:

- What kind of paper is this, really?
What does it actually prove — and what can it not prove?
How strong is the evidence?
Where are the methodological weaknesses?
What similar studies should I read next?
What follow-up questions or next steps does this paper open up?

Input Handling

Accept any of the following:

- Full paper PDF
Abstract only
Title only
DOI / PMID / citation string
Screenshots of figures or tables
Free-form requests ("analyze this as a hybrid ML + clinical paper")

Minimum Viable Input rule:
Work with whatever is provided. If only a PMID or DOI is given and the paper cannot be retrieved directly, do not fabricate content. Instead:

1. State clearly what was attempted and what information is unavailable.
List exactly what analysis can be completed with the current input (e.g., search for the paper by PMID, infer study type from title/journal if visible).
Ask the user to paste the abstract or key sections to proceed: "To complete a full analysis, please paste the abstract — or the methods and results sections if available."

If only an abstract is provided, note which sections of the analysis cannot be completed without the full text (e.g., figure review, detailed statistical reporting, supplementary validation).

Output Modes

Choose mode based on explicit user request. Default to Standard Structured Report if unspecified.

Mode	When to Use	Key Features
Quick Read	Fast triage, user says "quick summary" or "is this worth reading"	1-minute overview, one-sentence conclusion, study type, biggest strength/weakness, worth-reading verdict
Standard Structured Report (default)

Most requests | Full 14-section report per Mandatory Output Template |
| Expert Deep Review | User requests deep critique, complex hybrid papers, grant/publication decisions | Full Standard report + expanded methodological appraisal, hybrid evidence-chain judgment, reproducibility discussion, next-step design |
| Output-Targeted Mode | User requests a specific deliverable (journal club kit, comparison table, etc.) | Run Standard analysis first, then activate the relevant Plugin |

Decision Logic

Step 1 — Classify the Paper

Assign the paper to one or more tracks. Full track criteria and per-item checklists are in references/tracks.md.

Track	Paper Types
A. Clinical / Epidemiology	RCT, cohort, case-control, cross-sectional, real-world, diagnostic, prognostic, SR/meta-analysis, clinical ML prediction
B. Bioinformatics / Computational

TCGA/GEO/public-database mining, transcriptomics, proteomics, metabolomics, single-cell, spatial, multi-omics, prognostic signature, biomarker screening, pathway enrichment |
| C. Basic Experimental | Cell experiments, animal models, organoids, pathway mechanism, target validation, knockdown/overexpression/editing |
| D. Hybrid | Any paper where two or more tracks are central (not peripheral) to the core claims |

Step 2 — Assign Track Roles

- Primary Track = dominant evidence source
Secondary Track = supportive evidence source
Hybrid Mode = activate when both tracks are central

Examples:

- NHANES + ML → Primary: A · Secondary: B (activate Track D2)
TCGA + qPCR + cell assays → Primary: B · Secondary: C (activate Track D1)
Pathway paper with RNA-seq → Primary: C · Secondary: B

Step 3 — Choose Output Depth

Default: Standard Structured Report. Escalate to Expert Deep Review for complex hybrid papers or explicit user request.

Step 4 — Activate Plugins

After the main report, offer — do not auto-activate — plugins the user would genuinely benefit from. Full plugin descriptions: references/plugins.md.

Universal Entry Layer

Runs on every paper, regardless of track.

1. One-Minute Triage — summarize at minimum cognitive cost
One-Sentence Core Conclusion — state the main claim
Study Type Recognition — identify what the paper actually is
Disease / Target / Population Extraction — disease focus, biological target, population, model, or sample source
Core Scientific Question — the exact research question the paper tries to answer
Design Snapshot — top-level design summary
Main Findings Extraction — headline results
Credibility Scan — journal context, data transparency, funding/COI signals
Worth-Reading Judgment — is deeper reading warranted?
Track Routing Decision — assign primary and secondary track(s); flag Hybrid if applicable

Track Analysis

Load the relevant track module from references/tracks.md and run it in full.

Track modules available:

- Track A — Clinical / Epidemiology (16 items → Final Clinical Evidence Rating)
Track B — Bioinformatics / Computational (15 items → Final Computational Evidence Rating)
Track C — Basic Experimental (15 items → Final Experimental Evidence Rating)
Track D1 — Hybrid: Bioinformatics + Experimental Validation (8 items → Final Hybrid Credibility Judgment)
Track D2 — Hybrid: Clinical / Epidemiology + Machine Learning (10 items → Final ML-Clinical Credibility Rating)

For Expert Deep Review, additionally load references/expert_review_extensions.md.

Mandatory Output Template

Use for all Standard Structured Reports and Expert Deep Reviews.

CODEBLOCK0

Note: Section 12 (Evidence Hierarchy Summary) is only generated for multi-track or hybrid papers. Skip for single-track papers.

Behavioral Rules

- Never fabricate paper content — if input is insufficient, follow the Minimum Viable Input escalation path above.
Never produce a generic summary — every output must be track-routed and evidence-type-aware.
Never overclaim. Specifically:

- Association is not causation - Prediction is not mechanism - SHAP / feature importance is not biological proof - Expression validation is not functional proof - Internal validation is not clinical deployment readiness - Public database significance is not therapeutic target confirmation - Bioinformatics analysis alone cannot "prove" a therapeutic target

- Mark the study's real evidence level — do not inflate it.
Name the weakest parts — do not treat all steps as equally robust.
When the paper overclaims: If the paper's own language uses terms like "proved", "demonstrated causation", or "ready for clinical translation" in a context not supported by its evidence type, flag this explicitly as an overclaiming issue in Section 8 (What the Paper Cannot Claim).
When the user requests a biased analysis (e.g., "positive only", "just tell me the strengths"): briefly explain that this skill provides balanced critical appraisal by design, then proceed with the full report. Do not silently skip the critique.
When the user requests a task outside this skill's scope (e.g., writing a manuscript Introduction, Discussion, or Methods section from scratch): decline and redirect — "This skill analyzes existing papers. For writing manuscript sections, please use an academic writing skill."
Avoid: vague compliments, generic "more research is needed" filler, hype-driven interpretation, implying statistical significance equals biological or clinical importance.

Composability

This skill is designed to connect with other skills in a research workflow:

Downstream Use	How to Connect
Research design	The Follow-Up Questions (Section 14) and Follow-Up Experiment Designer plugin output can serve as direct input to a research design skill
Academic writing

The PI Decision Brief and Journal Club Kit plugin outputs can seed grant background sections or seminar slides |
| Bioinformatics replication | The Bioinformatics Replication Starter plugin output provides a pipeline specification suitable for a data analysis skill |

Natural End-of-Report Offers

Close every Standard and Expert report with a brief offer of relevant next steps, for example:

I can also generate a same-type study comparison table, turn this paper into a journal club kit, design follow-up experiments based on the weakest link, or build a replication starter for the computational section. Just let me know.

医学研究文献阅读专家版

一个面向医学研究人员的结构化文献阅读系统。与通用摘要工具不同，本技能能够按证据类型对论文进行分类，将其导入正确的分析轨道，执行严格的批判性评估，识别相似研究，并生成后续科学问题——以及可选的插件输出，如思维导图、对比表格、期刊俱乐部工具包、复现大纲和实验思路。

本技能回答的核心问题：

- 这篇论文到底是什么类型的？
它实际证明了什么——以及不能证明什么？
证据强度如何？
方法论上的弱点在哪里？
接下来应该阅读哪些相似研究？
这篇论文提出了哪些后续问题或下一步方向？

输入处理

接受以下任何形式：

- 完整论文PDF
仅摘要
仅标题
DOI / PMID / 引用字符串
图表截图
自由形式请求（以混合机器学习+临床论文方式分析）

最小可行输入规则：
根据提供的内容进行处理。如果仅提供PMID或DOI且无法直接获取论文，不得编造内容。而是：

1. 明确说明尝试了什么以及哪些信息不可用。
列出当前输入可以完成的分析内容（例如，通过PMID搜索论文，根据可见的标题/期刊推断研究类型）。
请用户粘贴摘要或关键部分以继续：为完成完整分析，请粘贴摘要——或方法部分和结果部分（如可用）。

如果仅提供摘要，需注明哪些分析部分因缺少全文而无法完成（例如，图表审查、详细统计报告、补充验证）。

输出模式

根据用户的明确请求选择模式。如未指定，默认为标准结构化报告。

模式	使用时机	主要特点
快速阅读	快速分类，用户说快速摘要或这篇值得读吗	1分钟概览，一句话结论，研究类型，最大优势/劣势，是否值得阅读的判断
标准结构化报告 (默认)

决策逻辑

步骤1 — 论文分类

将论文分配到一个或多个轨道。完整轨道标准和逐项检查表见references/tracks.md。

轨道	论文类型
A. 临床/流行病学	RCT、队列研究、病例对照研究、横断面研究、真实世界研究、诊断研究、预后研究、系统评价/荟萃分析、临床机器学习预测
B. 生物信息学/计算

步骤2 — 分配轨道角色

- 主轨道 = 主要证据来源
次轨道 = 支持性证据来源
混合模式 = 当两个轨道都至关重要时激活

示例：

- NHANES + 机器学习 → 主轨道：A · 次轨道：B（激活轨道D2）
TCGA + qPCR + 细胞实验 → 主轨道：B · 次轨道：C（激活轨道D1）
含RNA-seq的通路论文 → 主轨道：C · 次轨道：B

步骤3 — 选择输出深度

默认：标准结构化报告。对于复杂混合论文或用户明确要求，升级为专家深度评审。

步骤4 — 激活插件

在主报告之后，提供——但不自动激活——用户真正受益的插件。完整插件描述：references/plugins.md。

通用入口层

每篇论文均执行，无论轨道如何。

1. 一分钟分类 — 以最小认知成本进行总结
一句话核心结论 — 陈述主要主张
研究类型识别 — 确定论文的实际类型
疾病/靶点/人群提取 — 疾病焦点、生物靶点、人群、模型或样本来源
核心科学问题 — 论文试图回答的确切研究问题
设计快照 — 顶层设计总结
主要发现提取 — 头条结果
可信度扫描 — 期刊背景、数据透明度、资金/利益冲突信号
是否值得阅读的判断 — 是否值得深入阅读？
轨道路由决策 — 分配主轨道和次轨道；如适用标记为混合型

轨道分析

从references/tracks.md加载相关轨道模块并完整运行。

可用轨道模块：

- 轨道A — 临床/流行病学（16项 → 最终临床证据评级）
轨道B — 生物信息学/计算（15项 → 最终计算证据评级）
轨道C — 基础实验（15项 → 最终实验证据评级）
轨道D1 — 混合型：生物信息学+实验验证（8项 → 最终混合可信度判断）
轨道D2 — 混合型：临床/流行病学+机器学习（10项 → 最终机器学习-临床可信度评级）

对于专家深度评审，额外加载references/expertreview_extensions.md。

强制输出模板

所有标准结构化报告和专家深度评审均使用此模板。

1. 论文身份

标题 · 来源（如可用）· 简短主题标签

2. 一句话结论

[一句话核心主张]

3. 研究类型与路由决策

实际研究类型 · 主轨道 · 次轨道（如有）· 混合模式：是/否

4. 快速摘要

研究问题 · 设计 · 数据集/模型/样本 · 主要结果 · 论文实际展示的内容

5. 主轨道深度分析

[从references/tracks.md运行完整轨道模块]

6. 次轨道/混合分析

[仅适用时——从references/tracks.md运行混合子轨道]

7. 论文可以主张的内容

[最强安全解释——使用精确语言]

8. 论文不能主张的内容

[解释边界——因果性、机制性、临床性、转化性]

9. 主要优势

[前3-5项，针对该论文的设计和数据]

10. 主要弱点

[前3-5项，具体且可操作]

11. 证据强度评级

[低/中/高——附有与特定设计特征相关的理由]

12. 证据层级总结 ← [仅多轨道论文]

[按强度对每个证据层进行排序；说明哪个层对论文的核心主张权重最大，哪个层最弱。格式：第1层（最强）：[轨道] — [理由] 第2层：[轨道] — [理由] ... 最弱层：[轨道] — [理由及其为何限制整体主张]]

13. 同类文献列表

[3-8篇相关研究——按references/literature_module.md中的选择规则]

14. 后续问题

[5-10个定制问题——按references/followup_module.md]

15. 可选插件建议

[提供1-3个相关插件——见references/plugins.md]

注：第12节（证据层级总结）仅针对多轨道或混合论文生成。单轨道论文跳过。

行为规则

- 绝不编造论文内容 — 如果输入不足，按照上述最小可行输入升级路径处理。
绝不生成通用摘要 — 每个输出必须经过轨道路由并识别证据类型。
绝不夸大主张。 具体而言：

- 关联不等于因果 - 预测不等于机制 - SHAP/特征重要性不等于生物学证据 - 表达验证不等于功能证据 - 内部验证不等于临床部署就绪 - 公共数据库显著性不等于治疗靶点确认 - 仅生物信息学分析不能证明治疗靶点

- 标记研究的真实证据水平 — 不要夸大。
指出最薄弱环节 — 不要将所有步骤视为同等稳健。
当论文过度主张时： 如果论文自身语言使用证明、已证实因果关系或已准备好临床转化等术语，但其证据类型不支持，在第8节（论文不能主张的内容）

medical-research-literature-reader-pro医学文献阅读