data-ground-truth

When presenting numbers, metrics, or statistics in reports, recommendations, or analysis — verify the facts and contextualize the figures against industry baselines. Combines verify (live fact-checking) with norm (statistical benchmarking).

When to Activate

Use this skill when:

- Writing a report that cites specific metrics (revenue, churn, conversion rates)
A user shares their business numbers and asks "is this good?"
Comparing a metric to industry standards ("how does our 5% churn compare?")
Building a recommendation that depends on current market data
Presenting financial figures that may have changed since training
Analyzing a dataset and wanting to flag outliers against known baselines

Do NOT use for: opinions, qualitative assessments, or metrics with no established baseline.

Workflow

Step 1: Classify the data point

Determine whether each number is:

- A factual claim (exchange rate, stock price, population) → route to verify
A business/performance metric (churn rate, NPS, response time) → route to norm
Both (e.g., "our conversion rate of 3.2% is above average") → check both

Step 2: Verify factual claims

For current facts (prices, rates, dates), use verify-claim.

MCP (preferred): INLINECODE0

HTTP:
CODEBLOCK0

Handle verdicts per the verify-claim decision tree (confirmed → use, stale → update, disputed → present both sides, false → correct).

Step 3: Benchmark metrics against baselines

For business metrics, check where the value falls on the distribution.

MCP (preferred): INLINECODE1

HTTP:
CODEBLOCK1

For multiple metrics at once:
CODEBLOCK2

Optional: add company_size (startup/smb/mid_market/enterprise) and region for more specific baselines.

Step 4: Present with context

When reporting findings, combine verification and benchmarking:

Data type	How to present
Verified fact	"The current [metric] is [current_truth] (verified live, [freshness])."
Benchmarked metric

"[Value] is at the [percentile]th percentile — [assessment] for [category]." |
| Both | "At [current_truth] (verified), this is [percentile]th percentile vs. industry ([baseline source])." |
| Anomalous metric | Flag clearly: "[Value] is [assessment] — [percentile]th percentile. The typical range is [p25]-[p75]." |

Assessment values from norm: very_low, low, normal, high, very_high, anomalous.

Available baseline categories

121 baselines across 14 domains. Browse with:
CODEBLOCK3

Common categories: saas:churn_rate_monthly, saas:nps_score, saas:ltv_cac_ratio, ecommerce:cart_abandonment_rate, infrastructure:api_latency_p99, infrastructure:uptime_percentage.

Data Handling

This skill sends claims (natural language text) and metric values (category identifiers + numbers) to two external APIs. No documents, user data, or file contents are transmitted.

Pricing

- Verify: 25 free/day, then $0.004/query
Norm: free category listing, $0.002/check or $0.001/batch item
Full ground-truth check (verify + norm): ~$0.006 per data point

All via x402 protocol (USDC on Base). No authentication required for free tiers.

Privacy

No personal data collected. Claims cached up to 1 hour (verify), metric checks are stateless (norm). Rate limiting uses IP hashing only.

data-ground-truth

在报告、建议或分析中呈现数字、指标或统计数据时——需核实事实并将数据与行业基准进行对比。结合了验证（实时事实核查）与规范（统计基准对比）。

何时激活

在以下场景使用该技能：

- 撰写引用特定指标（收入、流失率、转化率）的报告时
用户分享其业务数据并询问这个数据好吗？
将指标与行业标准进行比较（我们5%的流失率处于什么水平？）
制定依赖当前市场数据的建议时
呈现训练后可能已发生变化的财务数据时
分析数据集并希望标记与已知基准的异常值时

不适用于： 观点、定性评估或尚无既定基准的指标。

工作流程

第一步：分类数据点

判断每个数字属于：

- 事实性声明（汇率、股价、人口）→ 转至验证
业务/绩效指标（流失率、NPS、响应时间）→ 转至规范
两者兼具（例如我们3.2%的转化率高于平均水平）→ 同时检查

第二步：验证事实性声明

对于当前事实（价格、利率、日期），使用verify-claim。

MCP（推荐）： verify_claim({ claim: 美元兑欧元汇率为0.92 })

HTTP：
bash
curl -X POST https://verify.agentutil.net/v1/verify \
-H Content-Type: application/json \
-d {claim: 美元兑欧元汇率为0.92}

按照verify-claim决策树处理判定结果（确认→使用，过时→更新，争议→呈现双方观点，错误→纠正）。

第三步：将指标与基准进行对比

对于业务指标，检查该值在分布中的位置。

MCP（推荐）： normcheck({ category: saas:churnrate_monthly, value: 5.2, unit: % })

HTTP：
bash
curl -X POST https://norm.agentutil.net/v1/check \
-H Content-Type: application/json \
-d {category: saas:churnratemonthly, value: 5.2, unit: %}

同时检查多个指标：
bash
curl -X POST https://norm.agentutil.net/v1/batch \
-H Content-Type: application/json \
-d {items: [{category: saas:churnratemonthly, value: 5.2}, {category: saas:nps_score, value: 45}]}

可选：添加companysize（startup/smb/midmarket/enterprise）和region以获得更具体的基准。

第四步：结合上下文呈现

报告结果时，结合验证和基准对比：

数据类型	呈现方式
已验证事实	当前[指标]为[当前真实值]（实时验证，[时效性]）。
基准对比指标

来自norm的评估值：非常低、低、正常、高、非常高、异常。

可用基准类别

覆盖14个领域的121个基准。浏览方式：
bash
curl https://norm.agentutil.net/v1/categories

常见类别：saas:churnratemonthly、saas:npsscore、saas:ltvcacratio、ecommerce:cartabandonmentrate、infrastructure:apilatencyp99、infrastructure:uptimepercentage。

数据处理

该技能将声明（自然语言文本）和指标值（类别标识符+数字）发送至两个外部API。不传输任何文档、用户数据或文件内容。

定价

- 验证：每天25次免费，之后$0.004/次查询
规范：免费类别列表，$0.002/次检查或$0.001/批次项目
完整ground-truth检查（验证+规范）：约$0.006/数据点

全部通过x402协议（Base链上的USDC）。免费层级无需认证。

隐私

不收集个人数据。声明缓存最多1小时（验证），指标检查无状态（规范）。速率限制仅使用IP哈希。

data-ground-truth数据真实性