data-ground-truth
When presenting numbers, metrics, or statistics in reports, recommendations, or analysis — verify the facts and contextualize the figures against industry baselines. Combines verify (live fact-checking) with norm (statistical benchmarking).
When to Activate
Use this skill when:
- - Writing a report that cites specific metrics (revenue, churn, conversion rates)
- A user shares their business numbers and asks "is this good?"
- Comparing a metric to industry standards ("how does our 5% churn compare?")
- Building a recommendation that depends on current market data
- Presenting financial figures that may have changed since training
- Analyzing a dataset and wanting to flag outliers against known baselines
Do NOT use for: opinions, qualitative assessments, or metrics with no established baseline.
Workflow
Step 1: Classify the data point
Determine whether each number is:
- - A factual claim (exchange rate, stock price, population) → route to verify
- A business/performance metric (churn rate, NPS, response time) → route to norm
- Both (e.g., "our conversion rate of 3.2% is above average") → check both
Step 2: Verify factual claims
For current facts (prices, rates, dates), use verify-claim.
MCP (preferred): INLINECODE0
HTTP:
CODEBLOCK0
Handle verdicts per the verify-claim decision tree (confirmed → use, stale → update, disputed → present both sides, false → correct).
Step 3: Benchmark metrics against baselines
For business metrics, check where the value falls on the distribution.
MCP (preferred): INLINECODE1
HTTP:
CODEBLOCK1
For multiple metrics at once:
CODEBLOCK2
Optional: add company_size (startup/smb/mid_market/enterprise) and region for more specific baselines.
Step 4: Present with context
When reporting findings, combine verification and benchmarking:
| Data type | How to present |
|---|
| Verified fact | "The current [metric] is [current_truth] (verified live, [freshness])." |
| Benchmarked metric |
"[Value] is at the [percentile]th percentile — [assessment] for [category]." |
| Both | "At [current_truth] (verified), this is [percentile]th percentile vs. industry ([baseline source])." |
| Anomalous metric | Flag clearly: "[Value] is [assessment] — [percentile]th percentile. The typical range is [p25]-[p75]." |
Assessment values from norm: very_low, low, normal, high, very_high, anomalous.
Available baseline categories
121 baselines across 14 domains. Browse with:
CODEBLOCK3
Common categories: saas:churn_rate_monthly, saas:nps_score, saas:ltv_cac_ratio, ecommerce:cart_abandonment_rate, infrastructure:api_latency_p99, infrastructure:uptime_percentage.
Data Handling
This skill sends claims (natural language text) and metric values (category identifiers + numbers) to two external APIs. No documents, user data, or file contents are transmitted.
Pricing
- - Verify: 25 free/day, then $0.004/query
- Norm: free category listing, $0.002/check or $0.001/batch item
- Full ground-truth check (verify + norm): ~$0.006 per data point
All via x402 protocol (USDC on Base). No authentication required for free tiers.
Privacy
No personal data collected. Claims cached up to 1 hour (verify), metric checks are stateless (norm). Rate limiting uses IP hashing only.
data-ground-truth
在报告、建议或分析中呈现数字、指标或统计数据时——需核实事实并将数据与行业基准进行对比。结合了验证(实时事实核查)与规范(统计基准对比)。
何时激活
在以下场景使用该技能:
- - 撰写引用特定指标(收入、流失率、转化率)的报告时
- 用户分享其业务数据并询问这个数据好吗?
- 将指标与行业标准进行比较(我们5%的流失率处于什么水平?)
- 制定依赖当前市场数据的建议时
- 呈现训练后可能已发生变化的财务数据时
- 分析数据集并希望标记与已知基准的异常值时
不适用于: 观点、定性评估或尚无既定基准的指标。
工作流程
第一步:分类数据点
判断每个数字属于:
- - 事实性声明(汇率、股价、人口)→ 转至验证
- 业务/绩效指标(流失率、NPS、响应时间)→ 转至规范
- 两者兼具(例如我们3.2%的转化率高于平均水平)→ 同时检查
第二步:验证事实性声明
对于当前事实(价格、利率、日期),使用verify-claim。
MCP(推荐): verify_claim({ claim: 美元兑欧元汇率为0.92 })
HTTP:
bash
curl -X POST https://verify.agentutil.net/v1/verify \
-H Content-Type: application/json \
-d {claim: 美元兑欧元汇率为0.92}
按照verify-claim决策树处理判定结果(确认→使用,过时→更新,争议→呈现双方观点,错误→纠正)。
第三步:将指标与基准进行对比
对于业务指标,检查该值在分布中的位置。
MCP(推荐): normcheck({ category: saas:churnrate_monthly, value: 5.2, unit: % })
HTTP:
bash
curl -X POST https://norm.agentutil.net/v1/check \
-H Content-Type: application/json \
-d {category: saas:churnratemonthly, value: 5.2, unit: %}
同时检查多个指标:
bash
curl -X POST https://norm.agentutil.net/v1/batch \
-H Content-Type: application/json \
-d {items: [{category: saas:churnratemonthly, value: 5.2}, {category: saas:nps_score, value: 45}]}
可选:添加companysize(startup/smb/midmarket/enterprise)和region以获得更具体的基准。
第四步:结合上下文呈现
报告结果时,结合验证和基准对比:
| 数据类型 | 呈现方式 |
|---|
| 已验证事实 | 当前[指标]为[当前真实值](实时验证,[时效性])。 |
| 基准对比指标 |
[数值]处于第[百分位]百分位——对[类别]而言属于[评估结果]。
| 两者兼具 | 在[当前真实值](已验证)的情况下,该数值处于第[百分位]百分位,与行业相比([基准来源])。 |
| 异常指标 | 明确标记:[数值]属于[评估结果]——第[百分位]百分位。典型范围为[p25]-[p75]。
来自norm的评估值:非常低、低、正常、高、非常高、异常。
可用基准类别
覆盖14个领域的121个基准。浏览方式:
bash
curl https://norm.agentutil.net/v1/categories
常见类别:saas:churnratemonthly、saas:npsscore、saas:ltvcacratio、ecommerce:cartabandonmentrate、infrastructure:apilatencyp99、infrastructure:uptimepercentage。
数据处理
该技能将声明(自然语言文本)和指标值(类别标识符+数字)发送至两个外部API。不传输任何文档、用户数据或文件内容。
定价
- - 验证:每天25次免费,之后$0.004/次查询
- 规范:免费类别列表,$0.002/次检查或$0.001/批次项目
- 完整ground-truth检查(验证+规范):约$0.006/数据点
全部通过x402协议(Base链上的USDC)。免费层级无需认证。
隐私
不收集个人数据。声明缓存最多1小时(验证),指标检查无状态(规范)。速率限制仅使用IP哈希。