
finance-ocr-pro

Use this skill when the user asks to OCR, transcribe, extract, or convert the contents of a scanned PDF, image, or office document into Markdown, HTML, DOCX, or Excel. This workflow sends page images and OCR prompts to a configured OpenAI-compatible VLM endpoint and requires `API_KEY`, `BASE_URL`, and `VLM_MODEL`. It is especially valuable for financial documents and other visually complex materials with dense tables, charts, graphs, and multi-part layouts. Prefer durable background jobs for lon

Author: admin | Source: ClawHub
Version: V 1.0.4
Security check: Passed
Downloads: 105
Favorites: 0


# Finance OCR Pro

Run this skill only after the user has expressed explicit OCR intent.

This skill is especially helpful for financial reports, annual reports, prospectuses, investor presentations, regulatory filings, research reports, and other documents with complicated structure, charts, graphs, tables, and mixed layout elements.

## Security And Privacy

Before running OCR, make the operating model clear:

- This skill requires three environment variables, all of which must be configured before OCR can run:
  - `API_KEY` (sensitive) -- the API key for authenticating with the VLM endpoint.
  - `BASE_URL` -- the base URL of the OpenAI-compatible VLM endpoint. All page images and OCR prompts are transmitted to this URL.
  - `VLM_MODEL` -- the vision-capable model identifier. Must support image inputs; text-only models will not work.
- OCR sends rendered page images and structured prompts to `BASE_URL`. This is the primary data-transmission path. Users must verify that the endpoint is trusted before processing sensitive documents.
- If the user wants offline or local-only OCR, `BASE_URL` must point to a local VLM service. Do not run this skill against an external endpoint with sensitive documents unless the provider is trusted.
- Never commit a populated `.env` file. Use `.env.example` as a template and keep real credentials local.

## Pre-Run Notice

After the user asks for OCR or extraction, give a short notice that includes:

- whether `BASE_URL` is local or remote
- which `VLM_MODEL` will be used
- which execution mode will be used
- where results will be written
- that page images and prompts will be transmitted to the configured endpoint

Proceed automatically unless the user asks to change those defaults.

## Defaults To Announce

- Running mode: background job by default
- Model: `VLM_MODEL`
- Threads: `1`
- Result path:
  - background: `~/.semantic-ocr/jobs/<job_id>/results/`
  - synchronous: `ocr_output/OCR_<filename>/results/`

## Setup

Use the skill-local virtual environment if present.

- macOS/Linux: `.venv/bin/python`
- Windows: `.venv/Scripts/python.exe`
- Fallback: `python`

Run:

```bash
python scripts/ocr_setup.py --check
```

If setup is incomplete, run:

```bash
python scripts/ocr_setup.py
```

## Preferred Execution

By default, start a background worker:

```bash
python scripts/ocrctl.py --json start /path/to/document.pdf
```

Then inspect progress and outputs:

```bash
python scripts/ocrctl.py --json status <job_id>
python scripts/ocrctl.py --json artifacts <job_id>
python scripts/ocrctl.py --json tail <job_id>
```

Use synchronous mode only when the user explicitly wants inline execution:

```bash
python scripts/ocr_main.py /path/to/document.pdf
```

## Notes

- Inputs: PDF, common office documents, Apple office formats, and images.
- Outputs: merged Markdown, HTML review report, DOCX, and Excel.
- OCR requires `API_KEY`, `BASE_URL`, and `VLM_MODEL` to be configured before running.
- Sensitive document pages are transmitted to the configured endpoint during OCR unless the endpoint is a local service.
- Best suited for financial documents and other visually dense materials with tables, charts, graphs, and complex page structure.
- Office-document conversion may require LibreOffice.
- OCR extraction by the VLM model may be time-consuming; check the status regularly.
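
The environment-variable requirements and the pre-run notice described in this README could be checked with a small helper before a job is started. The following is only a minimal sketch and is not part of the skill: the function names `missing_vars`, `is_local_endpoint`, and `pre_run_notice` are hypothetical, and treating only `localhost` and loopback IP addresses as "local" is an assumption (a service on a LAN address would be reported as remote). The API key is deliberately never included in the notice.

```python
import ipaddress
from urllib.parse import urlparse

# The three variables the README says must be configured before OCR can run.
REQUIRED_VARS = ("API_KEY", "BASE_URL", "VLM_MODEL")


def missing_vars(env):
    """Return the required variables that are unset or empty in `env`."""
    return [name for name in REQUIRED_VARS if not env.get(name)]


def is_local_endpoint(base_url):
    """Heuristic (assumption): localhost or a loopback IP counts as local."""
    host = urlparse(base_url).hostname or ""
    if host == "localhost":
        return True
    try:
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # Not an IP literal, so it is some other DNS name: assume remote.
        return False


def pre_run_notice(env, background=True):
    """Build the short notice text; raise if configuration is incomplete."""
    missing = missing_vars(env)
    if missing:
        raise RuntimeError("Missing required variables: " + ", ".join(missing))
    location = "local" if is_local_endpoint(env["BASE_URL"]) else "remote"
    if background:
        mode = "background job"
        results = "~/.semantic-ocr/jobs/<job_id>/results/"
    else:
        mode = "synchronous"
        results = "ocr_output/OCR_<filename>/results/"
    return "\n".join([
        f"Endpoint: {location}",
        f"Model: {env['VLM_MODEL']}",
        f"Mode: {mode}",
        "Threads: 1",
        f"Results: {results}",
        "Page images and prompts will be sent to the configured endpoint.",
    ])
```

A caller might pass the real environment, for example `print(pre_run_notice(os.environ))` after `import os`, and fall back to `python scripts/ocr_setup.py` if the helper raises.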

Tags

skill ai

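Install via chat

This skill can be installed via chat on the following platforms:

OpenClaw WorkBuddy QClaw Kimi Claude

Option 1: Install SkillHub and the skill

Help me install SkillHub and the finance-ocr-pro-1775943565 skill

Option 2: Set SkillHub as the preferred skill installation source

Set SkillHub as my preferred skill installation source, then help me install the finance-ocr-pro-1775943565 skill

Install via command line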

skillhub install finance-ocr-pro-1775943565

Zip package

finance-ocr-pro v1.0.4

File size: 95.64 KB | Published: 2026-4-12 09:57

v1.0.4 (latest) 2026-4-12 09:57
- Removed the `openai.yaml` configuration file.
- Updated documentation to clarify that OCR runs only after explicit user intent, not just file upload.
- Revised execution instructions and removed automated start based on file attachments.
- Clarified security, privacy, and default behaviors.
- Added a note that OCR extraction may be time-consuming; users should check status regularly.
