windows-skills: Windows Desktop Automation

Windows desktop automation skills: screenshot capture, OCR text extraction, and image-based UI element location. Use when (1) capturing screen content, (2) extracting text from images, or (3) locating UI elements for automation.

Author: admin | Source: ClawHub | Version: 1.0.1 | Security check: passed | Downloads: 509 | Price: free | Favorites: 1

windows-skills

Windows Desktop Automation

Quick Start

Dependencies

    pip install mss pytesseract pillow pyautogui opencv-python numpy

Note: OCR requires the Tesseract OCR engine to be installed separately; pytesseract is only a Python wrapper around it.

Core Features

1. Screenshot

    from scripts.screenshot import capture_screen, capture_region, capture_window

    # Full screen
    capture_screen("output.png")

    # Region (x, y, width, height)
    capture_region(0, 0, 800, 600, "region.png")

    # Window by title
    capture_window("Notepad", "notepad.png")
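
The listing does not show what `scripts/screenshot.py` does internally. As a rough sketch, assuming the region capture is built on `mss` (one of the listed dependencies), the `(x, y, width, height)` arguments could be mapped onto the bounding-box dictionary that `mss` expects. The names `region_to_monitor` and `capture_region_sketch` are hypothetical and are not part of the skill:

```python
def region_to_monitor(x, y, width, height):
    """Convert (x, y, width, height) into the bounding-box dict used by mss."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    return {"left": x, "top": y, "width": width, "height": height}


def capture_region_sketch(x, y, width, height, output_path):
    """Hypothetical stand-in for capture_region; not the skill's actual code."""
    # Imported lazily so the helper above can be used without mss installed.
    import mss
    import mss.tools

    with mss.mss() as sct:
        shot = sct.grab(region_to_monitor(x, y, width, height))
        # Write the raw RGB pixels out as a PNG file.
        mss.tools.to_png(shot.rgb, shot.size, output=output_path)
    return output_path
```

Keeping the coordinate conversion in its own function makes the argument order easy to check without a display attached.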

2. OCR (Text Recognition)

    from scripts.ocr import extract_text

    # Extract text from image
    text = extract_text("screenshot.png")
    print(text)

    # Specify language (chi_sim = Simplified Chinese, eng = English)
    text = extract_text("screenshot.png", lang="chi_sim+eng")
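
The `lang` value above follows Tesseract's convention of joining language codes with `+`. The sketch below shows one way such a wrapper might look, assuming it delegates to `pytesseract.image_to_string`; `build_lang` and `extract_text_sketch` are hypothetical names, and the skill's real `extract_text` may differ:

```python
def build_lang(*codes):
    """Join Tesseract language codes, e.g. ("chi_sim", "eng") -> "chi_sim+eng"."""
    cleaned = [c.strip() for c in codes if c and c.strip()]
    if not cleaned:
        raise ValueError("at least one language code is required")
    return "+".join(cleaned)


def extract_text_sketch(image_path, lang="eng"):
    """Hypothetical stand-in for extract_text; not the skill's actual code."""
    # Imported lazily so build_lang works without the OCR stack installed.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as img:
        return pytesseract.image_to_string(img, lang=lang)
```

Each language code only works if the matching Tesseract language data file (for example the one for `chi_sim`) is installed alongside the engine.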

3. Image Location

    import pyautogui
    from scripts.image_locate import locate_on_screen, locate_all

    # Find image position (returns center coordinates and match confidence)
    pos = locate_on_screen("button.png")
    if pos:
        x, y, confidence = pos
        pyautogui.click(x, y)  # Click the found element

    # Find all matches
    positions = locate_all("icon.png")
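
The listing does not say how the match is computed. Since `opencv-python` is among the dependencies, one plausible approach is OpenCV template matching. The sketch below assumes that technique; the names `match_center` and `locate_on_screen_sketch` and the `0.8` threshold are illustrative choices, not values taken from the skill:

```python
def match_center(left, top, template_width, template_height):
    """Center point of a match, given its top-left corner and template size."""
    return left + template_width // 2, top + template_height // 2


def locate_on_screen_sketch(template_path, threshold=0.8):
    """Hypothetical stand-in for locate_on_screen; not the skill's actual code."""
    # Imported lazily so match_center works without the imaging stack installed.
    import cv2
    import numpy as np
    import pyautogui

    # pyautogui.screenshot() returns a PIL image; OpenCV expects BGR channel order.
    screenshot = pyautogui.screenshot().convert("RGB")
    screen = cv2.cvtColor(np.array(screenshot), cv2.COLOR_RGB2BGR)
    template = cv2.imread(template_path)
    if template is None:
        raise FileNotFoundError(template_path)

    # Normalized cross-correlation: scores close to 1.0 mean a near-exact match.
    scores = cv2.matchTemplate(screen, template, cv2.TM_CCOEFF_NORMED)
    _, best_score, _, best_top_left = cv2.minMaxLoc(scores)
    if best_score < threshold:
        return None  # Nothing on screen is similar enough to the template.

    height, width = template.shape[:2]
    x, y = match_center(best_top_left[0], best_top_left[1], width, height)
    return x, y, float(best_score)
```

Returning `None` below the threshold matches the `if pos:` check in the usage example, and the returned score plays the role of `confidence`.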

Scripts

| Script | Description |
| --- | --- |
| screenshot.py | Screenshot capture |
| ocr.py | Text recognition |
| image_locate.py | Image-based element location |
| helpers.py | Common utilities |

Notes

  • Image location is sensitive to image similarity; capture template images at the same resolution, scaling, and theme as the screen being searched
  • OCR quality depends on image quality and text clarity
  • The Tesseract executable must be on the system PATH, or its path must be specified in code
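
The last note can be sketched as follows. `pytesseract` reads the binary location from `pytesseract.pytesseract.tesseract_cmd`; the helper names `find_executable` and `configure_tesseract` are hypothetical, and any fallback paths passed in are assumptions about where Tesseract was installed:

```python
import os
import shutil


def find_executable(name, fallback_paths=()):
    """Return the first usable path: PATH lookup first, then explicit fallbacks."""
    found = shutil.which(name)
    if found:
        return found
    for candidate in fallback_paths:
        if os.path.isfile(candidate):
            return candidate
    return None


def configure_tesseract(fallback_paths=()):
    """Point pytesseract at the Tesseract binary; raise if none can be found."""
    path = find_executable("tesseract", fallback_paths)
    if path is None:
        raise FileNotFoundError("Tesseract not found on PATH or in fallback_paths")
    # Imported lazily so find_executable works without pytesseract installed.
    import pytesseract

    pytesseract.pytesseract.tesseract_cmd = path
    return path
```

A call such as `configure_tesseract([r"C:\Program Files\Tesseract-OCR\tesseract.exe"])` would try PATH first and then that location, which is a common default for Windows installers but should be checked on the target machine.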


Tags: skill, ai

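Install via chat

This skill can be installed through chat on the following platforms:

OpenClaw, WorkBuddy, QClaw, Kimi, Claude

Option 1: Install SkillHub and the skill

Help me install SkillHub and the windows-skills-1776189129 skill

Option 2: Set SkillHub as the preferred skill installation source

Set SkillHub as my preferred skill installation source, then help me install the windows-skills-1776189129 skill

Install via command line

    skillhub install windows-skills-1776189129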

Download

windows-skills v1.0.1 (free)

File size: 6.8 KB | Released: 2026-4-15 11:19

v1.0.1 (latest) 2026-4-15 11:19
Added full English documentation alongside the original Chinese instructions.

- Provided English descriptions for all features, usage examples, and dependencies.
- Added an English table describing each script's purpose.
- Updated code examples: clarified parameter names and matching output variables in both languages.
- Improved clarity of usage notes in English section.
- Chinese documentation is preserved and now presented after the English section.
