Windows Desktop Automation

Quick Start

Dependencies

pip install mss pytesseract pillow pyautogui opencv-python numpy

Note: OCR requires Tesseract OCR installed

Core Features

1. Screenshot

CODEBLOCK1

2. OCR (Text Recognition)

CODEBLOCK2

3. Image Location

CODEBLOCK3

Scripts

Script	Description
INLINECODE0	Screenshot capture
INLINECODE1

Notes

- Image location is sensitive to image similarity; keep screenshots consistent
OCR quality depends on image quality and text clarity
Tesseract path needs to be in system PATH or specified in code

Windows 桌面自动化

快速开始

依赖安装

pip install mss pytesseract pillow pyautogui opencv-python numpy

注意：OCR 需要安装 Tesseract OCR

核心功能

1. 截图

CODEBLOCK5

2. 文字识别 (OCR)

CODEBLOCK6

3. 图像定位

CODEBLOCK7

脚本说明

脚本	功能
INLINECODE4	截图功能
INLINECODE5

注意事项

- 图像定位对图片相似度敏感，建议截图时保持一致
OCR 效果取决于图片质量和文字清晰度
Tesseract 路径需要添加到系统 PATH 或在代码中指定

Windows Desktop Automation

Quick Start

Dependencies

bash pip install mss pytesseract pillow pyautogui opencv-python numpy

Note: OCR requires Tesseract OCR installed

Core Features

1. Screenshot

python from scripts.screenshot import capturescreen, captureregion, capture_window

Full screen

capture_screen(output.png)

Region (x, y, width, height)

capture_region(0, 0, 800, 600, region.png)

Window by title

capture_window(Notepad, notepad.png)

2. OCR (Text Recognition)

python from scripts.ocr import extract_text

Extract text from image

text = extract_text(screenshot.png) print(text)

Specify language (chi_sim=Chinese, eng=English)

text = extracttext(screenshot.png, lang=chisim+eng)

3. Image Location

python from scripts.imagelocate import locateonscreen, locateall

Find image position (returns center coordinates)

pos = locateonscreen(button.png) if pos: x, y, confidence = pos pyautogui.click(x, y) # Click the found element

Find all matches

positions = locate_all(icon.png)

Scripts

Script	Description
screenshot.py	Screenshot capture
ocr.py

Notes

- Image location is sensitive to image similarity; keep screenshots consistent
OCR quality depends on image quality and text clarity
Tesseract path needs to be in system PATH or specified in code

Windows 桌面自动化

快速开始

依赖安装

bash pip install mss pytesseract pillow pyautogui opencv-python numpy

注意：OCR 需要安装 Tesseract OCR

核心功能

1. 截图

python from scripts.screenshot import capturescreen, captureregion, capture_window

全屏截图

capture_screen(output.png)

区域截图 (x, y, width, height)

capture_region(0, 0, 800, 600, region.png)

窗口截图

capture_window(Notepad, notepad.png)

2. 文字识别 (OCR)

python from scripts.ocr import extract_text

从图片提取文字

text = extract_text(screenshot.png) print(text)

指定语言 (chi_sim = 简体中文, eng = 英文)

text = extracttext(screenshot.png, lang=chisim+eng)

3. 图像定位

python from scripts.imagelocate import locateonscreen, locateall

查找图片位置 (返回中心坐标)

pos = locateonscreen(button.png) if pos: x, y, conf = pos pyautogui.click(x, y) # 点击找到的元素

查找所有匹配位置

positions = locate_all(icon.png)

脚本说明

脚本	功能
screenshot.py	截图功能
ocr.py

注意事项

- 图像定位对图片相似度敏感，建议截图时保持一致
OCR 效果取决于图片质量和文字清晰度
Tesseract 路径需要添加到系统 PATH 或在代码中指定

windows-skillsWindows桌面自动化

windows-skills

Windows Desktop Automation

Quick Start

Dependencies

Core Features

1. Screenshot

2. OCR (Text Recognition)

3. Image Location

Scripts

Notes

Windows 桌面自动化

快速开始

依赖安装

核心功能

1. 截图

2. 文字识别 (OCR)

3. 图像定位

脚本说明

注意事项

Windows Desktop Automation

Quick Start

Dependencies

Core Features

1. Screenshot

Full screen

Region (x, y, width, height)

Window by title

2. OCR (Text Recognition)

Extract text from image

Specify language (chi_sim=Chinese, eng=English)

3. Image Location

Find image position (returns center coordinates)

Find all matches

Scripts

Notes

Windows 桌面自动化

快速开始

依赖安装

核心功能

1. 截图

全屏截图

区域截图 (x, y, width, height)

窗口截图

2. 文字识别 (OCR)

从图片提取文字

指定语言 (chi_sim = 简体中文, eng = 英文)

3. 图像定位

查找图片位置 (返回中心坐标)

查找所有匹配位置

脚本说明

注意事项

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement