Fastest Browser Use
A Rust-based browser automation engine that provides a lightweight binary driving Chrome directly via CDP. It is optimized for token-efficient DOM extraction, robust session management, and speed.

🧪 Recipes for Agents
1. Bypass "Bot Detection" via Human Emulation
Simulate mouse jitter and random delays to scrape protected sites.
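```bash
fast-browser-use navigate --url https://protected-site.com \
  --human-emulation \
  --wait-for-selector "#content"
```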
2. The "Deep Freeze" Snapshot
Capture the entire DOM state and computed styles for perfect reconstruction later.
```bash
fast-browser-use snapshot --include-styles --output state.json
```
3. Login & Cookie Heist
Log in manually once, then steal the session for headless automation.
Step 1: Open non-headless for manual login
```bash
fast-browser-use login --url https://github.com/login --save-session ./auth.json
```
Step 2: Reuse session later
```bash
fast-browser-use navigate --url https://github.com/dashboard --load-session ./auth.json
```
4. 🚜 Infinite Scroll Harvester
Extract fresh data from infinite-scroll pages — perfect for harvesting the latest posts, news, or social feeds.
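```bash
# Harvest headlines from Hacker News (3 scrolls, 800 ms apart)
fast-browser-use harvest \
  --url https://news.ycombinator.com \
  --selector ".titleline a" \
  --scrolls 3 \
  --delay 800 \
  --output headlines.json
```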
Real output (59 unique items in ~6 seconds):
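```json
[
  "Genode OS is a tool kit for building highly secure special-purpose operating systems",
  "Mobile carriers can get your GPS location",
  "Students use \"humanizer\" programs to dodge AI-cheating accusations",
  "Finland to end \"uncontrolled human experiment\" by banning social media for teenagers",
  ...
]
```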
Works on any infinite scroll page: Reddit, Twitter, LinkedIn feeds, search results, etc.
5. 📸 Quick Screenshot
Capture any page as PNG:
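```bash
fast-browser-use screenshot \
  --url https://example.com \
  --output page.png \
  --full-page  # Optional: capture the entire scrollable page
```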
6. 🗺️ Sitemap & Page Structure Analyzer
Discover how a site is organized by parsing sitemaps and analyzing page structure.
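```bash
# Basic sitemap discovery (checks robots.txt + common sitemap URLs)
fast-browser-use sitemap --url https://example.com
```
```bash
# Full analysis with page structure (headings, nav, sections)
fast-browser-use sitemap \
  --url https://example.com \
  --analyze-structure \
  --max-pages 10 \
  --max-sitemaps 5 \
  --output site-structure.json
```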
Options:
- `--analyze-structure`: Also extract page structure (headings, nav, sections, meta)
- `--max-pages N`: Limit structure analysis to N pages (default: 5)
- `--max-sitemaps N`: Limit sitemap parsing to N sitemaps (default: 10, useful for large sites)
Example output:
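```json
{
  "base_url": "https://example.com",
  "robots_txt": "User-agent: *\nSitemap: https://example.com/sitemap.xml",
  "sitemaps": ["https://example.com/sitemap.xml"],
  "pages": [
    "https://example.com/about",
    "https://example.com/products",
    "https://example.com/contact"
  ],
  "page_structures": [
    {
      "url": "https://example.com",
      "title": "Example - Home",
      "headings": [
        {"level": 1, "text": "Welcome to Example"},
        {"level": 2, "text": "Our Services"}
      ],
      "nav_links": [
        {"text": "About", "href": "/about"},
        {"text": "Products", "href": "/products"}
      ],
      "sections": [
        {"tag": "main", "id": "content", "role": "main"},
        {"tag": "footer", "id": "footer", "role": null}
      ],
      "maincontent": {"tag": "main", "id": "content", "wordcount": 450},
      "meta": {
        "description": "Example company homepage",
        "canonical": "https://example.com/"
      }
    }
  ]
}
```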
Use this to understand site architecture before scraping, map navigation flows, or audit SEO structure.
⚡ Performance Comparison
| Feature | Fast Browser Use (Rust) | Puppeteer (Node) | Selenium (Java) |
|---|---|---|---|
| Startup Time | < 50ms | ~800ms | ~2500ms |
| Memory Footprint | 15 MB | 100 MB+ | 200 MB+ |
| DOM Extract | Zero-Copy | JSON Serialize | Slow Bridge |

Capabilities & Tools
Vision & Extraction
- vision_map: Returns a screenshot overlay with numbered bounding boxes for all interactive elements.
- snapshot: Capture the raw HTML snapshot (YAML/Markdown optimized for AI).
- screenshot: Capture a visual image of the page.
- extract: Get structured data from the DOM.
- markdown: Convert the current page content to Markdown.
- sitemap: Analyze site structure via robots.txt, sitemaps, and page semantic analysis.
Navigation & Lifecycle
- navigate: Visit a specific URL.
- goback / goforward: Traverse browser history.
- wait: Pause execution or wait for specific conditions.
- newtab: Open a new browser tab.
- switchtab: Switch focus to a specific tab.
- closetab: Close the current or specified tab.
- tablist: List all open tabs.
- close: Terminate the browser session.
Interaction
- click: Click elements via CSS selectors or DOM indices.
- input: Type text into fields.
- press_key: Send specific keyboard events.
- hover: Hover over elements.
- scroll: Scroll the viewport.
- select: Choose options in dropdowns.
State & Debugging
- cookies: Manage session cookies (get/set).
- local_storage: Manage local storage data.
- debug: Access console logs and debug information.
Usage
This skill is specialized for complex web interactions that require maintaining state (like being logged in), handling dynamic JavaScript content, or managing multiple pages simultaneously. It offers higher performance and control compared to standard fetch-based tools.
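Fastest Browser Use
A Rust-based browser automation engine that provides a lightweight binary driving Chrome directly via CDP. It is optimized for token-efficient DOM extraction, robust session management, and speed.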

🧪 Recipes for Agents
1. Bypass "Bot Detection" via Human Emulation
Simulate mouse jitter and random delays to scrape protected sites.
```bash
fast-browser-use navigate --url https://protected-site.com \
  --human-emulation \
  --wait-for-selector "#content"
```
2. The "Deep Freeze" Snapshot
Capture the entire DOM state and computed styles for perfect reconstruction later.
```bash
fast-browser-use snapshot --include-styles --output state.json
```
3. Login & Cookie Heist
Log in manually once, then steal the session for headless automation.
Step 1: Open non-headless for manual login
```bash
fast-browser-use login --url https://github.com/login --save-session ./auth.json
```
Step 2: Reuse session later
```bash
fast-browser-use navigate --url https://github.com/dashboard --load-session ./auth.json
```
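The two steps of the login recipe can be chained by a small wrapper that picks the right command depending on whether a session file already exists. The sketch below is only an illustration: `session_command` is a hypothetical helper, it uses nothing beyond the `login --save-session` and `navigate --load-session` invocations shown in this recipe, and it assumes that the presence of the session file means the saved session is still valid.

```python
from pathlib import Path


def session_command(login_url: str, target_url: str, session_file: str) -> list[str]:
    """Return the fast-browser-use argv for the next step of the login recipe.

    Hypothetical helper: only the subcommands and flags from the recipe are used.
    """
    if Path(session_file).is_file():
        # A saved session exists (assumed still valid): reuse it.
        return ["fast-browser-use", "navigate", "--url", target_url,
                "--load-session", session_file]
    # No session yet: open the login page so a human can sign in once.
    return ["fast-browser-use", "login", "--url", login_url,
            "--save-session", session_file]


if __name__ == "__main__":
    cmd = session_command("https://github.com/login",
                          "https://github.com/dashboard",
                          "./auth.json")
    # Print the command; pass the list to subprocess.run(cmd, check=True) to execute it.
    print(" ".join(cmd))
```

Returning an argument list rather than a shell string avoids quoting problems when the command is later handed to `subprocess.run`.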
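4. 🚜 Infinite Scroll Harvester
Extract fresh data from infinite-scroll pages; well suited to harvesting the latest posts, news, or social feeds.
```bash
# Harvest headlines from Hacker News (3 scrolls, 800 ms apart)
fast-browser-use harvest \
  --url https://news.ycombinator.com \
  --selector ".titleline a" \
  --scrolls 3 \
  --delay 800 \
  --output headlines.json
```
Real output (59 unique items in ~6 seconds):
```json
[
  "Genode OS is a tool kit for building highly secure special-purpose operating systems",
  "Mobile carriers can get your GPS location",
  "Students use \"humanizer\" programs to dodge AI-cheating accusations",
  "Finland to end \"uncontrolled human experiment\" by banning social media for teenagers",
  ...
]
```
Works on any infinite scroll page: Reddit, Twitter, LinkedIn feeds, search results, etc.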
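When the harvester is run repeatedly against the same feed, successive output files will overlap. A minimal sketch of merging them while keeping only unseen items, assuming the output is a flat JSON array of strings as in the sample above; `merge_unique` is a hypothetical helper, not part of the tool.

```python
import json


def merge_unique(seen: list[str], fresh: list[str]) -> list[str]:
    """Append items from `fresh` that are not already in `seen`, keeping order."""
    known = set(seen)
    merged = list(seen)
    for item in fresh:
        if item not in known:
            known.add(item)
            merged.append(item)
    return merged


if __name__ == "__main__":
    # Inline stand-ins for the contents of two successive headlines.json files.
    first_run = json.loads('["A", "B", "C"]')
    second_run = json.loads('["C", "D", "A", "E"]')
    print(merge_unique(first_run, second_run))  # ['A', 'B', 'C', 'D', 'E']
```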
5. 📸 Quick Screenshot
Capture any page as PNG:
```bash
fast-browser-use screenshot \
  --url https://example.com \
  --output page.png \
  --full-page  # Optional: capture the entire scrollable page
```
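One way to check that a capture succeeded, and that `--full-page` really produced a taller image than the viewport, is to read the dimensions straight from the PNG header. The sketch below relies only on the PNG file layout (an 8-byte signature followed by the IHDR chunk); `png_size` is a hypothetical helper and is independent of the tool itself.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_size(data: bytes) -> tuple[int, int]:
    """Return (width, height) read from the IHDR chunk of PNG bytes."""
    if len(data) < 24 or data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file")
    # Bytes 16-23 hold width and height as big-endian unsigned 32-bit integers.
    return struct.unpack(">II", data[16:24])


if __name__ == "__main__":
    # Synthetic 24-byte header standing in for the start of page.png.
    header = PNG_SIGNATURE + struct.pack(">I", 13) + b"IHDR" + struct.pack(">II", 1280, 4200)
    print(png_size(header))  # (1280, 4200)
```

For a real file, passing the first 24 bytes is enough, for example `png_size(open("page.png", "rb").read(24))`.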
6. 🗺️ Sitemap & Page Structure Analyzer
Discover how a site is organized by parsing sitemaps and analyzing page structure.
```bash
# Basic sitemap discovery (checks robots.txt + common sitemap URLs)
fast-browser-use sitemap --url https://example.com
```
```bash
# Full analysis with page structure (headings, nav, sections)
fast-browser-use sitemap \
  --url https://example.com \
  --analyze-structure \
  --max-pages 10 \
  --max-sitemaps 5 \
  --output site-structure.json
```
Options:
- `--analyze-structure`: Also extract page structure (headings, nav, sections, meta)
- `--max-pages N`: Limit structure analysis to N pages (default: 5)
- `--max-sitemaps N`: Limit sitemap parsing to N sitemaps (default: 10, useful for large sites)
Example output:
```json
{
  "base_url": "https://example.com",
  "robots_txt": "User-agent: *\nSitemap: https://example.com/sitemap.xml",
  "sitemaps": ["https://example.com/sitemap.xml"],
  "pages": [
    "https://example.com/about",
    "https://example.com/products",
    "https://example.com/contact"
  ],
  "page_structures": [
    {
      "url": "https://example.com",
      "title": "Example - Home",
      "headings": [
        {"level": 1, "text": "Welcome to Example"},
        {"level": 2, "text": "Our Services"}
      ],
      "nav_links": [
        {"text": "About", "href": "/about"},
        {"text": "Products", "href": "/products"}
      ],
      "sections": [
        {"tag": "main", "id": "content", "role": "main"},
        {"tag": "footer", "id": "footer", "role": null}
      ],
      "maincontent": {"tag": "main", "id": "content", "wordcount": 450},
      "meta": {
        "description": "Example company homepage",
        "canonical": "https://example.com/"
      }
    }
  ]
}
```
Use this to understand site architecture before scraping, map navigation flows, or audit SEO structure.
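As an illustration of how the structure report might be consumed, the sketch below turns `page_structures` into an indented heading outline and resolves each relative `nav_links` entry against its page URL. It assumes the field names shown in the example output; `outline` and `SAMPLE` are hypothetical names, and `SAMPLE` is a trimmed copy of that example rather than real tool output.

```python
from urllib.parse import urljoin

# Trimmed copy of the example report, used as stand-in data.
SAMPLE = {
    "base_url": "https://example.com",
    "page_structures": [
        {
            "url": "https://example.com",
            "headings": [
                {"level": 1, "text": "Welcome to Example"},
                {"level": 2, "text": "Our Services"},
            ],
            "nav_links": [
                {"text": "About", "href": "/about"},
                {"text": "Products", "href": "/products"},
            ],
        }
    ],
}


def outline(report: dict) -> list[str]:
    """Flatten page_structures into heading lines plus absolute nav links."""
    lines = []
    for page in report.get("page_structures", []):
        lines.append(page["url"])
        for heading in page.get("headings", []):
            # Indent by heading level so nesting is visible at a glance.
            lines.append("  " * heading["level"] + heading["text"])
        for link in page.get("nav_links", []):
            # Resolve relative hrefs against the page they were found on.
            lines.append("  -> " + urljoin(page["url"], link["href"]))
    return lines


if __name__ == "__main__":
    print("\n".join(outline(SAMPLE)))
```

With a real report, the dictionary would come from `json.load` on the file passed to `--output`.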
⚡ Performance Comparison
| Feature | Fast Browser Use (Rust) | Puppeteer (Node) | Selenium (Java) |
|---|---|---|---|
| Startup Time | < 50ms | ~800ms | ~2500ms |
| Memory Footprint | 15 MB | 100 MB+ | 200 MB+ |
| DOM Extract | Zero-Copy | JSON Serialize | Slow Bridge |

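
To put the table's figures in relative terms, the arithmetic can be spelled out as below. The numbers are copied from the table as stated there and have not been independently measured. Because the table gives an upper bound for this tool's startup time ("< 50ms") and lower bounds for the other tools' memory ("100 MB+", "200 MB+"), the resulting ratios are best read as "at least".

```python
# Figures copied from the comparison table (startup in ms, memory in MB).
STARTUP_MS = {"Fast Browser Use": 50, "Puppeteer": 800, "Selenium": 2500}
MEMORY_MB = {"Fast Browser Use": 15, "Puppeteer": 100, "Selenium": 200}


def ratios(table: dict, baseline: str = "Fast Browser Use") -> dict:
    """Return each other tool's figure divided by the baseline's, rounded to 0.1."""
    base = table[baseline]
    return {name: round(value / base, 1)
            for name, value in table.items() if name != baseline}


if __name__ == "__main__":
    print(ratios(STARTUP_MS))  # {'Puppeteer': 16.0, 'Selenium': 50.0}
    print(ratios(MEMORY_MB))   # {'Puppeteer': 6.7, 'Selenium': 13.3}
```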
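Capabilities & Tools
Vision & Extraction
- vision_map: Returns a screenshot overlay with numbered bounding boxes for all interactive elements.
- snapshot: Capture the raw HTML snapshot (YAML/Markdown optimized for AI).
- screenshot: Capture a visual image of the page.
- extract: Get structured data from the DOM.
- markdown: Convert the current page content to Markdown.
- sitemap: Analyze site structure via robots.txt, sitemaps, and page semantic analysis.
Navigation & Lifecycle
- navigate: Visit a specific URL.
- goback / goforward: Traverse browser history.
- wait: Pause execution or wait for specific conditions.
- newtab: Open a new browser tab.
- switchtab: Switch focus to a specific tab.
- closetab: Close the current or specified tab.
- tablist: List all open tabs.
- close: Terminate the browser session.
Interaction
- click: Click elements via CSS selectors or DOM indices.
- input: Type text into fields.
- press_key: Send specific keyboard events.
- hover: Hover over elements.
- scroll: Scroll the viewport.
- select: Choose options in dropdowns.
State & Debugging
- cookies: Manage session cookies (get/set).
- local_storage: Manage local storage data.
- debug: Access console logs and debug information.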
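Usage
This skill is specialized for complex web interactions that require maintaining state (like being logged in), handling dynamic JavaScript content, or managing multiple pages simultaneously. It offers higher performance and control compared to standard fetch-based tools.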