返回顶部
G

Gbrow

Full-featured headless browser for OpenClaw agents. Navigate, snapshot with accessibility tree (@ref clicks), tabs, JS execution, cookie import. No vision model needed — free, fast, reliable.

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.0.0
安全检测
已通过
52
下载量
0
收藏
概述
安装方式
版本历史

Gbrow

# Gbrow — The Browser Your AI Agent Actually Needs A full-featured headless browser powered by [Playwright](https://playwright.dev/) and [Bun](https://bun.sh/). Uses the **accessibility tree** for page reading — not expensive vision models. ## Why Gbrow? | Traditional (screenshots + vision) | Gbrow (accessibility tree) | |---|---| | Screenshot → upload to GPT-4o → wait → read | `ariaSnapshot()` → instant structured text | | ~$0.01 per page read | **Free** | | 3-10 seconds per page | **< 100ms** | | Fails on API key issues | **Always works** | | Click by fragile CSS selector | **Click by @ref** (`@e1`, `@e2`, etc.) | ## Quick Setup ```bash # Clone and install git clone https://github.com/ashish797/Gbrow.git ~/.openclaw/workspace/skills/Gbrow cd ~/.openclaw/workspace/skills/Gbrow bash setup.sh ``` Or one-liner: ```bash curl -fsSL https://raw.githubusercontent.com/ashish797/Gbrow/main/setup.sh | bash ``` ## How It Works ### 1. Start the server ```bash cd ~/.openclaw/workspace/skills/Gbrow bun run src/server.ts ``` ### 2. Read the page (accessibility tree) The snapshot gives you a structured view with clickable refs: ``` @e1 [heading] "Welcome" [level=1] @e2 [link] "Get Started" @e3 [button] "Sign in" @e4 [textbox] "Search" ``` ### 3. Click by ref ``` click @e2 → clicks "Get Started" fill @e4 "query" → types into search box ``` ## Commands ### Navigation | Command | Description | Example | |---------|-------------|---------| | `goto <url>` | Navigate to URL | `goto https://example.com` | | `back` | History back | `back` | | `forward` | History forward | `forward` | | `reload` | Reload page | `reload` | | `url` | Print current URL | `url` | ### Reading | Command | Description | Example | |---------|-------------|---------| | `snapshot` | Accessibility tree with @refs | `snapshot -i` (interactive only) | | `text` | Cleaned page text | `text` | | `html [selector]` | Raw HTML | `html .article` | | `links` | All links as "text → href" | `links` | | `forms` | Form fields as JSON | `forms` | ### Interaction | Command | Description | Example | |---------|-------------|---------| | `click <ref>` | Click element | `click @e3` | | `fill <ref> <text>` | Fill input | `fill @e4 "hello"` | | `select <ref> <value>` | Select dropdown | `select @e5 "option1"` | | `type <ref> <text>` | Type with keyboard | `type @e4 "search term"` | | `press <key>` | Press key | `press Enter` | | `scroll <direction>` | Scroll page | `scroll down` | ### Inspection | Command | Description | Example | |---------|-------------|---------| | `js <expr>` | Run JavaScript | `js document.title` | | `css <sel> <prop>` | Computed CSS | `css .box color` | | `attrs <ref>` | Element attributes | `attrs @e1` | | `is <prop> <ref>` | State check | `is visible @e3` | ### Tabs | Command | Description | |---------|-------------| | `tabs` | List open tabs | | `tab N` | Switch to tab N | | `newtab` | Open new tab | | `closetab` | Close current tab | ### Visual | Command | Description | |---------|-------------| | `screenshot` | Take screenshot | | `responsive <w> <h>` | Set viewport size | | `pdf` | Save page as PDF | ## Snapshot Flags | Flag | Description | |------|-------------| | `-i` | Interactive elements only (buttons, links, inputs) | | `-c` | Compact (remove empty structural nodes) | | `-d N` | Limit tree depth | | `-s <sel>` | Scope to CSS selector | | `-D` | Diff against previous snapshot | | `-a` | Annotated screenshot with ref overlays | ## HTTP API All commands go through the HTTP API: ```bash # Get port and token from state file PORT=$(python3 -c "import json; print(json.load(open('.gstack/browse.json'))['port'])") TOKEN=$(python3 -c "import json; print(json.load(open('.gstack/browse.json'))['token'])") # Send command curl -s -X POST "http://127.0.0.1:${PORT}/command" \ -H "Authorization: Bearer ${TOKEN}" \ -H "Content-Type: application/json" \ -d '{"command":"goto","args":["https://example.com"]}' ``` ## Architecture ``` ┌─────────────┐ HTTP ┌──────────────────┐ │ OpenClaw │ ──────────▶ │ Gbrow Server │ │ Agent │ │ (Bun + Playwright)│ └─────────────┘ └────────┬─────────┘ │ ▼ ┌──────────────────┐ │ Chromium │ │ (headless) │ └──────────────────┘ │ ▼ ┌──────────────────┐ │ Accessibility │ │ Tree (ariaSnapshot)│ └──────────────────┘ ``` No vision models. No API calls. Just structured text from the browser's accessibility layer. ## Credits Built on top of [gstack](https://github.com/garrytan/gstack) by Gary Tan (Y Combinator). Adapted for OpenClaw with permission under MIT license. ## License MIT

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 gbrow-1775883861 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 gbrow-1775883861 技能

通过命令行安装

skillhub install gbrow-1775883861

下载 Zip 包

⬇ 下载 Gbrow v1.0.0

文件大小: 105.89 KB | 发布时间: 2026-4-12 10:04

v1.0.0 最新 2026-4-12 10:04
Initial release: Full-featured headless browser for OpenClaw agents. Accessibility tree snapshot, @ref clicks, tabs, JS execution. No vision model needed.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部