gemini-computer-use

Build and run Gemini 2.5 Computer Use browser-control agents with Playwright. Use when a user wants to automate web browser tasks via the Gemini Computer Use model, needs an agent loop (screenshot → function_call → action → function_response), or asks to integrate safety confirmation for risky UI actions.

Author: admin | Source: ClawHub
Version: 1.0.0
Security check: Passed
Downloads: 3,773
Favorites: 5

# Gemini Computer Use

## Quick start

1. Source the env file and set your API key:

   ```bash
   cp env.example env.sh
   $EDITOR env.sh
   source env.sh
   ```

2. Create a virtual environment and install dependencies:

   ```bash
   python -m venv .venv
   source .venv/bin/activate
   pip install google-genai playwright
   playwright install chromium
   ```

3. Run the agent script with a prompt:

   ```bash
   python scripts/computer_use_agent.py \
     --prompt "Find the latest blog post title on example.com" \
     --start-url "https://example.com" \
     --turn-limit 6
   ```

## Browser selection

- Default: Playwright's bundled Chromium (no env vars required).
- Choose a channel (Chrome/Edge) with `COMPUTER_USE_BROWSER_CHANNEL`.
- Use a custom Chromium-based executable (e.g., Brave) with `COMPUTER_USE_BROWSER_EXECUTABLE`.

If both are set, `COMPUTER_USE_BROWSER_EXECUTABLE` takes precedence.

## Core workflow (agent loop)

1. Capture a screenshot and send the user goal + screenshot to the model.
2. Parse `function_call` actions in the response.
3. Execute each action in Playwright.
4. If a `safety_decision` is `require_confirmation`, prompt the user before executing.
5. Send `function_response` objects containing the latest URL + screenshot.
6. Repeat until the model returns only text (no actions) or you hit the turn limit.

## Operational guidance

- Run in a sandboxed browser profile or container.
- Use `--exclude` to block risky actions you do not want the model to take.
- Keep the viewport at 1440x900 unless you have a reason to change it.

## Resources

- Script: `scripts/computer_use_agent.py`
- Reference notes: `references/google-computer-use.md`
- Env template: `env.example`
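
The six steps under "Core workflow (agent loop)" can be sketched as a small control loop. This is an illustration only and not the code in `scripts/computer_use_agent.py`: `Action`, `ModelReply`, `run_agent_loop`, and the four callables (`ask_model`, `execute`, `observe`, `confirm`) are hypothetical stand-ins for the Gemini call, the Playwright step, the URL + screenshot capture, and the user prompt. Stopping the run when the user declines a confirmation is also an assumption; the workflow above only says to prompt first.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Action:
    """One function_call from the model (hypothetical shape)."""
    name: str
    args: dict = field(default_factory=dict)
    safety_decision: Optional[str] = None  # e.g. "require_confirmation"


@dataclass
class ModelReply:
    """A model turn: free text plus zero or more actions (hypothetical shape)."""
    text: str = ""
    actions: List[Action] = field(default_factory=list)


def run_agent_loop(
    goal: str,
    ask_model: Callable[[dict], ModelReply],  # stand-in for the model call
    execute: Callable[[Action], None],        # stand-in for the Playwright step
    observe: Callable[[], dict],              # returns latest URL + screenshot
    confirm: Callable[[Action], bool],        # asks the user about risky actions
    turn_limit: int = 6,
) -> str:
    # Step 1: send the goal together with the first observation.
    message = {"goal": goal, **observe()}
    for _ in range(turn_limit):
        reply = ask_model(message)
        # Step 6: a text-only reply ends the loop.
        if not reply.actions:
            return reply.text
        responses = []
        for action in reply.actions:
            # Step 4: prompt before executing anything flagged by the model.
            if action.safety_decision == "require_confirmation":
                if not confirm(action):
                    # Assumption: declining stops the whole run.
                    return "Stopped: user declined " + action.name
            # Step 3: perform the action in the browser.
            execute(action)
            # Step 5: pair each action with a fresh URL + screenshot.
            responses.append({"name": action.name, **observe()})
        message = {"function_responses": responses}
    return "Stopped: turn limit reached"
```

Passing the model, browser, and prompt in as callables keeps the loop testable without a network connection or a real browser: a scripted `ask_model` and a list-appending `execute` are enough to exercise the stop conditions.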
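
The precedence rule under "Browser selection" (executable over channel, bundled Chromium otherwise) could be resolved along these lines. This is a minimal sketch, not the script's own logic: `browser_launch_options` is a hypothetical helper, and treating an empty or whitespace-only variable as unset is an assumption. The returned keys, `executable_path` and `channel`, are keyword arguments accepted by Playwright's `chromium.launch()` in Python.

```python
import os
from typing import Mapping, Optional


def browser_launch_options(env: Optional[Mapping[str, str]] = None) -> dict:
    """Map the two env vars to keyword arguments for chromium.launch()."""
    env = os.environ if env is None else env
    # Assumption: blank values count as "not set".
    executable = env.get("COMPUTER_USE_BROWSER_EXECUTABLE", "").strip()
    channel = env.get("COMPUTER_USE_BROWSER_CHANNEL", "").strip()
    if executable:
        # A custom executable wins when both variables are set.
        return {"executable_path": executable}
    if channel:
        return {"channel": channel}
    # Neither set: an empty dict means Playwright's bundled Chromium.
    return {}
```

With Playwright installed, the result would be used as `playwright.chromium.launch(**browser_launch_options())`.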

Tags

skill, ai

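Install via chat

This skill can be installed via chat on the following platforms:

OpenClaw, WorkBuddy, QClaw, Kimi, Claude

Option 1: Install SkillHub and the skill

Help me install SkillHub and the gemini-computer-use-1776391923 skill

Option 2: Set SkillHub as the preferred skill installation source

Set SkillHub as my preferred skill installation source, then help me install the gemini-computer-use-1776391923 skill

Install via command line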

skillhub install gemini-computer-use-1776391923

Download Zip package

gemini-computer-use v1.0.0

File size: 5.66 KB | Published: 2026-4-17 13:49

Version history

v1.0.0 (latest), 2026-4-17 13:49
Initial release - Gemini 2.5 Computer Use browser-control agents with Playwright
