browser-use

快速开始

python
import asyncio
from skills.browseruse.scripts.runagent import stealthsession, geminillm
from browser_use import Agent

async def run(task: str):
llm = gemini_llm() # 免费 — Google Cloud Code Assist OAuth
session = stealth_session() # 反机器人加固
agent = Agent(task=task, llm=llm, browser_session=session)
result = await agent.run()
return result.final_result()

print(asyncio.run(run(访问 example.com 并返回页面标题)))

通过 uv run python script.py 运行脚本（切勿直接使用 python3）。

⚠️ 反检测规则（强制 — 2026-03-07 加固）

以下规则不可协商。每个 BrowserSession 必须应用这些规则，否则 X/Google 等平台会拦截你。

1. 始终使用 stealth_session()

python
from skills.browseruse.scripts.runagent import stealth_session
session = stealth_session()

底层实现：

- --disable-blink-features=AutomationControlled — 隐藏 Chromium 自动化标志
通过 addinitscript 将 navigator.webdriver 伪装为 undefined
真实 Chrome 用户代理字符串（非 Chromium/无头模式）
真实视口尺寸（1366×768）

2. 类人输入 — 切勿使用全速的 fill() 或 page.type()

python

❌ 错误 — 立即触发机器人检测

await page.fill([data-testid=textarea], tweet_text)

✅ 正确 — 使用 keyboard.type 并添加可变延迟

for char in text: await page.keyboard.type(char, delay=random.randint(30, 120)) if random.random() < 0.05: await page.waitfortimeout(random.randint(200, 600))

3. 每次操作之间添加随机延迟

python
await page.waitfortimeout(random.randint(800, 2000)) # 点击前
await element.click()
await page.waitfortimeout(random.randint(500, 1500)) # 点击后

4. 直接导航到操作 URL — 跳过首页/落地页

python

❌ 先导航到首页再找发布按钮

await page.goto(https://x.com/home)

✅ 直接前往操作页面

await page.goto(https://x.com/compose/post)

5. 移除 GraphQL 的 [DONE] 验证 — 仅使用 UI

X 的 GraphQL（CreateTweet）即使使用有效 Cookie 也会返回 226 自动化错误。
始终通过 UI（发布框 → 发布按钮）发帖，切勿通过 API。

LLM 配置

选项 A：通过 Cloud Code Assist 使用 Google Gemini（免费 — 推荐）

已通过你的 google-gemini-cli OAuth 认证。无需 API 密钥。

python
from skills.browseruse.scripts.runagent import gemini_llm

llm = gemini_llm(model=gemini-2.5-flash) # 默认 — 快速 + 免费

llm = gemini_llm(model=gemini-2.5-pro) # 更强推理能力

底层使用 cloudcode-pa.googleapis.com/v1internal — 与 OpenClaw 相同的端点。
令牌从 ~/.openclaw/agents/main/agent/auth.json 自动刷新。

选项 B：Anthropic（需要直接 API 密钥）

python
from langchain_anthropic import ChatAnthropic
llm = ChatAnthropic(model=claude-sonnet-4-5, timeout=60)

选项 C：Groq（免费，但不支持 JSON schema — 不要用于 browser-use）

Groq 的 llama-3.3-70b-versatile 缺少 json_schema 响应格式 → browser-use
会失败。请使用 Gemini 或 Anthropic。

BrowserSession 选项

python
from skills.browseruse.scripts.runagent import stealth_session

session = stealth_session(
headless=True, # 服务器端设为 True；本地调试设为 False
inject_cookies=None, # 要注入的 Cookie 字典列表（用于预认证）
)

预认证会话（Cookie 注入）

python
session = stealthsession(injectcookies=[
{name: auth_token, value: TOKEN, domain: .x.com, path: /, secure: True, httpOnly: True, sameSite: None},
{name: ct0, value: CT0, domain: .x.com, path: /, secure: True, sameSite: None},
])

结构化输出

python
from pydantic import BaseModel

class Result(BaseModel):
title: str
price: float

agent = Agent(task=..., llm=llm, outputmodelschema=Result)
history = await agent.run()
data = history.final_result() # 解析后的 Result 实例

敏感数据

在不向 LLM 暴露凭据的情况下传递凭据：

python
agent = Agent(
task=使用用户名 {user} 和密码 {pass} 登录,
llm=llm,
sensitive_data={user: alex@example.com, pass: secret},
)

身份信息（Alex Chen）

注册服务时使用：

- 姓名：Alex Chen
出生日期：1998年1月18日
邮箱：alex.chen31337@gmail.com
验证码是技术障碍 — 正常通过
通过 sensitive_data={} 传递凭据 — 切勿硬编码在任务字符串中

常见模式

参见 references/patterns.md 了解：

- 社交媒体发布（X、Reddit、LinkedIn）的隐身操作
表单提交与登录流程
带分页的多页面抓取
预认证会话的 Cookie 注入
错误处理与重试

环境变量

ANTHROPICAPIKEY # 用于 ChatAnthropic（使用 gemini_llm 时可省略）
BROWSERUSEHEADLESS # 设为 false 可在本地查看
CHROMIUM_PATH # 默认：/usr/bin/chromium-browser

browser-use浏览器使用

browser-use

browser-use

Quick Start

⚠️ Anti-Detection Rules (MANDATORY — hardened 2026-03-07)

1. Always use stealth_session()

2. Human-like typing — NEVER use fill() or page.type() at full speed

3. Random delays between every action

4. Navigate directly to action URLs — skip home/landing pages

5. Remove [DONE] verification from GraphQL — use UI only

LLM Setup

Option A: Google Gemini via Cloud Code Assist (FREE — preferred)

Option B: Anthropic (direct API key required)

Option C: Groq (free, but no JSON schema support — don't use for browser-use)

BrowserSession Options

Pre-authenticated session (cookie injection)

Structured Output

Sensitive Data

Identity (Alex Chen)

Common Patterns

Env Vars

browser-use

快速开始

⚠️ 反检测规则（强制 — 2026-03-07 加固）

1. 始终使用 stealth_session()

2. 类人输入 — 切勿使用全速的 fill() 或 page.type()

❌ 错误 — 立即触发机器人检测

✅ 正确 — 使用 keyboard.type 并添加可变延迟

3. 每次操作之间添加随机延迟

4. 直接导航到操作 URL — 跳过首页/落地页

❌ 先导航到首页再找发布按钮

✅ 直接前往操作页面

5. 移除 GraphQL 的 [DONE] 验证 — 仅使用 UI

LLM 配置

选项 A：通过 Cloud Code Assist 使用 Google Gemini（免费 — 推荐）

llm = gemini_llm(model=gemini-2.5-pro) # 更强推理能力

选项 B：Anthropic（需要直接 API 密钥）

选项 C：Groq（免费，但不支持 JSON schema — 不要用于 browser-use）

BrowserSession 选项

预认证会话（Cookie 注入）

结构化输出

敏感数据

身份信息（Alex Chen）

常见模式

环境变量

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement

1. Always use `stealth_session()`

2. Human-like typing — NEVER use `fill()` or `page.type()` at full speed

5. Remove `[DONE]` verification from GraphQL — use UI only