crawler: Web Crawler Guide

Web crawling and scraping reference — robots.txt protocol, Scrapy framework, anti-bot detection, headless browsers, and legal considerations

Author: admin | Source: ClawHub

Version: V 3.0.0 | Security check: Passed | Downloads: 682 | Price: Free | Favorites: 0

Crawler

Web crawling and scraping reference — robots.txt protocol, Scrapy framework, anti-bot detection, headless browsers, and legal considerations. No API keys or credentials required — outputs reference documentation only.

Commands

Command	Description
intro	Crawling vs scraping, robots.txt, sitemap
standards	HTTP caching, structured data, meta tags
troubleshooting	Anti-bot detection, JS rendering, encoding
performance	Concurrency, dedup, incremental, distributed
security	Legal landscape, ethical guidelines, proxies
migration	BeautifulSoup to Scrapy, requests to Playwright
cheatsheet	Scrapy commands, CSS/XPath, curl, user-agents
faq	Legality, JS pages, blocking, storage
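As an illustration of the robots.txt topic listed under the intro command, here is a minimal sketch of checking crawl permissions with Python's standard-library urllib.robotparser. It is not taken from the skill's own output. The robots.txt content, the example-bot user agent, and the example.com URLs are all assumptions made for the example, and the rules are parsed from an inline string so that no network access is needed.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, inlined so the sketch needs no network access.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Crawl-delay: 2
"""


def build_parser(robots_text: str) -> RobotFileParser:
    """Return a parser loaded with the given robots.txt text."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines and marks the rules as loaded.
    parser.parse(robots_text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "example-bot" is an assumed user-agent name; it falls under the "*" rules.
print(parser.can_fetch("example-bot", "https://example.com/index.html"))
print(parser.can_fetch("example-bot", "https://example.com/private/data"))
print(parser.crawl_delay("example-bot"))
```

In a real crawler the parser would usually be pointed at a live file with set_url() and read(), and the crawl delay, when present, would be used to space out requests to that host.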

Output Format

All commands output plain-text reference documentation via heredoc. No external API calls, no credentials needed, no network access.

Powered by BytesAgain | bytesagain.com | hello@bytesagain.com


Tags

skill ai

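Install via chat

This skill can be installed via chat on the following platforms:

OpenClaw WorkBuddy QClaw Kimi Claude

Option 1: Install SkillHub and the skill

Help me install SkillHub and the crawler-1776080714 skill

Option 2: Set SkillHub as the preferred skill installation source

Set SkillHub as my preferred skill installation source, then help me install the crawler-1776080714 skill

Install via command line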

skillhub install crawler-1776080714

Download

Download crawler v3.0.0 (free)

File size: 8.55 KB | Published: 2026-4-15 12:21

v3.0.0 (latest) 2026-4-15 12:21

Clean package with matching SKILL.md and script
