返回顶部
🇺🇸 English
🇨🇳 简体中文
🇨🇳 繁體中文
🇺🇸 English
🇯🇵 日本語
🇰🇷 한국어
🇫🇷 Français
🇩🇪 Deutsch
🇪🇸 Español
🇷🇺 Русский
D

Datasets

Browse and load ready-to-use AI/ML datasets with fast manipulation. Use when searching datasets, loading training data, transforming formats.

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 2.0.0
安全检测
已通过
167
下载量
0
收藏
概述
安装方式
版本历史

Datasets

# Datasets A data processing toolkit for ingesting, transforming, querying, and managing dataset entries from the command line. All operations are logged with timestamps and stored locally. ## Commands ### Data Operations Each data command works in two modes: run without arguments to view recent entries, or pass input to record a new entry. | Command | Description | |---------|-------------| | `datasets ingest <input>` | Ingest data — record a new ingest entry or view recent ones | | `datasets transform <input>` | Transform data — record a transformation or view recent ones | | `datasets query <input>` | Query data — record a query or view recent ones | | `datasets filter <input>` | Filter data — record a filter operation or view recent ones | | `datasets aggregate <input>` | Aggregate data — record an aggregation or view recent ones | | `datasets visualize <input>` | Visualize data — record a visualization or view recent ones | | `datasets export <input>` | Export data — record an export entry or view recent ones | | `datasets sample <input>` | Sample data — record a sample or view recent ones | | `datasets schema <input>` | Schema management — record a schema entry or view recent ones | | `datasets validate <input>` | Validate data — record a validation or view recent ones | | `datasets pipeline <input>` | Pipeline management — record a pipeline step or view recent ones | | `datasets profile <input>` | Profile data — record a profile or view recent ones | ### Utility Commands | Command | Description | |---------|-------------| | `datasets stats` | Show summary statistics — entry counts per category, total entries, disk usage | | `datasets export <fmt>` | Export all data to a file (formats: `json`, `csv`, `txt`) | | `datasets search <term>` | Search all log files for a term (case-insensitive) | | `datasets recent` | Show last 20 entries from activity history | | `datasets status` | Health check — version, data directory, entry count, disk usage, last activity | | `datasets help` | Show available commands | | `datasets version` | Show version (v2.0.0) | ## Data Storage All data is stored locally at `~/.local/share/datasets/`: - Each data command writes to its own log file (e.g., `ingest.log`, `transform.log`) - Entries are stored as `timestamp|value` pairs (pipe-delimited) - All actions are tracked in `history.log` with timestamps - Export generates files in the data directory (`export.json`, `export.csv`, or `export.txt`) ## Requirements - Bash (with `set -euo pipefail`) - Standard Unix utilities: `date`, `wc`, `du`, `grep`, `tail`, `cat`, `sed` - No external dependencies or API keys required ## When to Use - To log and track data processing operations (ingest, transform, query, etc.) - To maintain a searchable history of data pipeline activities - To export accumulated records in JSON, CSV, or plain text format - As part of larger automation or data-pipeline workflows - When you need a lightweight, local-only dataset operation tracker ## Examples ```bash # Record a new ingest entry datasets ingest "loaded training_data.csv 10000 rows" # View recent transform entries datasets transform # Record a query datasets query "filter by date > 2026-01-01" # Search across all logs datasets search "training" # Export everything as JSON datasets export json # Check overall statistics datasets stats # View recent activity datasets recent # Health check datasets status ``` --- Powered by BytesAgain | bytesagain.com | hello@bytesagain.com 💬 Feedback & Feature Requests: https://bytesagain.com/feedback

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 datasets-1776125702 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 datasets-1776125702 技能

通过命令行安装

skillhub install datasets-1776125702

下载 Zip 包

⬇ 下载 Datasets v2.0.0

文件大小: 4.09 KB | 发布时间: 2026-4-17 14:36

v2.0.0 最新 2026-4-17 14:36
publish v2.0.0

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部