返回顶部
v

voice-memo-sync语音备忘同步

|

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.6.1
安全检测
已通过
293
下载量
免费
免费
0
收藏
概述
安装方式
版本历史

voice-memo-sync

Voice Memo Sync 🎙️

Intelligent voice/video transcription and organization system.
智能语音/视频转录与整理系统。



Quick Start / 快速开始

bash

Run installation script / 运行安装脚本


cd ~/.openclaw/workspace/skills/voice-memo-sync
./scripts/install.sh

What it does / 安装内容:

  1. 1. Creates data directory memory/voice-memos/ / 创建数据目录
  2. Creates config file config/voice-memo-sync.yaml / 创建配置文件
  3. Creates Apple Notes folder Voice Memos / 创建 Apple Notes 文件夹
  4. Checks dependencies and prompts installation / 检查依赖并提示安装



When to Use / 何时使用

USE this skill when user:

  • - Sends voice/audio/video files / 发送语音/音频/视频文件
  • Sends YouTube/Bilibili URLs / 发送 YouTube/B站 链接
  • Sends transcript text files / 发送转录文本文件
  • Says sync voice memos, process recording, organize this video
  • 说「同步语音备忘录」「处理录音」「整理这个视频」

DO NOT use when:

  • - User just wants to play audio/video / 用户只想播放音视频
  • User asks about music/podcasts without transcription needs / 询问音乐/播客但不需要转录



Supported Formats / 支持格式

⚡ Metal GPU Acceleration (NEW)

On Apple Silicon, whisper-cpp provides 15-20x faster transcription:

AudioCPU (openai-whisper)Metal GPU (whisper-cpp)
5 min~5 min~20 sec
30 min
~30 min | ~2 min |
| 60 min | ~60 min | ~4 min |

bash

Install for Metal acceleration (recommended)


brew install whisper-cpp

The skill auto-detects and uses Metal when available.

Type / 类型Formats / 格式Processing / 处理方式
Voice Memos.qta, .m4aApple native (QTA metadata) → Whisper fallback
Audio
.mp3, .wav, .aac, .flac | Whisper local transcription |
| Video | .mp4, .mov, .mkv, .webm | ffmpeg extract → Whisper |
| YouTube | URL | summarize CLI → yt-dlp fallback |
| Bilibili | URL | yt-dlp download → Whisper |
| Text | .txt, .md | Direct read, skip transcription |
| Documents | .doc, .docx | textutil convert → process |
| Structured | .json, .csv | Parse and extract text |
| iCloud | Configured paths | Scheduled sync |


Processing Pipeline / 处理流程

Input (File/URL/Text)


┌─────────────────────────────────────┐
│ 1. Source Detection │
│ 来源识别 │
│ Voice Memo / URL / File / Text │
└─────────────────┬───────────────────┘


┌─────────────────────────────────────┐
│ 2. Save Source Metadata │
│ 保存源信息 │
│ → memory/voice-memos/sources/ │
└─────────────────┬───────────────────┘


┌─────────────────────────────────────┐
│ 3. Transcription │
│ 转录提取 │
│ Priority: Apple > Text > summarize│
│ > Whisper-local > API │
└─────────────────┬───────────────────┘


┌─────────────────────────────────────┐
│ 4. Save Raw Transcript │
│ 保存原始转录 │
│ → memory/voice-memos/transcripts/ │
└─────────────────┬───────────────────┘


┌─────────────────────────────────────┐
│ 5. LLM Deep Processing │
│ LLM深度整理 │
│ • Read USER.md & MEMORY.md │
│ • Clean up spoken language │
│ • Extract key points & insights │
│ • Identify TODOs & connections │
└─────────────────┬───────────────────┘


┌─────────────────────────────────────┐
│ 6. Save Processed Result │
│ 保存处理结果 │
│ → memory/voice-memos/processed/ │
└─────────────────┬───────────────────┘

┌───────┴───────┐
▼ ▼
┌─────────────────┐ ┌─────────────────┐
│ 7a. Apple Notes │ │ 7b. Reminders │
│ Structured note │ │ Create TODOs │
│ with #hashtags │ │ 创建提醒 │
└────────┬────────┘ └────────┬───────┘
│ │
└─────────┬─────────┘

┌─────────────────────────────────────┐
│ 8. Update Index │
│ 更新索引 │
│ → memory/voice-memos/INDEX.md │
└─────────────────────────────────────┘



Data Structure / 数据结构

memory/voice-memos/ # All data, searchable via memory_search
├── INDEX.md # Processing records index / 处理记录索引
├── sources/ # Original file metadata / 原始文件元数据
│ └── YYYY-MM-DD_xxx.json
├── transcripts/ # Raw transcripts / 原始转录文本
│ └── YYYY-MM-DDsourcetitle.md
├── processed/ # LLM processed content / LLM处理后内容
│ └── YYYY-MM-DDsourcetitle.md
└── synced/ # Sync records / 同步记录
└── YYYY-MM-DDsourcetitle.json



Apple Notes Output Format / 输出格式

The skill reads USER.md, SOUL.md, and MEMORY.md to provide personalized analysis:

  • - Deep insights tailored to users research/work focus
  • Connections to active projects and ongoing interests
  • Actionable recommendations based on users decision style
  • Critical thinking that challenges assumptions

处理时会读取 USER.md、SOUL.md 和 MEMORY.md 提供个性化分析

  • - 结合用户研究/工作重点的深度洞察
  • 与活跃项目和持续关注领域的关联
  • 基于用户决策风格的行动建议
  • 挑战假设的批判性思考

🎙️ [Auto-generated Title / 智能生成的标题]

📅 Date | ⏱️ Duration | 👤 Source
🏷️ #tag1 #tag2 #tag3

━━━━━━━━━━━━━━━━━━━━━━

📌 Summary / 核心摘要
[One paragraph summarizing the content]

🎯 Key Points / 关键要点
• Point 1
• Point 2
• Point 3

💡 Deep Analysis & Reflection (For User) / 深度分析与反思
[Personalized analysis connecting to users:
- Current research directions (from MEMORY.md)
- Active projects and interests (from USER.md)
- Decision-making style and preferences
- Critical counter-arguments and blind spots]

📋 Action Items / 行动建议
☐ Research: [specific to users academic work]
☐ Business: [relevant to startup/investment focus]
☐ Content: [ideas for courses/articles]

🔗 Related Connections / 相关联系
• Connection to [project/memory]
• Recommended reading/research

💬 Notable Quotes / 金句摘录
• Quote 1
• Quote 2

━━━━━━━━━━━━━━━━━━━━━━

📝 Original Transcript (Cleaned) / 原始转录(已整理)
[Full transcript text, cleaned up from spoken language / 完整转录,已整理口语表达]



QTA File Format / QTA文件格式 (Technical Reference)

Apple Voice Memos on iOS/macOS 14+ uses .qta (QuickTime Audio) files that embed native transcription directly in the file metadata.

Structure

QTA File
├── ftyp (file type marker: qt )
├── wide (extended marker)
├── mdat (audio data, typically 90%+ of file size)
└── moov (metadata container)
├── mvhd (movie header)
└── trak (one or more tracks)
├── tkhd (track header)
├── mdia (media data)
└── meta (metadata - TRANSCRIPTION HERE!)
├── hdlr (handler: mdta)
├── keys (key list: com.apple.VoiceMemos.tsrp)
└── ilst (data list)

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 voice-memo-sync-1776187093 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 voice-memo-sync-1776187093 技能

通过命令行安装

skillhub install voice-memo-sync-1776187093

下载

⬇ 下载 voice-memo-sync v1.6.1(免费)

文件大小: 31.53 KB | 发布时间: 2026-4-15 13:10

v1.6.1 最新 2026-4-15 13:10
## Voice Memo Sync 1.6.1

- Documentation updates in SKILL.md.
- Version number updated to 1.6.1.
- No code or functionality changes; release focuses on keeping documentation current.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部