gemini-voice-assistant

# Gemini Voice Assistant

A voice-to-voice AI assistant powered by Google's Gemini Live API. Speak to the AI and it responds with natural-sounding voice.

## Usage

### Text Mode

```bash
cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py "Your question or message"
```

### Voice Mode

```bash
cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py --audio /path/to/audio.ogg "optional context"
```

## Response Format

The handler returns a JSON response:

```json
{
  "message": "[[audio_as_voice]]\nMEDIA:/tmp/gemini_voice_xxx.ogg",
  "text": "Text response from Gemini"
}
```

## Configuration

Set your Gemini API key:

```bash
export GEMINI_API_KEY="your-api-key-here"
```

Or create a `.env` file in the skill directory:

```
GEMINI_API_KEY=your-api-key-here
```

## Model Options

The default model is `gemini-2.5-flash-native-audio-preview-12-2025` for audio support.

To use a different model, edit `handler.py`:

```python
MODEL = "gemini-2.0-flash-exp"  # For text-only
```

## Requirements

- `google-genai>=1.0.0`
- `numpy>=1.24.0`
- `soundfile>=0.12.0`
- `librosa>=0.10.0` (for audio input)
- FFmpeg (for audio conversion)

## Features

- 🎙️ Voice input/output support
- 💬 Text conversations
- 🔧 Configurable system instructions
- ⚡ Fast responses with Gemini Flash
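
## Example: Reading the Response

A caller usually needs to separate the audio file path from the text in the JSON described under Response Format. The following is a minimal sketch, assuming the output always has the shape shown there: a `message` field holding the `[[audio_as_voice]]` marker and a `MEDIA:` line, plus a `text` field. The function name `parse_handler_response` and the returned keys are hypothetical and not part of `handler.py`.

```python
import json

# Markers taken from the documented response format.
AUDIO_MARKER = "[[audio_as_voice]]"
MEDIA_PREFIX = "MEDIA:"


def parse_handler_response(raw: str) -> dict:
    """Split the handler's JSON output into text and an optional audio path.

    Hypothetical helper: assumes the JSON shape shown under Response Format.
    """
    data = json.loads(raw)
    message = data.get("message", "")

    # Look for the first line that carries a media path.
    audio_path = None
    for line in message.splitlines():
        line = line.strip()
        if line.startswith(MEDIA_PREFIX):
            audio_path = line[len(MEDIA_PREFIX):]
            break

    return {
        "is_voice": AUDIO_MARKER in message,
        "audio_path": audio_path,
        "text": data.get("text", ""),
    }


# Sample input copied from the documented response format.
sample = json.dumps({
    "message": "[[audio_as_voice]]\nMEDIA:/tmp/gemini_voice_xxx.ogg",
    "text": "Text response from Gemini",
})
result = parse_handler_response(sample)
print(result["audio_path"])
print(result["text"])
```

In practice the `raw` string would come from the handler's standard output, for example captured with `subprocess.run(..., capture_output=True, text=True)`. If the handler returns a text-only response with no `MEDIA:` line, this sketch leaves `audio_path` as `None`.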
