aissia

StillHammer/aissia

Fork 0

Commit Graph

Author	SHA1	Message	Date
StillHammer	ce2b25a599	Migration Gitea - sauvegarde locale 2025-12-04 18:58	2025-12-04 18:58:32 +08:00
StillHammer	d7971e0c34	feat: Phase 7 STT - Complete Windows setup with Whisper.cpp Added Speech-to-Text configuration and testing infrastructure: ## STT Engines Configured - ✅ Whisper.cpp (local, offline) - base model downloaded (142MB) - ✅ OpenAI Whisper API - configured with existing API key - ✅ Google Speech-to-Text - configured with existing API key - ⚠️ Azure STT - optional (not configured) - ⚠️ Deepgram - optional (not configured) ## New Files - `docs/STT_SETUP.md` - Complete Windows STT setup guide - `test_stt_live.cpp` - Test tool for all 5 STT engines - `create_test_audio_simple.py` - Generate test audio (440Hz tone, 16kHz WAV) - `create_test_audio.py` - Generate speech audio (requires gtts) - `models/ggml-base.bin` - Whisper.cpp base model (gitignored) - `test_audio.wav` - Generated test audio (gitignored) ## Documentation - Complete setup guide for all STT engines - API key configuration instructions - Model download links and recommendations - Troubleshooting section - Cost comparison for cloud APIs ## Next Steps - Compile test_stt_live.cpp to validate all engines - Test with real audio input - Integrate into VoiceModule via pub/sub 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-30 17:12:37 +08:00

Author

SHA1

Message

Date

StillHammer

ce2b25a599

Migration Gitea - sauvegarde locale 2025-12-04 18:58

2025-12-04 18:58:32 +08:00

StillHammer

d7971e0c34

feat: Phase 7 STT - Complete Windows setup with Whisper.cpp

Added Speech-to-Text configuration and testing infrastructure:

## STT Engines Configured
- ✅ Whisper.cpp (local, offline) - base model downloaded (142MB)
- ✅ OpenAI Whisper API - configured with existing API key
- ✅ Google Speech-to-Text - configured with existing API key
- ⚠️ Azure STT - optional (not configured)
- ⚠️ Deepgram - optional (not configured)

## New Files
- `docs/STT_SETUP.md` - Complete Windows STT setup guide
- `test_stt_live.cpp` - Test tool for all 5 STT engines
- `create_test_audio_simple.py` - Generate test audio (440Hz tone, 16kHz WAV)
- `create_test_audio.py` - Generate speech audio (requires gtts)
- `models/ggml-base.bin` - Whisper.cpp base model (gitignored)
- `test_audio.wav` - Generated test audio (gitignored)

## Documentation
- Complete setup guide for all STT engines
- API key configuration instructions
- Model download links and recommendations
- Troubleshooting section
- Cost comparison for cloud APIs

## Next Steps
- Compile test_stt_live.cpp to validate all engines
- Test with real audio input
- Integrate into VoiceModule via pub/sub

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-11-30 17:12:37 +08:00

2 Commits