Commit Graph

2 Commits

Author SHA1 Message Date
ce2b25a599 Migration Gitea - sauvegarde locale 2025-12-04 18:58 2025-12-04 18:58:32 +08:00
d7971e0c34 feat: Phase 7 STT - Complete Windows setup with Whisper.cpp
Added Speech-to-Text configuration and testing infrastructure:

## STT Engines Configured
-  Whisper.cpp (local, offline) - base model downloaded (142MB)
-  OpenAI Whisper API - configured with existing API key
-  Google Speech-to-Text - configured with existing API key
- ⚠️ Azure STT - optional (not configured)
- ⚠️ Deepgram - optional (not configured)

## New Files
- `docs/STT_SETUP.md` - Complete Windows STT setup guide
- `test_stt_live.cpp` - Test tool for all 5 STT engines
- `create_test_audio_simple.py` - Generate test audio (440Hz tone, 16kHz WAV)
- `create_test_audio.py` - Generate speech audio (requires gtts)
- `models/ggml-base.bin` - Whisper.cpp base model (gitignored)
- `test_audio.wav` - Generated test audio (gitignored)

## Documentation
- Complete setup guide for all STT engines
- API key configuration instructions
- Model download links and recommendations
- Troubleshooting section
- Cost comparison for cloud APIs

## Next Steps
- Compile test_stt_live.cpp to validate all engines
- Test with real audio input
- Integrate into VoiceModule via pub/sub

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-30 17:12:37 +08:00