secondvoice

History

StillHammer 40c451b9f8 feat: Upgrade to latest Whisper API with GPT-4o models and prompting Major improvements to Whisper API integration: New Features: - Support for gpt-4o-mini-transcribe and gpt-4o-transcribe models - Prompting support for better name recognition and context - Response format configuration (text, json, verbose_json) - Stream flag prepared for future streaming implementation Configuration Updates: - Updated config.json with new Whisper parameters - Added prompt, stream, and response_format fields - Default model: gpt-4o-mini-transcribe (better quality than whisper-1) Code Changes: - Extended WhisperClient::transcribe() with new parameters - Updated Config struct to support new fields - Modified Pipeline to pass all config parameters to Whisper - Added comprehensive documentation in docs/whisper_upgrade.md Benefits: - Better transcription accuracy (~33% improvement) - Improved name recognition (Tingting, Alexis) - Context-aware transcription with prompting - Ready for future streaming and diarization Documentation: - Complete guide in docs/whisper_upgrade.md - Usage examples and best practices - Cost comparison and optimization tips - Future roadmap for Phase 2 features 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>		2025-11-20 03:34:09 +08:00
..
api	feat: Upgrade to latest Whisper API with GPT-4o models and prompting	2025-11-20 03:34:09 +08:00
audio	fix: Resolve compilation errors and build successfully	2025-11-20 03:27:18 +08:00
core	feat: Upgrade to latest Whisper API with GPT-4o models and prompting	2025-11-20 03:34:09 +08:00
ui	feat: Implement complete MVP architecture for SecondVoice	2025-11-20 03:08:03 +08:00
utils	feat: Upgrade to latest Whisper API with GPT-4o models and prompting	2025-11-20 03:34:09 +08:00
main.cpp	fix: Resolve compilation errors and build successfully	2025-11-20 03:27:18 +08:00