secondvoice/sessions/2025-11-24_091652/segments/014.json
StillHammer 3ec2a8beca feat: Add session logging, input gain, and context-aware prompts
Major features:
- Session logging system with detailed segment tracking (audio files, metadata, latencies)
- Input gain control (0.5x-5.0x amplifier) with soft clipping
- Context-aware Whisper prompts using recent transcriptions
- Comprehensive segment metadata (RMS, peak, duration, timestamps)
- API latency measurements for Whisper and Claude
- Audio hash-based duplicate detection
- Hallucination filtering with detailed logging

Changes:
- Add SessionLogger class for structured session data export
- Apply input gain before VAD and denoising (not just raw input)
- Enhanced Pipeline with segment tracking and error logging
- New UI control for input gain amplifier
- Sessions saved to sessions/ directory with transcripts/ export
- Improved Whisper prompt in config.json (French instructions)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-28 12:17:21 +08:00

21 lines
465 B
JSON

{
"id": 14,
"chinese": "这些人都在干啥呢?",
"french": "Qu'est-ce que ces gens sont en train de faire ?",
"audio": {
"duration_seconds": 2.120,
"rms_level": 0.0138,
"peak_level": 0.1027,
"filename": "014.opus"
},
"timestamps": {
"start": "2025-11-24T09:18:28.246",
"end": "2025-11-24T09:18:32.130"
},
"processing": {
"whisper_latency_ms": 959.4,
"claude_latency_ms": 1242.9,
"was_filtered": false
}
}