secondvoice/sessions/2025-11-24_091652/segments/010.json
StillHammer 3ec2a8beca feat: Add session logging, input gain, and context-aware prompts
Major features:
- Session logging system with detailed segment tracking (audio files, metadata, latencies)
- Input gain control (0.5x-5.0x amplifier) with soft clipping
- Context-aware Whisper prompts using recent transcriptions
- Comprehensive segment metadata (RMS, peak, duration, timestamps)
- API latency measurements for Whisper and Claude
- Audio hash-based duplicate detection
- Hallucination filtering with detailed logging

Changes:
- Add SessionLogger class for structured session data export
- Apply input gain before VAD and denoising (not just raw input)
- Enhanced Pipeline with segment tracking and error logging
- New UI control for input gain amplifier
- Sessions saved to sessions/ directory with transcripts/ export
- Improved Whisper prompt in config.json (French instructions)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-28 12:17:21 +08:00

21 lines
437 B
JSON

{
"id": 10,
"chinese": "这些人都在干啥呢?",
"french": "Que font ces gens ?",
"audio": {
"duration_seconds": 2.250,
"rms_level": 0.0212,
"peak_level": 0.1146,
"filename": "010.opus"
},
"timestamps": {
"start": "2025-11-24T09:18:19.138",
"end": "2025-11-24T09:18:21.211"
},
"processing": {
"whisper_latency_ms": 776.3,
"claude_latency_ms": 1265.9,
"was_filtered": false
}
}