secondvoice/sessions/2025-11-24_091652/segments/011.json
StillHammer 3ec2a8beca feat: Add session logging, input gain, and context-aware prompts
Major features:
- Session logging system with detailed segment tracking (audio files, metadata, latencies)
- Input gain control (0.5x-5.0x amplifier) with soft clipping
- Context-aware Whisper prompts using recent transcriptions
- Comprehensive segment metadata (RMS, peak, duration, timestamps)
- API latency measurements for Whisper and Claude
- Audio hash-based duplicate detection
- Hallucination filtering with detailed logging

Changes:
- Add SessionLogger class for structured session data export
- Apply input gain before VAD and denoising (not just raw input)
- Enhanced Pipeline with segment tracking and error logging
- New UI control for input gain amplifier
- Sessions saved to sessions/ directory with transcripts/ export
- Improved Whisper prompt in config.json (French instructions)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-28 12:17:21 +08:00

{
  "id": 11,
  "chinese": "那是一个比较古朴的。",
  "french": "C'est quelque chose de plutôt ancien et traditionnel.",
  "audio": {
    "duration_seconds": 1.410,
    "rms_level": 0.0132,
    "peak_level": 0.0778,
    "filename": "011.opus"
  },
  "timestamps": {
    "start": "2025-11-24T09:18:20.551",
    "end": "2025-11-24T09:18:23.207"
  },
  "processing": {
    "whisper_latency_ms": 749.3,
    "claude_latency_ms": 1225.4,
    "was_filtered": false
  }
}
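
A segment record like the one above can be read back and summarized with a few lines of Python. This is a minimal sketch and not part of the repository: the helper name `summarize_segment` and the derived field names are hypothetical, and it assumes only the keys visible in this file. The source does not say how `timestamps` relates to `audio.duration_seconds`, so the sketch reports both values side by side without interpreting the difference.

```python
import json
from datetime import datetime


def summarize_segment(record: dict) -> dict:
    """Derive a few convenience values from one segment record.

    Hypothetical helper: field names follow the JSON shown above,
    but nothing here is taken from the project's own code.
    """
    # Timestamps are ISO 8601 strings with millisecond precision.
    start = datetime.fromisoformat(record["timestamps"]["start"])
    end = datetime.fromisoformat(record["timestamps"]["end"])
    processing = record["processing"]
    return {
        "id": record["id"],
        # Span between the logged start and end timestamps.
        "timestamp_span_seconds": (end - start).total_seconds(),
        # Duration as recorded in the audio metadata.
        "audio_seconds": record["audio"]["duration_seconds"],
        # Sum of the two logged API latencies.
        "api_latency_ms": processing["whisper_latency_ms"]
        + processing["claude_latency_ms"],
        # True when the segment was not dropped by filtering.
        "kept": not processing["was_filtered"],
    }


def load_segment(path: str) -> dict:
    """Read one segment file; UTF-8 is needed for the Chinese text."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)
```

Calling `summarize_segment(load_segment("011.json"))` from inside the `segments/` directory would return the summary for this segment; the relative path is an assumption about where the script runs.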