secondvoice/sessions/2025-11-24_091652/segments/018.json
StillHammer 3ec2a8beca feat: Add session logging, input gain, and context-aware prompts
Major features:
- Session logging system with detailed segment tracking (audio files, metadata, latencies)
- Input gain control (0.5x-5.0x amplifier) with soft clipping
- Context-aware Whisper prompts using recent transcriptions
- Comprehensive segment metadata (RMS, peak, duration, timestamps)
- API latency measurements for Whisper and Claude
- Audio hash-based duplicate detection
- Hallucination filtering with detailed logging

Changes:
- Add SessionLogger class for structured session data export
- Apply input gain before VAD and denoising (not just raw input)
- Enhanced Pipeline with segment tracking and error logging
- New UI control for input gain amplifier
- Sessions saved to sessions/ directory with transcripts/ export
- Improved Whisper prompt in config.json (French instructions)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-28 12:17:21 +08:00

21 lines
478 B
JSON

{
"id": 18,
"chinese": "外面的样子,实际上没有。",
"french": "L'apparence extérieure n'existe en réalité pas.",
"audio": {
"duration_seconds": 3.720,
"rms_level": 0.0202,
"peak_level": 0.1103,
"filename": "018.opus"
},
"timestamps": {
"start": "2025-11-24T09:18:38.208",
"end": "2025-11-24T09:18:41.532"
},
"processing": {
"whisper_latency_ms": 1292.5,
"claude_latency_ms": 1465.7,
"was_filtered": false
}
}