Major features: - Session logging system with detailed segment tracking (audio files, metadata, latencies) - Input gain control (0.5x-5.0x amplifier) with soft clipping - Context-aware Whisper prompts using recent transcriptions - Comprehensive segment metadata (RMS, peak, duration, timestamps) - API latency measurements for Whisper and Claude - Audio hash-based duplicate detection - Hallucination filtering with detailed logging Changes: - Add SessionLogger class for structured session data export - Apply input gain before VAD and denoising (not just raw input) - Enhanced Pipeline with segment tracking and error logging - New UI control for input gain amplifier - Sessions saved to sessions/ directory with transcripts/ export - Improved Whisper prompt in config.json (French instructions) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
21 lines
434 B
JSON
21 lines
434 B
JSON
{
|
|
"id": 21,
|
|
"chinese": "然后来看一下。",
|
|
"french": "Ensuite, viens voir.",
|
|
"audio": {
|
|
"duration_seconds": 2.110,
|
|
"rms_level": 0.0259,
|
|
"peak_level": 0.1535,
|
|
"filename": "021.opus"
|
|
},
|
|
"timestamps": {
|
|
"start": "2025-11-24T09:18:45.067",
|
|
"end": "2025-11-24T09:18:47.886"
|
|
},
|
|
"processing": {
|
|
"whisper_latency_ms": 794.9,
|
|
"claude_latency_ms": 1126.7,
|
|
"was_filtered": false
|
|
}
|
|
}
|