Major features:
- Session logging system with detailed segment tracking (audio files, metadata, latencies)
- Input gain control (0.5x-5.0x amplifier) with soft clipping
- Context-aware Whisper prompts using recent transcriptions
- Comprehensive segment metadata (RMS, peak, duration, timestamps)
- API latency measurements for Whisper and Claude
- Audio hash-based duplicate detection
- Hallucination filtering with detailed logging

Changes:
- Add SessionLogger class for structured session data export
- Apply input gain before VAD and denoising (not just raw input)
- Enhanced Pipeline with segment tracking and error logging
- New UI control for input gain amplifier
- Sessions saved to sessions/ directory with transcripts/ export
- Improved Whisper prompt in config.json (French instructions)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
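The commit message names an input gain stage (0.5x-5.0x) with soft clipping and per-segment RMS and peak levels, but does not show how they are computed. The following is a minimal sketch, not the project's code: it assumes float samples in the range [-1, 1] and a tanh-based soft clipper, and the names `apply_gain`, `rms_level`, and `peak_level` are hypothetical.

```python
import math
from typing import List, Sequence

MIN_GAIN = 0.5  # lower bound of the gain range named in the commit message
MAX_GAIN = 5.0  # upper bound of the gain range named in the commit message


def apply_gain(samples: Sequence[float], gain: float) -> List[float]:
    """Amplify samples, then soft-clip with tanh (assumed curve) to stay within [-1, 1]."""
    gain = min(max(gain, MIN_GAIN), MAX_GAIN)  # clamp to the supported range
    return [math.tanh(s * gain) for s in samples]


def rms_level(samples: Sequence[float]) -> float:
    """Root-mean-square level of the segment; 0.0 for an empty segment."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def peak_level(samples: Sequence[float]) -> float:
    """Largest absolute sample value; 0.0 for an empty segment."""
    return max((abs(s) for s in samples), default=0.0)
```

Unlike a hard clip, tanh compresses loud samples smoothly instead of flattening them, at the cost of slightly attenuating quiet ones. If the gain is applied before VAD and denoising, as the change list states, the RMS and peak values would presumably be measured on the amplified signal, though the commit message does not confirm this.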
21 lines
420 B
JSON
{
  "id": 15,
  "chinese": "假装会。",
  "french": "Faire semblant.",
  "audio": {
    "duration_seconds": 1.760,
    "rms_level": 0.0177,
    "peak_level": 0.1595,
    "filename": "015.opus"
  },
  "timestamps": {
    "start": "2025-11-24T09:18:30.523",
    "end": "2025-11-24T09:18:34.165"
  },
  "processing": {
    "whisper_latency_ms": 913.2,
    "claude_latency_ms": 1098.1,
    "was_filtered": false
  }
}
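A segment record like the one above can be read back with the standard library to compare its timing fields. This is a sketch under assumptions: `summarize_segment` is a hypothetical helper, the timestamps are treated as naive ISO 8601 values, and what `start` and `end` actually bracket (capture only, or capture plus processing) is not stated in the source.

```python
import json
from datetime import datetime

# Record copied from the segment file shown above.
SEGMENT_JSON = """
{
  "id": 15,
  "chinese": "假装会。",
  "french": "Faire semblant.",
  "audio": {
    "duration_seconds": 1.760,
    "rms_level": 0.0177,
    "peak_level": 0.1595,
    "filename": "015.opus"
  },
  "timestamps": {
    "start": "2025-11-24T09:18:30.523",
    "end": "2025-11-24T09:18:34.165"
  },
  "processing": {
    "whisper_latency_ms": 913.2,
    "claude_latency_ms": 1098.1,
    "was_filtered": false
  }
}
"""


def summarize_segment(record: dict) -> dict:
    """Derive timing figures from one segment record (hypothetical helper)."""
    start = datetime.fromisoformat(record["timestamps"]["start"])
    end = datetime.fromisoformat(record["timestamps"]["end"])
    processing = record["processing"]
    return {
        "id": record["id"],
        # Elapsed time between the two recorded timestamps.
        "wall_clock_seconds": (end - start).total_seconds(),
        # Length of the stored audio file.
        "audio_seconds": record["audio"]["duration_seconds"],
        # Sum of the two measured API latencies.
        "api_latency_ms": processing["whisper_latency_ms"]
        + processing["claude_latency_ms"],
        "was_filtered": processing["was_filtered"],
    }


summary = summarize_segment(json.loads(SEGMENT_JSON))
```

For this record the timestamps span about 3.64 seconds while the audio lasts 1.76 seconds, and the two API calls together take roughly 2.0 seconds, so the interval may include processing time; the source does not confirm that interpretation.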