Major features: - Session logging system with detailed segment tracking (audio files, metadata, latencies) - Input gain control (0.5x-5.0x amplifier) with soft clipping - Context-aware Whisper prompts using recent transcriptions - Comprehensive segment metadata (RMS, peak, duration, timestamps) - API latency measurements for Whisper and Claude - Audio hash-based duplicate detection - Hallucination filtering with detailed logging Changes: - Add SessionLogger class for structured session data export - Apply input gain before VAD and denoising (not just raw input) - Enhanced Pipeline with segment tracking and error logging - New UI control for input gain amplifier - Sessions saved to sessions/ directory with transcripts/ export - Improved Whisper prompt in config.json (French instructions) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
21 lines
438 B
JSON
21 lines
438 B
JSON
{
|
|
"id": 16,
|
|
"chinese": "然后在这里边。",
|
|
"french": "Ensuite à l'intérieur.",
|
|
"audio": {
|
|
"duration_seconds": 1.710,
|
|
"rms_level": 0.0174,
|
|
"peak_level": 0.1387,
|
|
"filename": "016.opus"
|
|
},
|
|
"timestamps": {
|
|
"start": "2025-11-24T09:18:32.295",
|
|
"end": "2025-11-24T09:18:36.587"
|
|
},
|
|
"processing": {
|
|
"whisper_latency_ms": 951.6,
|
|
"claude_latency_ms": 1446.2,
|
|
"was_filtered": false
|
|
}
|
|
}
|