secondvoice/sessions/2025-11-24_091652/segments/003.json
StillHammer 3ec2a8beca feat: Add session logging, input gain, and context-aware prompts
Major features:
- Session logging system with detailed segment tracking (audio files, metadata, latencies)
- Input gain control (0.5x-5.0x amplifier) with soft clipping
- Context-aware Whisper prompts using recent transcriptions
- Comprehensive segment metadata (RMS, peak, duration, timestamps)
- API latency measurements for Whisper and Claude
- Audio hash-based duplicate detection
- Hallucination filtering with detailed logging
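
A minimal sketch of the input gain and level metrics listed above. The commit does not say which soft-clipping curve is used, so `tanh` here is an assumption, and the function names `apply_gain_soft_clip`, `rms_level`, and `peak_level` are hypothetical, not the project's API. Samples are assumed to be floats normalized to [-1.0, 1.0].

```python
import math


def apply_gain_soft_clip(samples, gain):
    """Multiply samples by `gain`, then soft-clip with tanh (assumed curve).

    tanh keeps every output strictly inside (-1, 1) and is close to linear
    for small inputs, so quiet audio is amplified almost unchanged.
    """
    if not 0.5 <= gain <= 5.0:
        # Range taken from the commit message (0.5x-5.0x).
        raise ValueError("gain must be between 0.5 and 5.0")
    return [math.tanh(s * gain) for s in samples]


def rms_level(samples):
    """Root-mean-square level of the samples; 0.0 for an empty segment."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def peak_level(samples):
    """Largest absolute sample value; 0.0 for an empty segment."""
    return max((abs(s) for s in samples), default=0.0)
```

Applying the gain first and measuring afterwards mirrors the "apply input gain before VAD and denoising" note in the change list, although the real pipeline order may differ in detail.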

Changes:
- Add SessionLogger class for structured session data export
- Apply input gain before VAD and denoising (not just raw input)
- Enhance Pipeline with segment tracking and error logging
- Add UI control for input gain amplifier
- Save sessions to sessions/ directory with transcripts/ export
- Improve Whisper prompt in config.json (French instructions)
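
The structured export described above could look roughly like the sketch below. It is not the actual SessionLogger class: `write_segment` is a hypothetical helper, and the zero-padded file name is only inferred from the `segments/003.json` path of this file.

```python
import json
from pathlib import Path


def write_segment(session_dir, segment):
    """Write one segment dict to <session_dir>/segments/NNN.json.

    The three-digit, zero-padded name is an assumption based on the
    path of the file shown on this page.
    """
    seg_dir = Path(session_dir) / "segments"
    seg_dir.mkdir(parents=True, exist_ok=True)
    path = seg_dir / f"{segment['id']:03d}.json"
    # ensure_ascii=False keeps the Chinese and French text readable on disk.
    path.write_text(
        json.dumps(segment, ensure_ascii=False, indent=2),
        encoding="utf-8",
    )
    return path
```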

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-28 12:17:21 +08:00

{
    "id": 3,
    "chinese": "那么我们也去不花生。声音能够量就那么大。",
    "french": "Alors nous n'allons pas non plus aux cacahuètes. Le volume sonore ne peut être que si grand.",
    "audio": {
        "duration_seconds": 5.040,
        "rms_level": 0.0465,
        "peak_level": 0.2801,
        "filename": "003.opus"
    },
    "timestamps": {
        "start": "2025-11-24T09:17:56.893",
        "end": "2025-11-24T09:17:59.404"
    },
    "processing": {
        "whisper_latency_ms": 867.1,
        "claude_latency_ms": 1576.8,
        "was_filtered": false
    }
}
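
A short sketch of how a segment file like this one might be inspected after loading it with `json.load`. `summarize_segment` is a hypothetical helper, not part of the project. It assumes the two timestamps are ISO 8601 strings without a timezone, as in this file.

```python
from datetime import datetime


def summarize_segment(segment):
    """Derive a few totals from one parsed segment dict."""
    proc = segment["processing"]
    ts = segment["timestamps"]
    start = datetime.fromisoformat(ts["start"])
    end = datetime.fromisoformat(ts["end"])
    return {
        # Combined API time for transcription plus translation.
        "total_latency_ms": round(
            proc["whisper_latency_ms"] + proc["claude_latency_ms"], 1
        ),
        # Wall-clock span between the two recorded timestamps.
        "timestamp_span_seconds": round((end - start).total_seconds(), 3),
        # Audio length as recorded in the file, for comparison.
        "duration_seconds": segment["audio"]["duration_seconds"],
    }
```

For this segment the timestamp span comes to 2.511 s while `duration_seconds` is 5.040. The file does not explain the difference, so the two fields may measure different things.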