secondvoice/transcripts/transcript_20251124_083029.txt
StillHammer 3ec2a8beca feat: Add session logging, input gain, and context-aware prompts
Major features:
- Session logging system with detailed segment tracking (audio files, metadata, latencies)
- Input gain control (0.5x-5.0x amplifier) with soft clipping
- Context-aware Whisper prompts using recent transcriptions
- Comprehensive segment metadata (RMS, peak, duration, timestamps)
- API latency measurements for Whisper and Claude
- Audio hash-based duplicate detection
- Hallucination filtering with detailed logging
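
The input gain and segment metadata items above can be sketched as follows. This is a minimal illustration only, not the project's code: the function names are hypothetical, samples are assumed to be floats normalized to [-1, 1], and tanh is assumed as the soft-clipping curve (the commit does not say which curve is used).

```python
import math

def apply_gain_soft_clip(samples, gain):
    """Amplify samples by `gain`, then soft-clip with tanh.

    Assumption: tanh stands in for whatever soft-clipping curve the
    project actually uses. It keeps output strictly inside (-1, 1).
    """
    if not 0.5 <= gain <= 5.0:
        raise ValueError("gain outside the 0.5x-5.0x range named in the commit")
    return [math.tanh(gain * s) for s in samples]

def segment_levels(samples):
    """Return RMS and peak level for one segment (hypothetical metadata shape)."""
    if not samples:
        return {"rms": 0.0, "peak": 0.0}
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    peak = max(abs(s) for s in samples)
    return {"rms": rms, "peak": peak}
```

Note that tanh also compresses samples well below full scale, so a gain of 1.0 is not an exact pass-through in this sketch.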

Changes:
- Add SessionLogger class for structured session data export
- Apply input gain before VAD and denoising (not just raw input)
- Enhanced Pipeline with segment tracking and error logging
- New UI control for input gain amplifier
- Sessions saved to sessions/ directory with transcripts/ export
- Improved Whisper prompt in config.json (French instructions)
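
The audio hash-based duplicate detection listed under the major features might look roughly like this. It is a sketch under stated assumptions: the class name is hypothetical, segments are assumed to arrive as raw PCM bytes, and SHA-256 is assumed as the hash (the commit names neither).

```python
import hashlib

class DuplicateDetector:
    """Flags a segment whose raw bytes match one already seen (hypothetical helper)."""

    def __init__(self):
        self._seen = set()

    def is_duplicate(self, pcm_bytes):
        # Hash the raw audio bytes; identical buffers give identical digests.
        digest = hashlib.sha256(pcm_bytes).hexdigest()
        if digest in self._seen:
            return True
        self._seen.add(digest)
        return False
```

An exact hash like this only catches byte-identical audio; near-duplicates would need a different technique.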

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-28 12:17:21 +08:00


═══════════════════════════════════════════════════════════════
SecondVoice - Transcript Export
Date: 2025-11-24 08:30:29
Duration: 0:46
Segments: 3
═══════════════════════════════════════════════════════════════
───────────────────────────────────────────────────────────────
TEXTE COMPLET / FULL TEXT
───────────────────────────────────────────────────────────────
[中文 / Chinese]
我很忙。 你会说英文吗? 我也不知道。
[Français / French]
Je suis très occupé. Parles-tu anglais ? Je ne sais pas non plus.
───────────────────────────────────────────────────────────────
SEGMENTS DÉTAILLÉS / DETAILED SEGMENTS
───────────────────────────────────────────────────────────────
[Segment 1]
中文: 我很忙。
FR: Je suis très occupé.
[Segment 2]
中文: 你会说英文吗?
FR: Parles-tu anglais ?
[Segment 3]
中文: 我也不知道。
FR: Je ne sais pas non plus.
───────────────────────────────────────────────────────────────
STATISTIQUES / STATISTICS
───────────────────────────────────────────────────────────────
Audio processed: 3 seconds
Whisper API calls: 3
Claude API calls: 3
Estimated cost: $0.0033
═══════════════════════════════════════════════════════════════
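
The detailed-segments layout above is regular enough to read back programmatically. The following is a minimal sketch, assuming the "[Segment N]", "中文:" and "FR:" line prefixes shown in this export stay stable; the function name and the returned dictionary keys are hypothetical.

```python
import re

SEGMENT_HEADER = re.compile(r"^\[Segment (\d+)\]$")

def parse_segments(text):
    """Collect per-segment Chinese and French lines from a transcript export."""
    segments = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        match = SEGMENT_HEADER.match(line)
        if match:
            current = {"index": int(match.group(1)), "zh": "", "fr": ""}
            segments.append(current)
        elif line.startswith("─") or line.startswith("═"):
            # A rule line closes the current block.
            current = None
        elif current is not None and line.startswith("中文:"):
            current["zh"] = line[len("中文:"):].strip()
        elif current is not None and line.startswith("FR:"):
            current["fr"] = line[len("FR:"):].strip()
    return segments
```

Lines outside a segment block, such as the full-text and statistics sections, are ignored by this sketch.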