feat(app): add VAD preprocessing via FluidAudio Silero v6 #76

Merged
pasrom merged 1 commit into main from feat/vad-preprocessing on Apr 1, 2026

@pasrom commented Apr 1, 2026

Summary

  • Add Voice Activity Detection (VAD) using FluidAudio's Silero v6 model (CoreML/ANE)
  • Remove silence before transcription to improve accuracy and speed on recordings with significant pauses
  • New types: SpeechRegion, VadSegmentMap (pure, testable), FluidVAD (wrapper)
  • Settings: vadEnabled (default off), vadThreshold (0.3–0.9 slider)
  • Pipeline: single-source path preprocesses with VAD when enabled

Inspired by @execsumo's work in #70.

Test plan

  • 6 unit tests for VadSegmentMap (empty, single, multi, remap, extract, duration)
  • Build passes
  • Lint clean (0 violations)
  • Manual: enable VAD in Settings, record a meeting, verify transcript quality

@github-actions bot added the enhancement label Apr 1, 2026
Voice Activity Detection removes silence before transcription, improving
accuracy and speed for recordings with significant pauses. Uses FluidAudio's
Silero VAD v6 model (CoreML/ANE) to detect speech regions, then:
- Extracts speech-only audio for transcription
- Remaps timestamps back to original timeline
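
The two steps above (extract speech-only audio, then remap timestamps) can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: the type names `SpeechRegion` and `VadSegmentMap` come from the description, but every field and method name here (`start`, `end`, `extractSpeech`, `toOriginalTime`, `trimmedDuration`) is hypothetical, as is the assumption that regions are expressed in seconds, sorted, and non-overlapping.

```swift
/// A span of detected speech on the original timeline, in seconds.
/// (Field names are assumptions made for this sketch.)
struct SpeechRegion {
    let start: Double
    let end: Double
    var duration: Double { end - start }
}

/// Maps between the original timeline and the speech-only timeline.
/// Assumes `regions` is sorted by start time and non-overlapping.
struct VadSegmentMap {
    let regions: [SpeechRegion]

    /// Total length of the speech-only audio, in seconds.
    var trimmedDuration: Double {
        regions.reduce(0) { $0 + $1.duration }
    }

    /// Concatenates the samples that fall inside each speech region.
    func extractSpeech(from samples: [Float], sampleRate: Double) -> [Float] {
        var out: [Float] = []
        for region in regions {
            // Clamp indices so a region that overruns the buffer is safe.
            let lo = max(0, Int((region.start * sampleRate).rounded()))
            let hi = min(samples.count, Int((region.end * sampleRate).rounded()))
            if lo < hi { out.append(contentsOf: samples[lo..<hi]) }
        }
        return out
    }

    /// Converts a timestamp on the speech-only timeline back to the original one.
    func toOriginalTime(_ trimmedTime: Double) -> Double {
        let t = max(0, trimmedTime)
        var offset = 0.0  // where the current region starts on the trimmed timeline
        for region in regions {
            if t < offset + region.duration {
                return region.start + (t - offset)
            }
            offset += region.duration
        }
        // Past the last region: clamp to its end. With no regions, return the input.
        return regions.last?.end ?? trimmedTime
    }
}

// Usage: speech at 1–3 s and 5–6 s of the original recording.
let map = VadSegmentMap(regions: [
    SpeechRegion(start: 1.0, end: 3.0),
    SpeechRegion(start: 5.0, end: 6.0),
])
let original = map.toOriginalTime(2.5)  // falls 0.5 s into the second region
```

Keeping the map a plain value type with no dependency on the VAD model is what would make it "pure, testable" in the sense the description mentions: the remapping can be checked with hand-written regions and no audio.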

New types: SpeechRegion, VadSegmentMap (pure, testable), FluidVAD (wrapper).
Settings: vadEnabled (default off), vadThreshold (0.3–0.9 slider).
Pipeline: single-source path preprocesses with VAD when enabled.

Inspired by @execsumo's work in #70.
@pasrom force-pushed the feat/vad-preprocessing branch from 4c68383 to bc161ae April 1, 2026 07:50
@pasrom merged commit 74ca85f into main Apr 1, 2026
7 checks passed
@pasrom deleted the feat/vad-preprocessing branch April 1, 2026 08:00