Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
-
Updated
Feb 16, 2026 - Swift
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
Swift library for Speaker Embedding extraction and verification using NVIDIA NeMo TitaNet model converted to CoreML. Extract 192-dim speaker embeddings, verify speakers, and perform real-time speaker diarization on iOS/macOS.
Add a description, image, and links to the speaker-embedding topic page so that developers can more easily learn about it.
To associate your repository with the speaker-embedding topic, visit your repo's landing page and select "manage topics."