Record audio or transcribe files using CTranslate2 and Whisper!
Updated Apr 28, 2026 - Python
Transcribe audio and generate SRT and VTT files using Whisper models and wit.ai.
A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.
Uses the WhisperS2T and CTranslate2 libraries to batch transcribe multiple files.
faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.
Speech-to-text GUI for various models (mostly Whisper, also Voxtral) and backends, including whisper.cpp, mlx-whisper, faster-whisper, and CTranslate2; uses pyannote for diarization.
An offline CPU-first low-resource chat application to perform RAG on your corpus of data. Powered by OpenChat and CTranslate2.
Taiwanese Hokkien (Taigi) speech-to-text transcriber - MediaTek Breeze-ASR-26 with faster-whisper, tuned for RTX 3050 4GB low-VRAM GPUs. Gradio UI, CLI, Docker, SRT/VTT/TXT/JSON.
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
Local, real-time AI translator for language immersion. Filters English, translates everything else.
A high-performance Docker container that runs OpenAI's Whisper model. Optimized for CPU, Intel NPU, Intel Arc/iGPU, and NVIDIA CUDA GPUs.
Basic vector database plugin that works with KoboldAI. Adds documents, images, and audio files.
Lightweight Windows screen recorder with built-in live transcription. Captures screen, mic, and system audio (Google Meet, Zoom). Produces an MP4 with embedded subtitles. Runs fully offline via faster-whisper.
An audio summarizer (faster-whisper and BART glued together)
🎮 METranslator: A privacy-focused, offline AI translation tool for games and visual novels. Supports OPUS-MT, MADLAD-400, and mBART-50 with GPU acceleration.
Production-grade Traditional Chinese / Taiwan Mandarin speech-to-text. Qwen3-ASR + MediaTek Breeze-ASR-25, hot-word injection, LLM polish, speaker diarization. RTF up to 1554x on RTX 5090, 56 TDD tests.
Docker - Faster Whisper FR - RunPod Serverless API
POKAD - Persian Offline Knowledge AI & Digital assistant - LLM chatbot with web search, translation, and IoT support
AI-powered multilingual video subtitle generator. Transcribe with Whisper (100+ languages), translate with NLLB-200 (200+ languages), and burn the subtitles into the video. Auto-installer with GPU support.
A recursive document translator that leverages argostranslate/CTranslate2 and uses CUDA/MPS acceleration when available.
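Several of the projects listed above turn transcription segments into SRT caption files. As a rough illustration of that step only, the sketch below formats `(start, end, text)` tuples as SRT text. The function names `srt_timestamp` and `segments_to_srt` and the tuple layout are assumptions made for this example; they are not taken from any of the listed repositories or from the CTranslate2 or faster-whisper APIs.

```python
from typing import Iterable, Tuple

# Hypothetical segment shape for this sketch: (start_seconds, end_seconds, text).
Segment = Tuple[float, float, str]


def srt_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, total_ms = divmod(total_ms, 3_600_000)
    minutes, total_ms = divmod(total_ms, 60_000)
    secs, millis = divmod(total_ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments: Iterable[Segment]) -> str:
    """Render segments as SRT cues, numbered from 1 and separated by blank lines."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [(0.0, 1.25, " Hello "), (1.25, 3.0, "world")]
    print(segments_to_srt(demo))
```

In practice the tuples would come from whatever transcription backend a project uses; the formatting step itself does not depend on the model.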