Skip to content

BerthalonLucas/whisper_app

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Whisper Voice Transcription App

A real-time voice transcription desktop application powered by faster-whisper (large-v3 model). Record audio with a hotkey and have it automatically transcribed and pasted at your cursor position.

Features

  • Real-time voice recording — toggle recording with a keyboard shortcut
  • Automatic transcription — uses the Whisper large-v3 model via faster-whisper
  • Auto-paste — transcribed text is automatically inserted at your cursor position
  • GPU acceleration — automatically detects and uses CUDA when available
  • Visual indicators — on-screen overlays show recording and transcription status
  • Accent support — full support for accented characters (French, etc.)
  • Re-inject — re-paste the last transcription with a shortcut

Requirements

  • Python 3.9+
  • Windows (uses Windows-specific GUI overlays and hotkeys)
  • A microphone
  • (Optional) NVIDIA GPU with CUDA for faster transcription

Installation

  1. Clone the repository:

    git clone https://github.com/BerthalonLucas/whisper_app.git
    cd whisper_app
  2. Create and activate a virtual environment:

    python -m venv venv
    venv\Scripts\activate
  3. Install dependencies:

    pip install -r requirements.txt
  4. (Optional) Install CUDA support for GPU acceleration — install PyTorch with CUDA or ensure ctranslate2 can detect your GPU.

Usage

Run the application

python app.py

Or use the provided batch file on Windows:

launch_whisper.bat

Keyboard shortcuts

Shortcut Action
Ctrl + F9 Start / Stop recording
Ctrl + Shift + F9 Re-paste the last transcription
Ctrl + F10 Quit the application

Transcribe an audio file

You can also transcribe an existing audio file from the command line:

python transcribe.py path/to/audio.wav

How it works

  1. Press Ctrl + F9 to start recording — a red ● REC indicator appears on screen.
  2. Press Ctrl + F9 again to stop — the audio is sent to the Whisper model for transcription.
  3. A blue progress overlay shows transcription status.
  4. Once complete, the transcribed text is automatically pasted at your current cursor position.

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors