Whisper Voice Transcription App

A real-time voice transcription desktop application powered by faster-whisper (large-v3 model). Record audio with a hotkey and have it automatically transcribed and pasted at your cursor position.

Features

Real-time voice recording — toggle recording with a keyboard shortcut
Automatic transcription — uses the Whisper large-v3 model via faster-whisper
Auto-paste — transcribed text is automatically inserted at your cursor position
GPU acceleration — automatically detects and uses CUDA when available
Visual indicators — on-screen overlays show recording and transcription status
Accent support — full support for accented characters (French, etc.)
Re-inject — re-paste the last transcription with a shortcut

Requirements

Python 3.9+
Windows (uses Windows-specific GUI overlays and hotkeys)
A microphone
(Optional) NVIDIA GPU with CUDA for faster transcription

Installation

Clone the repository:

git clone https://github.com/BerthalonLucas/whisper_app.git
cd whisper_app

Create and activate a virtual environment:
```
python -m venv venv
venv\Scripts\activate
```
Install dependencies:
```
pip install -r requirements.txt
```
(Optional) Install CUDA support for GPU acceleration — install PyTorch with CUDA or ensure ctranslate2 can detect your GPU.

Usage

Run the application

python app.py

Or use the provided batch file on Windows:

launch_whisper.bat

Keyboard shortcuts

Shortcut	Action
`Ctrl + F9`	Start / Stop recording
`Ctrl + Shift + F9`	Re-paste the last transcription
`Ctrl + F10`	Quit the application

Transcribe an audio file

You can also transcribe an existing audio file from the command line:

python transcribe.py path/to/audio.wav

How it works

Press Ctrl + F9 to start recording — a red ● REC indicator appears on screen.
Press Ctrl + F9 again to stop — the audio is sent to the Whisper model for transcription.
A blue progress overlay shows transcription status.
Once complete, the transcribed text is automatically pasted at your current cursor position.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.claude		.claude
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
launch_whisper.bat		launch_whisper.bat
requirements.txt		requirements.txt
transcribe.py		transcribe.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Whisper Voice Transcription App

Features

Requirements

Installation

Usage

Run the application

Keyboard shortcuts

Transcribe an audio file

How it works

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Whisper Voice Transcription App

Features

Requirements

Installation

Usage

Run the application

Keyboard shortcuts

Transcribe an audio file

How it works

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages