Skip to content
View eonglints's full-sized avatar

Block or report eonglints

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,857 470 Updated Oct 14, 2025

logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.

Python 38 3 Updated Jun 24, 2025

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Python 902 77 Updated Nov 4, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 39,958 4,368 Updated May 8, 2026

🧹 Python package for text cleaning

Python 1,012 82 Updated Jan 28, 2026

AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)

Python 105 10 Updated Dec 8, 2025

A browser extension that enhance search engines with ChatGPT

TypeScript 590 64 Updated Jul 3, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,789 810 Updated Mar 25, 2026

Machine Learning Engineering Open Book

Python 17,877 1,138 Updated Mar 16, 2026

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,602 272 Updated Dec 14, 2025

simple trainer for musicgen/audiocraft

Python 15 2 Updated Jul 14, 2023

Manage audio and video datasets

Python 36 3 Updated Apr 16, 2026

Pitch Estimating Neural Networks (PENN)

Python 273 26 Updated Apr 2, 2025

A Python toolbox for speech features extraction

Python 165 25 Updated Feb 8, 2023

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,292 266 Updated Sep 6, 2023

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,268 219 Updated Apr 13, 2026

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,621 280 Updated Jan 12, 2025

Audio generation using diffusion models, in PyTorch.

Python 2,103 179 Updated Jun 12, 2023

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,950 356 Updated Jan 4, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 99,099 12,156 Updated Apr 15, 2026

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Python 400 60 Updated Oct 1, 2024

Implementation of DiffWave and SaShiMi audio generation models

Python 128 15 Updated Apr 4, 2023

Large, modern dataset for speech recognition

Shell 726 66 Updated Feb 26, 2024
Python 510 49 Updated Jun 25, 2024
Python 1,460 186 Updated Feb 11, 2024

Collection of audio-focused loss functions in PyTorch

Python 863 77 Updated Jul 30, 2024

Structured state space sequence models

Jupyter Notebook 2,893 360 Updated Jul 17, 2024

Performant and accurate speech recognition built on Pytorch

Python 254 27 Updated May 19, 2022

A library for speech data augmentation in time-domain

Python 687 60 Updated Aug 30, 2021

Library for Textless Spoken Language Processing

Python 558 57 Updated Aug 29, 2023
Next