eonglints

Follow

Dan Lyth eonglints

Follow

Research engineer working on something new. Previously leading speech research at @Stability-AI and Rockstar Games.

33 followers · 2 following

Achievements

Achievements

Stars

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,857 470 Updated Oct 14, 2025

nomonosound / log-wmse-audio-quality

logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.

Python 38 3 Updated Jun 24, 2025

criteo / autofaiss

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Python 902 77 Updated Nov 4, 2025

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 39,958 4,368 Updated May 8, 2026

jfilter / clean-text

🧹 Python package for text cleaning

Python 1,012 82 Updated Jan 28, 2026

Ashvala / AQUA-Tk

AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)

Python 105 10 Updated Dec 8, 2025

hunkimForks / chatgpt-arxiv-extension

Forked from wong2/chatgpt-google-extension

A browser extension that enhance search engines with ChatGPT

TypeScript 590 64 Updated Jul 3, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,789 810 Updated Mar 25, 2026

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 17,877 1,138 Updated Mar 16, 2026

WhisperSpeech / WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,602 272 Updated Dec 14, 2025

neverix / musicgen_trainer

Forked from Sciumo/musicgen_trainer

simple trainer for musicgen/audiocraft

Python 15 2 Updated Jul 14, 2023

audeering / audb

Manage audio and video datasets

Python 36 3 Updated Apr 16, 2026

interactiveaudiolab / penn

Pitch Estimating Neural Networks (PENN)

Python 273 26 Updated Apr 2, 2025

bootphon / shennong

A Python toolbox for speech features extraction

Python 165 25 Updated Feb 8, 2023

lucidrains / musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,292 266 Updated Sep 6, 2023

iver56 / audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,268 219 Updated Apr 13, 2026

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,621 280 Updated Jan 12, 2025

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 2,103 179 Updated Jun 12, 2023

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,950 356 Updated Jan 4, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 99,099 12,156 Updated Apr 15, 2026

bshall / hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Python 400 60 Updated Oct 1, 2024

albertfgu / diffwave-sashimi

Implementation of DiffWave and SaShiMi audio generation models

Python 128 15 Updated Apr 4, 2023

SpeechColab / GigaSpeech

Large, modern dataset for speech recognition

Shell 726 66 Updated Feb 26, 2024

qiuqiangkong / torchlibrosa

Python 510 49 Updated Jun 25, 2024

microsoft / NeuralSpeech

Python 1,460 186 Updated Feb 11, 2024

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 863 77 Updated Jul 30, 2024

state-spaces / s4

Structured state space sequence models

Jupyter Notebook 2,893 360 Updated Jul 17, 2024

neonbjb / ocotillo

Performant and accurate speech recognition built on Pytorch

Python 254 27 Updated May 19, 2022

facebookresearch / WavAugment

A library for speech data augmentation in time-domain

Python 687 60 Updated Aug 30, 2021

facebookresearch / textlesslib

Library for Textless Spoken Language Processing

Python 558 57 Updated Aug 29, 2023