View kkoutini's full-sized avatar


Generative models for conditional audio generation

Python 3,619 421 Updated Feb 14, 2026

An Audio Language model for Audio Tasks

Python 319 16 Updated Apr 19, 2024

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 3 2 Updated Nov 8, 2023

Official Jax Implementation of MaskGIT

Jupyter Notebook 554 53 Updated Nov 18, 2022

Open reproduction of MUSE for fast text2image generation.

Python 359 31 Updated Jun 1, 2024

music generation with masked transformers!

Max 350 42 Updated May 16, 2025

Unofficial PyTorch implementation of Discrete Denoising Diffusion Probabilistic Model (D3PM)

Python 46 3 Updated Apr 8, 2023

A curated list of awesome self-supervised methods

6,367 837 Updated Feb 24, 2026

Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)

C++ 10,274 1,993 Updated Mar 7, 2026

This repository contains metadata for the WavCaps dataset and code for downstream tasks.

Python 257 11 Updated Jul 25, 2024

A feature-rich command-line audio/video downloader

Python 150,216 12,177 Updated Mar 3, 2026

Development repository for the Triton language and compiler

MLIR 18,587 2,646 Updated Mar 8, 2026

Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.

Python 32 2 Updated Jun 23, 2023

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 331 56 Updated Nov 20, 2024

Train ImageNet *fast* in 500 lines of code with FFCV

Python 149 33 Updated May 10, 2024
Python 19 6 Updated Jul 15, 2022

Objective C educational compiler

C++ 3 Updated Nov 2, 2014

This repository contains the code of the distribution shift framework presented in A Fine-Grained Analysis on Distribution Shift (Wiles et al., 2022).

Python 86 8 Updated Feb 20, 2026

Python bindings for minimp3

C 17 1 Updated Sep 11, 2023

Inference code for PaSST, using the HEAR API.

Python 32 15 Updated Jan 2, 2024

PyTorch domain adaptation package

Python 10 1 Updated Mar 1, 2023

Efficient Training of Audio Transformers with Patchout

Python 370 59 Updated Jan 12, 2024

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Python 199 30 Updated Dec 22, 2022

PyHessian is a PyTorch library for second-order analysis and training of neural networks

Jupyter Notebook 778 125 Updated Jul 10, 2025

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,260 12,831 Updated Mar 4, 2026

CP-JKU submission to DCASE 2021

4 Updated Jun 29, 2021

A curated list of neural network pruning resources.

2,491 332 Updated Apr 4, 2024

Web-based dashboard for Sacred

JavaScript 548 43 Updated Feb 1, 2023