Skip to content
View vithursant's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@uoguelph-mlrg @VectorInstitute @ContinualAI

Block or report vithursant

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository provides the official implementation of CodeQuant (ICLR, 2026), a unified clustering and quantization framework for Mixture-of-Experts (MoE) Large Language Models (LLMs), addressing…

Python 5 Updated Mar 2, 2026

A curated list of awesome projects, tools, and resources for Apple MLX — the ML framework for Apple Silicon

485 46 Updated Apr 11, 2026

[CVPR 2026] MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Python 218 9 Updated Sep 26, 2025

REAP expert pruning for MoE LLMs on Apple Silicon via MLX

TypeScript 55 5 Updated Mar 16, 2026
Python 8 4 Updated Oct 13, 2025

REAP: Router-weighted Expert Activation Pruning for SMoE compression

Python 358 66 Updated Apr 17, 2026

An MLX port of Meta's Coconut reasoning model

Python 16 1 Updated Sep 2, 2025

Robust recipes to align language models with human and AI preferences

Python 5,599 489 Updated Apr 8, 2026

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,240 105 Updated May 8, 2024

ICLR 2026

Python 157 23 Updated Apr 8, 2026

The AILuminate v1.1 benchmark suite is an AI risk assessment benchmark developed with broad involvement from leading AI companies, academia, and civil society.

78 19 Updated Jun 11, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,246 365 Updated Aug 14, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 2,119 147 Updated Apr 3, 2025

Optimizing inference proxy for LLMs

Python 3,781 312 Updated May 7, 2026

A Zoom Team Chat bot that combines Cerebras' Llama 3.1-8b model with Exa search capabilities to provide intelligent responses. The bot can search for current information when needed and maintain co…

JavaScript 3 Updated Oct 25, 2024

Effective LLM Alignment Toolkit

Python 154 11 Updated Jun 25, 2025

Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency

Python 25 Updated Jul 31, 2024

2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.

Jupyter Notebook 2,454 160 Updated Nov 19, 2024

LLM101n: Let's build a Storyteller

36,910 2,017 Updated Aug 1, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 24,024 4,538 Updated May 15, 2026

ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning

Python 286 54 Updated Feb 27, 2023

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,481 1,046 Updated Jul 1, 2024

torchtrail: trace the graph of torch functions and modules for visualization, reports, etc

Python 25 Updated May 25, 2025

Minimalistic large language model 3D-parallelism training

Python 2,690 307 Updated Apr 7, 2026

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 19,739 1,372 Updated May 6, 2026

Go ahead and axolotl questions

Python 11,911 1,340 Updated May 13, 2026

Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.

Python 230 31 Updated Apr 6, 2026

Port of Andrej Karpathy's nanoGPT to Apple MLX framework.

Python 120 13 Updated Feb 12, 2024
Next