vithursant

🎯

Focusing

Vithu Thangarasa vithursant

🎯

Focusing

Principal Research Scientist @Cerebras

88 followers · 14 following

@Cerebras
Toronto, ON Canada
vithursant.com

Achievements

x2 x2

Achievements

x2 x2

Organizations

Stars

SAI-Lab-NYU / CodeQuant

This repository provides the official implementation of CodeQuant (ICLR, 2026), a unified clustering and quantization framework for Mixture-of-Experts (MoE) Large Language Models (LLMs), addressing…

Python 5 Updated Mar 2, 2026

raullenchai / awesome-mlx

Forked from antranapp/awesome-mlx

A curated list of awesome projects, tools, and resources for Apple MLX — the ML framework for Apple Silicon

485 46 Updated Apr 11, 2026

LengSicong / MMR1

[CVPR 2026] MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Python 218 9 Updated Sep 26, 2025

0xSero / reap-mlx

REAP expert pruning for MoE LLMs on Apple Silicon via MLX

TypeScript 55 5 Updated Mar 16, 2026

SAI-Lab-NYU / DREAM

Python 8 4 Updated Oct 13, 2025

CerebrasResearch / reap

REAP: Router-weighted Expert Activation Pruning for SMoE compression

Python 358 66 Updated Apr 17, 2026

vincentamato / mlx-coconut

An MLX port of Meta's Coconut reasoning model

Python 16 1 Updated Sep 2, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,599 489 Updated Apr 8, 2026

uclaml / SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,240 105 Updated May 8, 2024

microsoft / wina

ICLR 2026

Python 157 23 Updated Apr 8, 2026

mlcommons / ailuminate

The AILuminate v1.1 benchmark suite is an AI risk assessment benchmark developed with broad involvement from leading AI companies, academia, and civil society.

78 19 Updated Jun 11, 2025

deepspeedai / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,246 365 Updated Aug 14, 2025

MoonshotAI / MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 2,119 147 Updated Apr 3, 2025

algorithmicsuperintelligence / optillm

Optimizing inference proxy for LLMs

Python 3,781 312 Updated May 7, 2026

deepseek-ai / DeepSeek-V3

Python 103,535 16,743 Updated Aug 28, 2025

zoom / Zoom-Chat-Neural-Search-Assistant-Sample

A Zoom Team Chat bot that combines Cerebras' Llama 3.1-8b model with Exa search capabilities to provide intelligent responses. The bot can search for current information when needed and maintain co…

JavaScript 3 Updated Oct 25, 2024