Stars
AI agents that automatically run research on single-GPU nanochat training
A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail, and other messaging apps, has memory, scheduled jobs, and runs dir…
Solutions to Reinforcement Learning: An Introduction
🤖FFPA: Extends FlashAttention-2 with Split-D and ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.
LongAttn: Selecting Long-context Training Data via Token-level Attention
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
Flash Attention in 300-500 lines of CUDA/C++
Minimal yet performant LLM examples in pure JAX
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
Code release for paper "Test-Time Training Done Right"
An efficient implementation of the NSA (Native Sparse Attention) kernel
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
The official implementation for [NeurIPS 2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
Awesome LLM pre-training resources, including data, frameworks, and methods.