Skip to content
View tohskai's full-sized avatar

Block or report tohskai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MoE training for Me and You and maybe other people

Python 380 33 Updated Mar 15, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,762 509 Updated Mar 13, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 611 65 Updated Mar 17, 2026
Python 7 Updated Dec 19, 2025

NVIDIA cuTile learn

Python 165 2 Updated Dec 9, 2025

Cuda extensions for PyTorch

Cuda 12 2 Updated Dec 2, 2025

Utilities for writing performant, readable Triton and Gluon kernels

Python 3 1 Updated Jan 5, 2026

Unofficial description of the CUDA assembly (SASS) instruction sets.

Python 205 19 Updated Jul 18, 2025

All material for CS140E, winter 2023.

C 95 40 Updated Mar 12, 2024
Python 7 Updated Feb 18, 2026

Groebner bases in (almost) pure Julia

Julia 75 14 Updated Mar 3, 2026

LauzHack Deep Learning Bootcamp

Jupyter Notebook 125 22 Updated Jul 19, 2025