Shanghai Jiao Tong University · Shanghai
Stars
[DEPRECATED] Moved to the ROCm/rocm-libraries repo. NOTE: the develop branch is maintained as a read-only mirror.
[ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
Supercharge Your LLM with the Fastest KV Cache Layer
Turn any glasses into AI-powered smart glasses
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Codebase for the Recognize Anything Model (RAM)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning
Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding
Primarily a collection of knowledge and interview questions for large language model (LLM) algorithm/application engineers.
[ICCV 2025] StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition
[CVPR 2025] Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)