Skip to content
View Zkli-hub's full-sized avatar
  • Shanghai Jiao Tong University
  • Shanghai
  • 13:42 (UTC -12:00)

Block or report Zkli-hub

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror

C++ 524 277 Updated Mar 18, 2026

[ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding

Python 31 Updated Jan 27, 2026

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

880 37 Updated Mar 10, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 7,716 1,026 Updated Mar 18, 2026
Python 153 23 Updated Oct 9, 2024

Turn any glasses into AI-powered smart glasses

C 3,954 514 Updated Sep 22, 2025

Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference

Python 245 17 Updated Feb 3, 2026

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)

Python 431 55 Updated Oct 29, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 883 110 Updated Jan 28, 2026

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,466 304 Updated Jul 17, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,786 318 Updated Mar 12, 2026

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,867 1,010 Updated Aug 12, 2024

Codebase for the Recognize Anything Model (RAM)

Jupyter Notebook 88 6 Updated Dec 11, 2023

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,581 2,748 Updated Aug 12, 2024

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,468 89 Updated Jun 26, 2025

Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)

Python 21 1 Updated Jul 16, 2025

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

Python 151 7 Updated May 16, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 13,331 1,320 Updated Apr 30, 2025

[ICCV 2025] StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition

Python 63 3 Updated Jun 25, 2025

[CVPR 2025]Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Python 169 10 Updated Mar 23, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,142 9,399 Updated Nov 12, 2025

The Westermo network traffic dataset

26 4 Updated Apr 18, 2023

Network-centric WiFi Contact Tracing

Python 15 5 Updated Jan 29, 2022

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,114 140 Updated Dec 20, 2025

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 165,503 15,047 Updated Mar 19, 2026

A simple and effective LLM pruning approach.

Python 856 123 Updated Aug 9, 2024

[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

Python 116 12 Updated May 24, 2024

Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)

Python 55 1 Updated Apr 15, 2024
Next