Popular repositories Loading
-
exllamav3
exllamav3 PublicForked from turboderp-org/exllamav3
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
Python
-
exllamav2
exllamav2 PublicForked from turboderp-org/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
Python
-
-
tabbyAPI
tabbyAPI PublicForked from theroyallab/tabbyAPI
The official API server for Exllama. OAI compatible, lightweight, and fast.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.