Stars
Community recipes for serving LLMs on RTX 3090. Multi-engine (vLLM, llama.cpp, SGLang) and model-agnostic. Currently shipping Qwen3.6-27B configs for 1× and 2× cards.
A general fine-tuning kit geared toward image/video/audio diffusion models.
likelovewant / ollama-for-amd
Forked from ollama/ollamaGet up and running with Llama 3, Mistral, Gemma, and other large language models.by adding more amd gpu support.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Vulkan compute tool for testing video memory stability
Dead simple FLUX LoRA training UI with LOW VRAM support
The ultimate training toolkit for finetuning diffusion models
Make agents prove that their code is correct.
Voice AI runtime. Local first transcription, speaker diarization, TTS, and voice cloning with an OpenAI compatible API.
mini cli search engine for your docs, knowledge bases, meeting notes, whatever. Tracking current sota approaches while being all local
A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps,, has memory, scheduled jobs, and runs dir…
The open-source reactive database for app developers
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
The Context Optimization Layer for LLM Applications
Open Source Standalone DJ Deck using an old CDJ 100 S
stackblitz-labs / bolt.diy
Forked from stackblitz/bolt.newPrompt, run, edit, and deploy full-stack web applications using any LLM you want!
A Weaviate Next.js template
The documentation repo for Weaviate Database, Cloud, Agents and much more!
From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
It's like v0 but in your Cursor/WindSurf/Cline. 21st dev Magic MCP server for working with your frontend like Magic
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
AgentKit: Build multi-agent networks in TypeScript with deterministic routing and rich tooling via MCP.
Docusaurus authentication with Keycloak
express and hono servers in javascript (typescript coming soon)




