Highlights
- Pro
Stars
Warcraft III Peon voice notifications (+ more!) for Claude Code, Codex, IDEs, and any AI agent. Stop babysitting your terminal. Employ a Peon today.
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
NextPlaid, ColGREP: Multi-vector search, from database to coding agents.
Nearly Inference Free Embeddings: make your RAG queries 500x faster
Highly Performant, Modular, Memory Safe and Production-ready Inference, Ingestion and Indexing built in Rust 🦀
TF-ID: Table/Figure IDentifier for academic papers
Move and resize windows on macOS with keyboard shortcuts and snap areas
Github homepage releases feed is always broken - so I created my own
Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
The training codes of Jasper-Token-Compression-600M
A modern static site generator by the Material for MkDocs team
Automating Releases via SemVer and Commit Message Conventions
A deep dive into the ACPI.sys DPC latency problems on Asus ROG laptops
Backported and experimental type hints for Python
A massively multilingual modern encoder language model
FastStream is a powerful and easy-to-use asynchronous Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
The simplest way to serve AI/ML models in production
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
Create web-based user interfaces with Python. The nice way.
Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)
Robust and fast topic models with sentence-transformers.
Make beautiful isometric infrastructure diagrams
Open-source vector similarity search for Postgres






