semantic-cache

Here are 45 public repositories matching this topic...

codefuse-ai / ModelCache

A LLM semantic caching system aiming to enhance user experience by reducing response time via cached query-result pairs.

llm semantic-cache

Updated Jun 30, 2025
Python

redis / redis-vl-python

Star

Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.

python search redis openai embedding redis-search vector-search huggingface vector-database large-language-models llm anthropic semantic-cache retrieval-augmented-generation llmcache

Updated May 14, 2026
Python

peva3 / SmarterRouter

Star

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

docker self-hosted model-serving gpu-monitoring fastapi llm openai-proxy semantic-cache local-llm ollama llm-proxy ollama-api ai-gateway llm-router self-hosted-ai ai-cache

Updated May 10, 2026
Python

vcache-project / vCache

Star

Reliable and Efficient Semantic Prompt Caching with vCache

Updated Dec 17, 2025
Python

redis-developer / adk-redis

Star

Redis integration for Google Agent Development Kit (ADK) - Memory, Sessions, Search Tools, MCP

memory semantic-search long-term-memory vector-search hybrid-search semantic-cache mcp-server agent-memory session-memory redis-agent-memory

Updated May 20, 2026
Python

zakariaf / RAG-Cache

Star

High-performance LLM query cache with semantic search. Reduce API costs 80% and latency from 8.5s to 1ms using Redis + Qdrant vector DB. Multi-provider support (OpenAI, Anthropic).

redis embeddings openai cost-optimization rag fastapi vector-database qdrant semantic-cache llm-caching

Updated Dec 2, 2025
Python

jonathanscholtes / LLM-Performance-with-Azure-Cosmos-DB-Semantic-Cache

Star

Enhance LLM retrieval performance with Azure Cosmos DB Semantic Cache. Learn how to integrate and optimize caching strategies in real-world web applications.

vector-search azurecosmosdb semantic-cache

Updated Mar 22, 2024
Python

yastman / rag

Star

AI real-estate automation platform: Telegram bot, RAG, apartment search, CRM workflows, voice agent, Langfuse observability, and Dockerized AI runtime.

Updated May 25, 2026
Python

mar1boroman / redis-movies-gen-ai

Star

Redis Vector Similarity Search, Semantic Caching, Recommendation Systems and RAG

redis vector vector-search llm semantic-cache redis-vector-search retrieval-augmented-generation dalle-3

Updated Apr 3, 2024
Python

Das-rebel / adaptive-memory-multi-model-router

Star

🔀 One prompt in. The right model out. Open-source LLM router with 100% routing accuracy, 47+ providers. Budget enforcement, semantic cache, intelligent failover. Zero ML, 19.5KB. MIT.

Updated May 25, 2026
Python

mar1boroman / ask-redis-blogs

Star

A ChatBot using Redis Vector Similarity Search, which can recommend blogs based on user prompt

python redis chatbot vector-search vector-database sentence-transformers huggingface-transformers large-language-models llm generative-ai semantic-cache redis-vector-search llmcache

Updated Sep 30, 2023
Python

benitomartin / semantic-caching-qdrant-splade

Star

Optimized RAG Retrieval with Indexing, Quantization, Hybrid Search and Caching

quantization hnsw huggingface hybrid-search large-language-models splade qdrant-vector-database semantic-cache retrieval-augmented-generation

Updated Nov 6, 2024
Python

GabinVr / MobilityCopilot

Star

Multi-agent LangGraph assistant for Montreal urban mobility — RAG, semantic caching, and a predictive ML collision model on FastAPI.

python machine-learning assistant multi-agent copilot mobility rag fastapi ai-agent llm semantic-cache chromadb langgraph

Updated Apr 16, 2026
Python

munimx / recallm

Star

Semantic cache layer for LLM APIs — embed prompts locally, find near-matches, skip redundant LLM calls.

python embeddings openai llm semantic-cache

Updated Mar 10, 2026
Python

cnomic-dev / semantic-translator-architecture

Star

A universal open protocol for LLM semantic caching and cross-platform alignment (v0.1). High-efficiency semantic hashing based on S³ topology.

python ai topology edge-computing open-protocol semantic-cache

Updated Apr 3, 2026
Python

awesome-pro / smartmemo

Sponsor

Star

Semantic memory and caching for LLM agents with classifier-validated equivalence instead of naive cosine thresholds.

python machine-learning sqlite pytorch embeddings developer-tools ai-agents cost-optimization faiss vector-search sentence-transformers semantic-memory llm llmops semantic-cache semantic-caching

Updated May 20, 2026
Python

nethra0906 / ClusterCache

Star

A semantic search system using vector embeddings, fuzzy clustering, FAISS indexing, and a custom semantic cache with a FastAPI service.

nlp machine-learning semantic-search fuzzy-clustering faiss fastapi vector-database sentence-transformers semantic-cache

Updated Mar 8, 2026
Python

obielin / llm-cache

Star

SQLite-backed LLM response cache. Exact match + fuzzy match. Decorator API. Zero mandatory server dependencies.

python sqlite cache decorator llm cost-reduction anthropic semantic-cache

Updated Apr 12, 2026
Python

soneylegal / vortex

Star

Orquestrador de agentes RAG corretivo (CRAG) para resolução de problemas de TI com rastreamento LangGraph, FastAPI, ChromaDB e OpenTelemetry/Phoenix.

python docker asyncio semantic-search observability rag fastapi opentelemetry vector-database sentence-transformers crag llm langchain semantic-cache chromadb langgraph arize-phoenix agentic-ai

Updated May 20, 2026
Python

vinaybudideti / intent-atoms

Star

Sub-query level semantic caching for LLM APIs — 3-tier hybrid engine with FAISS vector search. 87.5% cache hit rate, 71.8% cost savings on 100 real API calls.

react python caching cost-optimization faiss fastapi vector-search llm semantic-cache intent-decomposition

Updated Mar 4, 2026
Python

Improve this page

Add a description, image, and links to the semantic-cache topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the semantic-cache topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

semantic-cache

Here are 45 public repositories matching this topic...

codefuse-ai / ModelCache

redis / redis-vl-python

peva3 / SmarterRouter

vcache-project / vCache

redis-developer / adk-redis

zakariaf / RAG-Cache

jonathanscholtes / LLM-Performance-with-Azure-Cosmos-DB-Semantic-Cache

yastman / rag

mar1boroman / redis-movies-gen-ai

Das-rebel / adaptive-memory-multi-model-router

mar1boroman / ask-redis-blogs

benitomartin / semantic-caching-qdrant-splade

GabinVr / MobilityCopilot

munimx / recallm

cnomic-dev / semantic-translator-architecture

awesome-pro / smartmemo

nethra0906 / ClusterCache

obielin / llm-cache

soneylegal / vortex

vinaybudideti / intent-atoms

Improve this page

Add this topic to your repo