(ACL 2025 Main) A Comprehensive Benchmark for Code Information Retrieval.
-
Updated
Jun 30, 2025 - Python
(ACL 2025 Main) A Comprehensive Benchmark for Code Information Retrieval.
State aware knowledge compression, ingestion, and hybrid retrieval engine. Zero dependencies. Sub-100ms queries.
🌈 Paper-implementations in Code Search (Baseline).
Deterministic method-first retrieval for AI coding agents.
Semantic Language-Indexed Code Extraction with Backward Slicing for Repository-Scale Code Generation
Adaptive q-log BM25 for code retrieval under fixed generic tokenization
A hpc LLVM Pass extracting semantic Control-Data Flow Graphs (CDFG) from Intermediate Representation for Graph Neural Networks. Enables cross-language code retrieval and clone detection beyond token-based approaches.
A production-grade LLM context compression and retrieval engine built entirely on the Python standard library. It solves the single biggest bottleneck in LLM agent effectiveness: context window waste.
Agentic code retrieval benchmark for coding agents
Add a description, image, and links to the code-retrieval topic page so that developers can more easily learn about it.
To associate your repository with the code-retrieval topic, visit your repo's landing page and select "manage topics."