---
title: GraphRAG-SDK
nav_order: 1
description: Build intelligent GraphRAG applications with FalkorDB and LLMs.
parent: GenAI Tools
redirect_from:
---
- Ingest raw documents (text, PDF, Markdown) directly into a FalkorDB knowledge graph with schema-guided entity extraction.
- Retrieve cited answers via a hybrid pipeline that combines vector search, full-text search, Cypher generation, and relationship expansion.
- Update the graph incrementally on PR merges with `apply_changes()` — re-sync individual documents without rebuilding.
- Plug in any LLM or embedder via LiteLLM (OpenAI, Azure, Anthropic, Gemini, Groq, Cohere, local models, etc.).
Resources:
- GraphRAG-SDK GitHub Repository
- Examples — runnable starters covering quick start, schemas, custom strategies, and incremental updates
- API Reference
```sh
pip install graphrag-sdk[litellm]
docker run -d -p 6379:6379 -p 3000:3000 --name falkordb falkordb/falkordb:latest
export OPENAI_API_KEY="sk-..."
```

For PDF ingestion, install the `pdf` extra instead: `pip install graphrag-sdk[litellm,pdf]`.

Or sign up for FalkorDB Cloud.
```python
import asyncio

from graphrag_sdk import GraphRAG, ConnectionConfig, LiteLLM, LiteLLMEmbedder


async def main():
    async with GraphRAG(
        connection=ConnectionConfig(host="localhost", graph_name="my_graph"),
        llm=LiteLLM(model="openai/gpt-4o-mini"),
        embedder=LiteLLMEmbedder(model="openai/text-embedding-3-large", dimensions=256),
    ) as rag:
        # Ingest raw text (or pass a file path for .md / .txt / .pdf)
        result = await rag.ingest(
            text="Alice Johnson is a software engineer at Acme Corp in London.",
            document_id="my_doc",
        )
        print(f"Nodes: {result.nodes_created}, Edges: {result.relationships_created}")

        # Finalize: deduplicate entities, backfill embeddings, build indexes
        await rag.finalize()

        # Full RAG: retrieve + generate with provenance
        answer = await rag.completion("Where does Alice work?")
        print(answer.answer)


asyncio.run(main())
```

A schema constrains which entity and relation types the extractor produces. Without one the SDK runs open-world extraction; with one you get a tighter, more queryable graph.
```python
from graphrag_sdk import GraphSchema, EntityType, RelationType

schema = GraphSchema(
    entities=[
        EntityType(label="Person", description="A human being"),
        EntityType(label="Organization", description="A company or institution"),
        EntityType(label="Location", description="A geographic location"),
    ],
    relations=[
        RelationType(label="WORKS_AT", description="Is employed by", patterns=[("Person", "Organization")]),
        RelationType(label="LOCATED_IN", description="Is situated in", patterns=[("Organization", "Location")]),
    ],
)

async with GraphRAG(
    connection=ConnectionConfig(host="localhost", graph_name="my_graph"),
    llm=LiteLLM(model="openai/gpt-4o-mini"),
    embedder=LiteLLMEmbedder(model="openai/text-embedding-3-large", dimensions=256),
    schema=schema,
) as rag:
    ...  # ingest / completion as above
```

Re-sync individual documents without rebuilding the graph. The canonical CI use case is updating the graph on PR merge — added, modified, and deleted files in one batch:
```python
async with GraphRAG(connection=ConnectionConfig(...), llm=..., embedder=...) as graph:
    result = await graph.apply_changes(
        added=["docs/new_feature.md"],
        modified=["docs/api.md"],
        deleted=["docs/removed_page.md"],
    )
    await graph.finalize()  # once per batch — finalize is O(graph size)

    # Per-file outcomes are wrapped in BatchEntry — the batch never raises.
    for entry in result.added + result.modified + result.deleted:
        if not entry.is_success:
            print(f"failed: {entry.error_type}: {entry.error}")
```

The three primitives behind the wrapper:
| Method | When to use |
|---|---|
| `update(source, document_id=...)` | Document content changed. A SHA-256 content hash short-circuits no-op updates (touch-only PRs cost ~1 Cypher query). Pass `if_missing="ingest"` for upsert semantics. |
| `delete_document(document_id)` | Document removed. Cleans up entities orphaned by the deletion; preserves entities still referenced by other documents. |
| `apply_changes(added=..., modified=..., deleted=...)` | Heterogeneous batch. Per-file errors are collected, not raised. Does not call `finalize()` — caller drives that cadence. |
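A minimal sketch of the first two primitives, assuming the file path doubles as the `document_id` (that pairing is this example's choice, not an SDK requirement):

```python
async with GraphRAG(connection=ConnectionConfig(...), llm=..., embedder=...) as graph:
    # Upsert a single changed document; if_missing="ingest" gives upsert
    # semantics when the document was never ingested before.
    await graph.update("docs/api.md", document_id="docs/api.md", if_missing="ingest")

    # Remove a document and clean up entities only it mentioned.
    await graph.delete_document("docs/removed_page.md")

    # One finalize per batch of changes (see the cost model below).
    await graph.finalize()
```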
**Cost model.** `finalize()` runs cross-document deduplication, which scans the full entity table — its cost is O(graph size), not O(change size). For CI use cases, batch all PR changes through `apply_changes()` and call `finalize()` once at the end of the run, not per file.
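As a concrete illustration of that cadence, here is a sketch of a merge-time sync job. The git-diff helper, the status-to-bucket mapping, and the graph/model names are illustrative scaffolding; only the `apply_changes()` / `finalize()` calls come from the SDK surface shown above.

```python
import asyncio
import subprocess

from graphrag_sdk import GraphRAG, ConnectionConfig, LiteLLM, LiteLLMEmbedder


def changed_docs(base: str, head: str) -> dict[str, list[str]]:
    """Bucket the PR's Markdown files into added / modified / deleted via git."""
    out = subprocess.run(
        ["git", "diff", "--name-status", f"{base}...{head}"],
        capture_output=True, text=True, check=True,
    ).stdout
    buckets: dict[str, list[str]] = {"added": [], "modified": [], "deleted": []}
    for line in out.splitlines():
        status, path = line.split(maxsplit=1)
        if not path.endswith(".md"):
            continue
        key = {"A": "added", "M": "modified", "D": "deleted"}.get(status[0])
        if key:
            buckets[key].append(path)
    return buckets


async def sync(base: str, head: str) -> None:
    changes = changed_docs(base, head)
    async with GraphRAG(
        connection=ConnectionConfig(host="localhost", graph_name="docs_graph"),
        llm=LiteLLM(model="openai/gpt-4o-mini"),
        embedder=LiteLLMEmbedder(model="openai/text-embedding-3-large", dimensions=256),
    ) as rag:
        # One batch for the whole PR ...
        await rag.apply_changes(**changes)
        # ... and one finalize for the whole run: its cost is O(graph size).
        await rag.finalize()


asyncio.run(sync("origin/main", "HEAD"))
```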
- Schema-guided extraction — Constrain the entity and relation types the LLM produces, or run open-world if you don't have a schema yet.
- Hybrid retrieval — Vector search, full-text search, Cypher generation, and relationship expansion combined in one pipeline.
- Cited answers — Every answer is traceable to its source chunks via `MENTIONS` edges; pass `return_context=True` to `completion()` to get the retrieval trail.
- Incremental updates — `apply_changes()` handles added/modified/deleted documents in one call, with crash-safe rollforward cutover.
- Multi-tenant — `graph_name` on the connection provides per-tenant isolation in a single FalkorDB instance (see the sketch after this list).
- Provider-agnostic — Any LLM or embedder reachable via LiteLLM works out of the box; custom providers plug in behind clean ABCs.
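For the multi-tenant point, a minimal sketch: only `graph_name` is SDK API here; the per-tenant naming scheme is made up for illustration.

```python
from graphrag_sdk import GraphRAG, ConnectionConfig, LiteLLM, LiteLLMEmbedder


async def answer_for_tenant(tenant_id: str, question: str) -> str:
    # One graph per tenant inside the same FalkorDB instance; the
    # "tenant_<id>" naming convention is illustrative, not an SDK requirement.
    async with GraphRAG(
        connection=ConnectionConfig(host="localhost", graph_name=f"tenant_{tenant_id}"),
        llm=LiteLLM(model="openai/gpt-4o-mini"),
        embedder=LiteLLMEmbedder(model="openai/text-embedding-3-large", dimensions=256),
    ) as rag:
        result = await rag.completion(question)
        return result.answer
```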
- Document loading: Markdown, plain text, and (with the `pdf` extra) PDF inputs go through structure-aware chunkers.
- Entity extraction: An LLM-driven extractor reads each chunk, optionally constrained by your schema.
- Resolution: Surface forms of the same entity are merged via exact-match plus optional LLM-verified resolution.
- Provenance: Chunks, entities, and relationships are connected by `PART_OF`, `NEXT_CHUNK`, and `MENTIONS` edges so every answer can be traced back to its source text (see the query sketch below).
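Because provenance lives in the graph itself, you can inspect it with a plain Cypher query. Below is a sketch using the `falkordb` Python client; the `MENTIONS` and `PART_OF` edge types come from the list above, but the traversal directions and the entity `name` property are assumptions made for illustration.

```python
from falkordb import FalkorDB

db = FalkorDB(host="localhost", port=6379)
graph = db.select_graph("my_graph")

# Which chunks mention "Alice Johnson", and which document does each chunk
# belong to? Edge directions and the "name" property are assumptions.
result = graph.query(
    """
    MATCH (chunk)-[:MENTIONS]->(entity {name: $name}),
          (chunk)-[:PART_OF]->(doc)
    RETURN doc, chunk
    """,
    {"name": "Alice Johnson"},
)
for doc, chunk in result.result_set:
    print(doc, chunk)
```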
- Vector search ranks chunks and entities by embedding similarity.
- Full-text search complements vectors with keyword precision.
- Cypher generation (experimental) lets the LLM emit graph queries directly for aggregate or structural questions.
- Relationship expansion traverses the graph from the seed hits to surface multi-hop context.
- Reranking orders the final passages by cosine similarity to the question.
- Retrieval-grounded completion combines the retrieved context with the user's question and feeds it to the LLM.
- Source attribution is built into the response — every answer points back to the chunks that supported it.
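Continuing the quick-start session above (inside the same `async with GraphRAG(...) as rag` block), a hedged sketch of asking for the retrieval trail; `return_context=True` comes from the feature list, but the exact shape of the returned context object is not pinned down on this page:

```python
# Ask for the retrieval trail alongside the answer.
result = await rag.completion("Where does Alice work?", return_context=True)
print(result.answer)

# The attribute layout of the attached context isn't documented here;
# printing the whole result object is a safe way to inspect what came back.
print(result)
```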