ChatCrystal

Crystallize knowledge from your AI conversations


ChatCrystal collects conversations from AI coding tools (Claude Code, Cursor, Codex CLI), uses an LLM to distill them into searchable, structured notes, and builds your personal knowledge base — all running locally.

Screenshots: Conversations, Notes, Semantic Search, Knowledge Graph

Features

  • Multi-source ingestion — Auto-imports conversations from Claude Code, Codex CLI, and Cursor with real-time file watching
  • LLM summarization — Distills conversations into structured notes (title, summary, key conclusions, code snippets, tags) via Vercel AI SDK
  • Semantic search — Embedding-powered vector search (vectra) with relation-aware result expansion
  • Knowledge graph — LLM-discovered relationships (causal, dependency, similarity, etc.) with force-directed visualization
  • Conversation viewer — Markdown rendering, code highlighting, collapsible tool calls, noise filtering
  • Multi-provider support — Ollama, OpenAI, Anthropic, Google AI, Azure OpenAI, or any OpenAI-compatible API, switchable at runtime
  • Task queue — Batch summarization/embedding via p-queue with real-time progress tracking and cancellation
  • Desktop app — Electron with system tray, minimize-to-tray on close
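The batch summarization/embedding pipeline above boils down to a bounded-concurrency queue with cancellation. The sketch below is illustrative and only mimics what p-queue provides; it is not ChatCrystal's actual queue code:

```typescript
// Illustrative bounded-concurrency task queue with cancellation,
// approximating what p-queue provides for batch summarization/embedding.
class TinyQueue {
  private running = 0;
  private pending: Array<() => void> = [];
  private cancelled = false;

  constructor(private readonly concurrency: number) {}

  // Enqueue a task; it starts immediately if a slot is free, otherwise waits.
  add<T>(task: () => Promise<T>): Promise<T | undefined> {
    return new Promise<T | undefined>((resolve, reject) => {
      const run = async (): Promise<void> => {
        if (this.cancelled) {
          resolve(undefined); // queue was cancelled before this task started
          return;
        }
        this.running++;
        try {
          resolve(await task());
        } catch (err) {
          reject(err);
        } finally {
          this.running--;
          this.pending.shift()?.(); // start the next waiting task, if any
        }
      };
      if (this.running < this.concurrency) void run();
      else this.pending.push(run);
    });
  }

  // Drop all queued (not yet started) tasks; running tasks finish normally.
  cancel(): void {
    this.cancelled = true;
    const dropped = this.pending;
    this.pending = [];
    dropped.forEach((start) => start()); // each resolves to undefined
  }
}
```

Real batch jobs would also report progress per task, which is what the real-time progress tracking in the UI surfaces.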

Tech Stack

| Layer | Technology |
| --- | --- |
| Backend | Node.js + Fastify v5 + TypeScript |
| Frontend | Vite v8 + React 19 + Tailwind CSS v4 + TanStack React Query v5 |
| Desktop | Electron + electron-builder (NSIS installer) |
| Database | sql.js (WASM SQLite) |
| LLM | Vercel AI SDK v6 — Ollama / OpenAI / Anthropic / Google / Azure / Custom |
| Embedding | vectra vector index, multi-provider |
| File watching | chokidar |

Getting Started

Prerequisites

  • Node.js >= 20
  • An LLM service (pick one):
    • Ollama (local inference, free)
    • OpenAI / Anthropic / Google AI API key
    • Any OpenAI-compatible service (OpenRouter, Poe, etc.)

If using Ollama, pull the required models:

ollama pull qwen2.5:7b          # LLM summarization
ollama pull nomic-embed-text     # Embedding

Installation

git clone https://github.com/ZengLiangYi/ChatCrystal.git
cd ChatCrystal
npm install
cp .env.example .env             # Edit configuration as needed

Desktop App (Recommended)

npm run dev:electron             # Dev mode (Electron + Vite HMR)
npm run build:electron           # Build NSIS installer → release/

The installer is in the release/ directory. Data is stored in %APPDATA%/chatcrystal/data/.

Web Dev Mode

npm run dev                      # Starts backend (3721) + frontend (13721)

Visit http://localhost:13721

Web Production Mode

npm run build                    # Build backend + frontend
npm start                        # Start server (frontend served statically)

Visit http://localhost:3721

Workflow

  1. Click "Import" in the sidebar to scan Claude Code / Codex CLI / Cursor conversations
  2. Browse imported conversations on the Conversations page
  3. Click "Summarize" or use "Batch Summarize" to distill conversations into notes
  4. Search your knowledge on the Search page; enable "Expand related notes" to follow relation edges
  5. Explore note relationships on the Graph page (force-directed, draggable, zoomable)
  6. Filter and browse all notes by tag on the Notes page
  7. Switch LLM/Embedding providers and models on the Settings page

Configuration

Configure via the .env file or the Settings page (hot-swappable at runtime):

# Server port
PORT=3721

# Data sources
CLAUDE_PROJECTS_DIR=~/.claude/projects
CODEX_SESSIONS_DIR=~/.codex/sessions
# CURSOR_DATA_DIR=          # Auto-detected per platform, can override

# LLM summarization (ollama/openai/anthropic/google/azure/custom)
LLM_PROVIDER=ollama
LLM_BASE_URL=http://localhost:11434
LLM_MODEL=qwen2.5:7b

# Embedding (ollama/openai/google/azure/custom)
EMBEDDING_PROVIDER=ollama
EMBEDDING_BASE_URL=http://localhost:11434
EMBEDDING_MODEL=nomic-embed-text

Note: the LLM and embedding providers are configured separately. Semantic search requires a dedicated embedding model that supports the /v1/embeddings endpoint; large language models (Claude, GPT-4, Qwen, etc.) cannot serve as embedding models. Common embedding models:

| Provider | Models |
| --- | --- |
| Ollama (local) | nomic-embed-text, mxbai-embed-large |
| OpenAI | text-embedding-3-small, text-embedding-3-large |
| Google | text-embedding-004 |
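For intuition, semantic search reduces to comparing the query's embedding vector against each note's embedding. The sketch below assumes cosine similarity (a common default for vector indexes such as vectra); it is illustrative, not ChatCrystal's actual search code:

```typescript
// Cosine similarity between two embedding vectors of equal length.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Rank note IDs by similarity of their embeddings to the query embedding.
function rank(query: number[], notes: Map<string, number[]>): string[] {
  return [...notes.entries()]
    .map(([id, vec]) => ({ id, score: cosineSimilarity(query, vec) }))
    .sort((x, y) => y.score - x.score)
    .map((r) => r.id);
}
```

Relation-aware expansion then walks the relation edges of the top-ranked notes to pull in connected results.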

Provider Configuration Examples

# OpenAI
LLM_PROVIDER=openai
LLM_API_KEY=sk-...
LLM_MODEL=gpt-4o

# Anthropic
LLM_PROVIDER=anthropic
LLM_API_KEY=sk-ant-...
LLM_MODEL=claude-sonnet-4-20250514

# Google AI
LLM_PROVIDER=google
LLM_API_KEY=AIza...
LLM_MODEL=gemini-2.0-flash

# OpenAI-compatible service (Poe / OpenRouter / etc.)
LLM_PROVIDER=custom
LLM_BASE_URL=https://openrouter.ai/api/v1
LLM_API_KEY=your-key
LLM_MODEL=anthropic/claude-sonnet-4

Project Structure

ChatCrystal/
├── electron/                # Electron main process (window, tray, lifecycle)
├── shared/types/            # Shared TypeScript types
├── server/src/
│   ├── db/                  # SQLite schema + utilities
│   ├── parser/              # Plugin-based conversation parsers (Claude Code / Codex / Cursor)
│   ├── services/            # Import, summarization, LLM, embedding, relations, providers
│   ├── routes/              # Fastify API routes
│   ├── watcher/             # chokidar file watching
│   └── queue/               # p-queue task queue + TaskTracker
├── client/src/
│   ├── pages/               # Page components (Dashboard, Conversations, Notes, Search, Graph, Settings)
│   ├── components/          # Shared components (StatusBar, ActivityPanel, etc.)
│   ├── hooks/               # React Query hooks
│   ├── themes/              # Theme definitions
│   └── providers/           # ThemeProvider
├── scripts/                 # Legacy tray scripts (superseded by Electron)
├── electron-builder.yml     # Electron packaging config
└── data/                    # Runtime data (gitignored)

Adding Data Sources

Implement the SourceAdapter interface to add a new AI tool:

interface SourceAdapter {
  name: string;
  displayName: string;
  detect(): Promise<SourceInfo | null>;
  scan(): Promise<ConversationMeta[]>;
  parse(meta: ConversationMeta): Promise<ParsedConversation>;
}

Built-in adapters:

| Adapter | Data Source | Format |
| --- | --- | --- |
| claude-code | ~/.claude/projects/**/*.jsonl | JSONL conversation log |
| codex | ~/.codex/sessions/**/rollout-*.jsonl | JSONL event stream |
| cursor | Cursor workspaceStorage/state.vscdb | SQLite KV store |

Create a new adapter file in server/src/parser/adapters/ and register it in parser/index.ts.
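A skeleton adapter might look like the following. Note that the SourceInfo, ConversationMeta, and ParsedConversation shapes below are simplified stand-ins for illustration, not the project's actual type definitions:

```typescript
// Simplified stand-in types (the real definitions live in shared/types/).
interface SourceInfo { name: string; path: string }
interface ConversationMeta { id: string; sourcePath: string }
interface ParsedConversation {
  id: string;
  messages: { role: string; content: string }[];
}

interface SourceAdapter {
  name: string;
  displayName: string;
  detect(): Promise<SourceInfo | null>;
  scan(): Promise<ConversationMeta[]>;
  parse(meta: ConversationMeta): Promise<ParsedConversation>;
}

// Hypothetical adapter for a fictional "Example Tool".
class ExampleToolAdapter implements SourceAdapter {
  name = 'example-tool';
  displayName = 'Example Tool';

  // Return null when the tool's data directory is absent, so import skips it.
  async detect(): Promise<SourceInfo | null> {
    return null; // a real adapter would probe the filesystem here
  }

  // Enumerate conversation files and return lightweight metadata for each.
  async scan(): Promise<ConversationMeta[]> {
    return [];
  }

  // Read one conversation file and normalize it into messages.
  async parse(meta: ConversationMeta): Promise<ParsedConversation> {
    return { id: meta.id, messages: [] };
  }
}
```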

API

| Method | Path | Description |
| --- | --- | --- |
| GET | /api/status | Server status + statistics |
| GET | /api/config | Current configuration (secrets redacted) |
| POST | /api/config | Update provider configuration |
| POST | /api/config/test | Test LLM connection |
| GET | /api/providers | Available provider list |
| POST | /api/import/scan | Trigger full scan import |
| GET | /api/conversations | Conversation list (filterable, paginated) |
| GET | /api/conversations/:id | Conversation detail + messages |
| POST | /api/conversations/:id/summarize | Generate summary for one conversation |
| POST | /api/summarize/batch | Batch summarization |
| POST | /api/summarize/reset-errors | Reset error status |
| GET | /api/notes | Note list |
| GET | /api/notes/:id | Note detail |
| POST | /api/notes/:id/embed | Generate embedding |
| POST | /api/embeddings/batch | Batch embedding generation |
| GET | /api/search?q=...&expand=true | Semantic search (expand follows relation edges) |
| GET | /api/notes/:id/relations | Note relations list |
| POST | /api/notes/:id/relations | Create relation manually |
| DELETE | /api/relations/:id | Delete relation |
| POST | /api/notes/:id/discover-relations | LLM auto-discover relations |
| POST | /api/relations/batch-discover | Batch relation discovery |
| GET | /api/relations/graph | Knowledge graph data (nodes + edges) |
| GET | /api/tags | Tag list |
| GET | /api/queue/status | Queue status |
| POST | /api/queue/cancel | Cancel queued tasks |
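As an illustration, the search endpoint can be called with a plain HTTP GET. The helper below only builds the request URL (the response shape is not documented here), and the base URL assumes the default web port from the configuration section:

```typescript
// Build the URL for GET /api/search?q=...&expand=true.
// URLSearchParams handles percent-/plus-encoding of the query text.
function searchUrl(base: string, q: string, expand = false): string {
  const params = new URLSearchParams({ q });
  if (expand) params.set('expand', 'true');
  return `${base}/api/search?${params.toString()}`;
}

// Example: searchUrl('http://localhost:3721', 'react hooks', true)
```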

Knowledge Graph

After generating note summaries, the LLM automatically analyzes relationships between notes. Supported relation types:

| Relation | Meaning | Example |
| --- | --- | --- |
| CAUSED_BY | Causation | Login failure ← token expiration logic bug |
| LEADS_TO | Leads to | Route refactor → page flicker bug |
| RESOLVED_BY | Resolved by | Memory leak → added cleanup function |
| SIMILAR_TO | Similar topic | Two conversations both discussing deployment |
| CONTRADICTS | Contradiction | "Use Redux" vs. "Context is enough" |
| DEPENDS_ON | Dependency | New feature depends on auth middleware refactor |
| EXTENDS | Extension | Adds an eviction policy on top of a caching solution |
| REFERENCES | Reference | Conversation mentions a previous architecture decision |

Related notes appear at the bottom of the note detail page, with support for AI discovery, manual addition, and search-to-link. Browse the entire knowledge network via the force-directed graph on the Graph page.
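For reference, the relation names in the table can be captured as a TypeScript union. Whether ChatCrystal stores exactly these strings, and these field names, internally is an assumption made for illustration:

```typescript
// Relation names mirroring the table above (assumed storage values).
type RelationType =
  | 'CAUSED_BY' | 'LEADS_TO' | 'RESOLVED_BY' | 'SIMILAR_TO'
  | 'CONTRADICTS' | 'DEPENDS_ON' | 'EXTENDS' | 'REFERENCES';

// Hypothetical edge shape for the knowledge graph.
interface NoteRelation {
  sourceNoteId: string;
  targetNoteId: string;
  type: RelationType;
}

const example: NoteRelation = {
  sourceNoteId: 'note-1',
  targetNoteId: 'note-2',
  type: 'RESOLVED_BY',
};
```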

FAQ

Semantic search returns 500 "Not Found"

Embedding model misconfigured. Make sure EMBEDDING_MODEL is a dedicated embedding model (e.g., nomic-embed-text), not a large language model (e.g., claude-haiku, qwen2.5). LLMs do not support the /v1/embeddings endpoint.

Can't connect to Ollama on startup

Make sure Ollama is running and the models are pulled:

ollama pull qwen2.5:7b
ollama pull nomic-embed-text

No conversations after import

Check that CLAUDE_PROJECTS_DIR in .env points to the correct path and contains .jsonl files.

Knowledge graph is empty

You need to generate notes first, then click "Discover" on a note detail page, or use POST /api/relations/batch-discover for batch discovery.

License

MIT
