DiagramAgent — AI-Powered Architecture Diagram Generator

Generate professional cloud architecture diagrams from natural language using AI and D2 diagram-as-code.

Sample Output

Azure SQL Always On with Disaster Recovery

Prompt: "SQL Always On Availability Group on Azure with disaster recovery"

Azure Platform Architecture

Prompt: "Azure SQL HA architecture with primary and DR regions, private endpoints, and monitoring"

UI Overview

DiagramAgent uses a three-panel layout:

┌──────────────────────────────────────────────────────────────────┐
│  DiagramAgent              [Vision Refine ○]  [Model: GPT-5.2]  │
├────────────┬──────────────┬──────────────────────────────────────┤
│            │              │                                      │
│   Chat     │  D2 Code     │   Diagram Preview                    │
│   Panel    │  Editor      │   (pan/zoom/export)                  │
│            │  (Monaco)    │                                      │
│  Clarify   │              │      ┌────┐    ┌────┐    ┌────┐     │
│  Questions │  direction:  │      │ LB │───→│ VM │───→│ DB │     │
│  appear    │  right       │      └────┘    └────┘    └────┘     │
│  here      │  classes: {  │                                      │
│            │    ...       │                                      │
│ [textarea] │  }           │              [SVG] [PNG]             │
├────────────┴──────────────┴──────────────────────────────────────┤
│  Ctrl+Enter to send                                              │
└──────────────────────────────────────────────────────────────────┘

Flow:

Type a prompt in the chat panel (left)
Answer clarifying questions via clickable pills
Watch D2 code stream into the editor (center)
See the diagram render live in the preview (right)
Export as SVG or PNG, or zoom/pan to inspect

Example Prompts

Try these prompts to see what DiagramAgent can generate:

Prompt	What You Get
`SQL Always On Availability Group on Azure with disaster recovery`	Multi-region Azure architecture with AG listeners, replication, blob backup, and monitoring
`Three-tier web application on AWS with auto-scaling and CDN`	CloudFront → ALB → EC2 Auto Scaling → RDS with read replicas and ElastiCache
`Microservices architecture on Kubernetes with service mesh`	K8s cluster with Istio/Linkerd, API gateway, 4+ services, Prometheus monitoring
`Serverless event-driven architecture on AWS`	API Gateway → Lambda → DynamoDB/SQS/SNS → CloudWatch
`CI/CD pipeline with GitHub Actions, Docker, and Kubernetes`	Source → Build → Test → Container Registry → K8s Deployment
`Multi-region active-active setup on Azure`	Two regions with Traffic Manager, paired App Services, Cosmos DB geo-replication
`Data pipeline with Kafka, Spark, and Snowflake`	Producers → Kafka → Spark Streaming → Snowflake → BI dashboards
`Real-time analytics platform with Kafka and Elasticsearch`	Event ingestion → Kafka → Logstash → Elasticsearch → Kibana

Features

Natural Language → Diagram — Describe any architecture and get a styled, horizontal-layout diagram
Clarifying Questions — AI asks targeted follow-up questions before generating, with clickable option pills and "Other" freetext support
5 LLM Models — GPT-4o, GPT-5, GPT-5.2 Chat, GPT-5 Nano, o3-mini (all via Azure OpenAI)
Vision Refinement — GPT-4o evaluates the rendered diagram image and iterates up to 3 times to hit quality targets (8+/10)
Color-Coded Containers — Eraser.io-style colored boundaries: orange (access), green (network), blue (compute), pink (data), purple (ops), yellow (security), gray (platform)
Horizontal Layout — Containers ordered left-to-right: entry points → compute → data → security/monitoring
200+ Cloud Icons — AWS, Azure, GCP, Kubernetes, and general tech icons via Iconify
Split-Pane Editor — Monaco code editor + live D2 preview side by side
Streaming Generation — Watch diagram code appear in real-time
Export — Download diagrams as SVG or PNG
Pan & Zoom — Interactive diagram viewer with mouse drag and scroll zoom
Iterative Refinement — Modify existing diagrams with follow-up prompts

Quick Start

Prerequisites

Node.js 18+
Azure OpenAI resource with models deployed (gpt-4o required for vision; others optional)
Azure CLI logged in (az login) — uses DefaultAzureCredential for auth

Setup

# Clone
git clone https://github.com/outbackops/DiagramAgent.git
cd DiagramAgent

# Install dependencies
npm install

# Configure environment
cp .env.local.example .env.local
# Edit .env.local with your Azure OpenAI endpoint

Environment Variables

Create .env.local:

# Azure OpenAI endpoint (required)
AZURE_AI_FOUNDRY_ENDPOINT=https://your-resource.openai.azure.com

# API key (optional — DefaultAzureCredential is used by default)
AZURE_AI_FOUNDRY_API_KEY=your-key-if-needed

# Default model for diagram generation
AZURE_AI_FOUNDRY_MODEL=gpt-5.2-chat

# Role-based model overrides (optional)
# MODEL_GENERATOR=gpt-5.2-chat
# MODEL_CLARIFIER=gpt-5.2-chat
# MODEL_PLANNER=gpt-5.2-chat
# MODEL_JUDGE=gpt-4o

Note: The model selector in the UI controls the generator role only. Clarifier, planner, and judge models are configured via the MODEL_* env vars above. The judge defaults to gpt-4o (vision-capable) for diagram quality assessment.

Note: Key-based auth may be disabled on your Azure resource. The app uses DefaultAzureCredential from @azure/identity, which works with Azure CLI login, managed identities, and environment credentials.

Run

npm run dev

Open http://localhost:3000.

How It Works

User enters a prompt (e.g., "SQL Always On Availability Group on Azure with disaster recovery")
Clarify API generates 4-7 targeted questions with clickable options (powered by GPT-4o)
User answers by clicking option pills — selecting "Other" reveals a freetext input
Enhanced prompt is built from the original request + user selections
LLM streams valid D2 code with color-coded container classes, proper nesting, and full dot-path connections
D2 WASM engine renders it as SVG with dagre layout
Vision refinement (optional): SVG is converted to PNG via sharp, sent to GPT-4o for assessment — if score < 8/10, the feedback is sent back to the LLM for up to 3 refinement rounds
User can edit D2 code in the Monaco editor, modify via follow-up prompts, or export as SVG/PNG

Tech Stack

Layer	Technology
Framework	Next.js 16 (App Router, TypeScript)
Styling	Tailwind CSS 4
Diagram Engine	D2 via WASM (`@terrastruct/d2`)
Code Editor	Monaco Editor (`@monaco-editor/react`)
LLM Backend	Azure OpenAI (chat completions, streaming)
Vision Assessment	GPT-4o with image input (SVG→PNG via `sharp`)
Auth	`@azure/identity` DefaultAzureCredential
Icons	Iconify CDN + Azure icon collection (200+ icons)

Project Structure

src/
├── app/
│   ├── api/
│   │   ├── assess/route.ts      # Vision-based diagram quality assessment
│   │   ├── clarify/route.ts     # Clarifying questions generation
│   │   ├── generate/route.ts    # LLM streaming D2 code generation
│   │   ├── models/route.ts      # Available models list
│   │   └── render/route.ts      # D2 WASM rendering to SVG
│   ├── globals.css
│   ├── layout.tsx
│   └── page.tsx                 # Main app — chat, editor, preview
├── components/
│   ├── ClarifyPanel.tsx         # Clickable question pills + Other freetext
│   ├── CodeEditor.tsx           # Monaco D2 editor (read-only during generation)
│   ├── D2Renderer.tsx           # SVG viewer with pan/zoom/export
│   └── PromptInput.tsx          # Chat panel with message history
└── lib/
    ├── azure-auth.ts            # Shared Azure credential + token caching
    ├── icon-registry.ts         # 200+ icon key → URL mappings
    ├── models.ts                # Model configs (tokens, temperature, streaming)
    └── system-prompt.ts         # D2 generation prompt with styling + layout rules

Supported Models

Model	Description	Temperature
GPT-5.2 Chat	Best architecture reasoning (default)	Default only
GPT-5	Good balance of speed and quality	Default only
GPT-4o	Fast and reliable, supports vision	Configurable
GPT-5 Nano	Quick iterations	Default only
o3-mini	Deep reasoning, slower	Not supported

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
.github/workflows		.github/workflows
public		public
samples		samples
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DiagramAgent — AI-Powered Architecture Diagram Generator

Sample Output

Azure SQL Always On with Disaster Recovery

Azure Platform Architecture

UI Overview

Example Prompts

Features

Quick Start

Prerequisites

Setup

Environment Variables

Run

How It Works

Tech Stack

Project Structure

Supported Models

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DiagramAgent — AI-Powered Architecture Diagram Generator

Sample Output

Azure SQL Always On with Disaster Recovery

Azure Platform Architecture

UI Overview

Example Prompts

Features

Quick Start

Prerequisites

Setup

Environment Variables

Run

How It Works

Tech Stack

Project Structure

Supported Models

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages