CodeSure

Privacy-first MCP security scanner for AI-generated code. Runs as a stdio MCP server — your AI coding assistant calls it directly via standard input/output. No HTTP, no ports, no network. 100% local execution.

Install

Run once. CodeSure auto-detects your AI coding tools and registers itself:

npx -y codesure install

🔍 CodeSure — Auto-install MCP

  ✅ Claude Code: codesure added
  ✅ Codex: codesure added
  ⏭  Opencode: already configured
  ✅ Claude Desktop: codesure added
  ✅ Cursor: codesure added

✨ Done! Restart your MCP client to activate CodeSure.

Restart your client. scan_code, scan_package, and other tools appear automatically.

Supported clients: Claude Code · Codex · Opencode · Claude Desktop · Cursor · VS Code (Copilot)

How It Works

CodeSure is a stdio MCP server. Your AI client spawns it as a child process and communicates via JSON-RPC over stdin/stdout. No HTTP server, no open ports, no network listeners — the process exists only while your client is running.

npx codesure install patches your client's config file to register codesure. Clients not installed are silently skipped. Already-configured clients are never overwritten.

Client	Config patched
Claude Code	`~/.claude.json` → `mcpServers`
Codex	`codex mcp add codesure` (CLI)
Opencode	`~/.config/opencode/config.json` → `mcp`
Claude Desktop	`~/Library/.../Claude/claude_desktop_config.json`
Cursor	`~/.cursor/mcp.json`
VS Code	`settings.json` → `github.copilot.mcp.servers`

Detection Pipeline

CodeSure runs a 5-stage analysis pipeline on every scan_code call. Each stage catches what the previous one misses, and later stages reduce false positives from earlier ones.

Code input
  │
  ├─ Stage 1: Regex Pattern Matching    36 YAML rules (OWASP + malicious)
  ├─ Stage 2: AST Taint Analysis        3-hop source→sink data flow tracking
  ├─ Stage 3: Entropy Analysis          Shannon entropy for secrets/obfuscation
  ├─ Stage 4: Behavioral Chain          Cross-signal confidence boosting
  └─ Stage 5: Context Filter            Path-based FP reduction

Stage 1 — Regex Pattern Matching

Matches each line against 36 rules (16 vulnerability + 20 malicious). Each rule is a YAML file with pattern (positive match) and pattern_not (negative exclusion). Catches known patterns like eval(), innerHTML, SQL concatenation, child_process.exec, reverse shells, crypto miners.

Research basis: Pattern-based detection remains the foundation of production SAST tools. Semgrep, CodeQL, and ESLint security plugins all use rule-based pattern matching as their primary detection layer.

Stage 2 — AST Taint Analysis

Tracks data flow from sources (req.query, document.cookie, process.env) to sinks (eval(), db.query(), innerHTML) through variable assignments, up to 3 hops. Distinguishes eval("hello") (safe constant) from eval(userInput) (tainted external input).

Research basis: Static taint analysis is the established method for detecting injection vulnerabilities. Almashfi & Lu (2021) demonstrated precise detection of DOM-based XSS via source-to-sink taint tracking in JavaScript. Ghebremichael et al. (2025) showed that combining taint specifications with static analysis detects 106 previously-undetectable vulnerabilities in npm packages.

Almashfi, N. & Lu, L. (2021). Static Taint Analysis for JavaScript Programs. CCIS vol. 1288, pp. 155-167. DOI: 10.1007/978-3-030-71472-7_13

Ghebremichael, J. et al. (2025). SemTaint: Multi-Agent Taint Specification Extraction for Vulnerability Detection. arXiv:2601.10865

Dutta, S. et al. (2021). InspectJS: Leveraging Code Similarity and User-Feedback for Effective Taint Specification Inference for JavaScript. arXiv:2111.09625

Stage 3 — Entropy Analysis

Computes Shannon entropy of string literals. High-entropy strings (≥ 4.5 bits/byte) in sensitive contexts (password, secret, api_key) are flagged as potential hardcoded secrets or obfuscated payloads. UUIDs, JWTs, hex hashes, base64 images, and version strings are exempted.

Research basis: Lyda & Hamrock (2007) established entropy analysis as an effective technique for identifying encrypted and packed malware. Sujon et al. (2025) validated Shannon entropy as a discriminating feature across file types with ML models. Veracode (2021) demonstrated its application to detecting obfuscated malicious code in JavaScript packages.

Lyda, R. & Hamrock, J. (2007). Using Entropy Analysis to Find Encrypted and Packed Malware. IEEE Security & Privacy, 5(2), 40-45. DOI: 10.1109/MSP.2007.48

Sujon, K.M. et al. (2025). A novel framework for malware detection using entropy-based statistical features. Engineering Research Express, 7(2). DOI: 10.1088/2631-8695/add645

Veracode. (2021). Using Entropy to Identify Obfuscated Malicious Code. veracode.com

Stage 4 — Behavioral Chain Analysis

Groups findings by file using a 7-category taxonomy (EXS/EXM/EXF/NET/SYS/DEF/MET). When multiple categories co-occur in one file — e.g., code execution source (EXS) + execution method (EXM) + data exfiltration (EXF) — confidence is boosted (up to 2x). Individual signals may be benign; combined signals strongly indicate malicious intent.

Research basis: Zhang et al. (2024) demonstrated that modeling malicious behavior as sequences achieves cross-ecosystem detection (npm + PyPI simultaneously). Huang et al. (2024) built a behavioral knowledge base with hierarchical classification for malicious npm packages using API call sequences.

Zhang, J. et al. (2024). Killing Two Birds with One Stone: Malicious Package Detection in NPM and PyPI using a Single Model of Malicious Behavior Sequence. ACM TOSEM, 34(4). DOI: 10.1145/3705304

Huang, C. et al. (2024). DONAPI: Malicious NPM Packages Detector using Behavior Sequence Knowledge Mapping. USENIX Security. usenix.org

Gokkaya, B. et al. (2026). Software supply chain: A taxonomy of attacks, mitigations and risk assessment strategies. J. Information Security and Applications, 97. DOI: 10.1016/j.jisa.2025.104324

Stage 5 — Context Filter

Adjusts confidence based on file path. Test files (*.test.ts, __tests__/) get reduced confidence (×0.3). Vendor/generated code (node_modules/, .min.js) gets zeroed out. Production code stays at full confidence. Prevents false positives from test fixtures and third-party code.

Research basis: Du et al. (2026) showed that hybrid static analysis + context-aware filtering eliminates 94-98% of false positives in industrial codebases (Tencent). Lin (2025) demonstrated that adaptive source-sink identification with context validation reduces FP by 43.7% on average.

Du, X. et al. (2026). Reducing False Positives in Static Bug Detection with LLMs: An Empirical Study in Industry. arXiv:2601.18844

Lin, S. (2025). LLM-Driven Adaptive Source-Sink Identification and False Positive Mitigation for Static Analysis. arXiv:2511.04023

MCP Tools

Once installed, your AI assistant gains these tools:

Tool	What it does
`scan_code`	Scans source code through the 5-stage pipeline. Returns findings with severity, confidence, line numbers, and fix suggestions.
`scan_package`	Checks npm packages for typosquatting and supply-chain risks before you install them.
`scan_manifest`	Audits browser extension and app manifests for dangerous permissions.
`report_pattern`	Anonymizes a detected malicious pattern and reports it as a GitHub issue to the community rules repo. Requires explicit user confirmation.
`update_rules`	Downloads the latest community detection rules.

Auto-Scan Setup

npx codesure install automatically writes auto-scan rules to your client's rule file. No manual setup needed:

Client	Rule file created
Claude Code	`~/.claude/rules/codesure.md`
Codex	`~/.codex/AGENTS.md`
Opencode	`~/.config/opencode/AGENTS.md`
Cursor	`~/.cursorrules`

The rule tells your AI assistant to call scan_code after writing or modifying code, fix critical findings before returning code, and warn on high findings with fix suggestions.

Manual setup — Add this to your client's rule file:

After writing or modifying code, call scan_code to check for vulnerabilities.
If critical findings exist, fix them before presenting the code.
If high findings exist, warn the user with fix suggestions.

What It Detects

Vulnerabilities — SQL injection, XSS, eval injection, hardcoded secrets, CSRF, path traversal, prototype pollution, insecure random, command injection

Malicious code — Data exfiltration (DNS/HTTP), obfuscation, reverse shells, crypto miners, keyloggers, malicious install scripts, suspicious domains

Privacy & Data Policy

Your code never leaves your machine. Your prompts are never collected.

stdio MCP: CodeSure runs as a child process of your AI client, communicating only via stdin/stdout. No HTTP server, no open ports, no network listeners during scanning.
Zero telemetry: No usage statistics, no scan results, no prompts, no user information is collected or transmitted. Ever.
No prompt access: CodeSure receives only the code string passed to scan_code. It has no access to your conversation history, prompts, or any other context from your AI client.
Bug reports only: The report_pattern tool sends only anonymized pattern metadata (rule ID, taxonomy category, confidence score) as a GitHub issue to the community rules repo. It contains zero source code, zero user information, and runs only when you explicitly confirm. This is the only network call CodeSure ever makes.

Accuracy

Benchmarked on 60+ test fixtures (development 70% / holdout 30%):

Metric	Dev Set	Holdout Set
Youden Index	0.854	0.917
True Positive Rate	> 90%	> 90%
False Positive Rate	< 10%	< 10%

Inline Suppression

Suppress specific rules on a per-line basis:

eval(trusted)  // codesure-ignore: js.security.eval-injection

Suppressed findings stay in the output with suppressed: true for audit trail. Broad suppression without a rule ID triggers a warning.

.codesureignore

Exclude files from scanning. Uses gitignore syntax. Smart defaults apply when no .codesureignore exists:

node_modules/
vendor/
dist/
build/
*.min.js
*.d.ts

Ignored files are not blocked — their confidence is reduced to 0 so findings remain visible in audit logs.

Changelog

1.2.3

refactor: ai-native compliance — userMessage field, TSDoc, error context, function splitting

1.2.2

fix: atomic upsert for shared JSON configs, encoding consistency

1.2.1

feat: automate README changelog on npm version
docs: remove constitution from AGENTS.md (now in rules files only)
docs: collapse secondary sections with details tags
feat: apply ai-native v2 — typed errors, TSDoc, AGENTS.md
docs: add changelog, inline suppression, and codesureignore sections to README
fix: resolve RULES_DIR for dist runtime; add auto-scan rules installer; bump to 1.1.4
fix: sanitize all JSON output to strip lone surrogate escape sequences
chore: bump version to 1.1.2
fix: use surrogate-safe truncation in snippet output to prevent JSON parse errors
chore: bump version to 1.1.1
docs: add detection pipeline research citations, stdio MCP details, privacy policy
docs: update README with auto-install command and client table
feat(install): add auto-install command for Claude Code, Codex, Opencode, Cursor
chore: ignore package-lock.json (bun project)
docs: add README, PRIVACY policy, RULES guide, and BENCHMARK report
test(benchmark): add Youden Index benchmark suite
test(fixtures): add security test fixtures (60+ cases)
feat(server): add MCP server entry point with auto-scan prompt
feat(tools): add community reporting and rule update tools
feat(tools): add integrated scan_code pipeline

1.2.0

feat: Apply ai-native v2 — CodeSureError typed error class with 8 error codes, TSDoc on 10 core public APIs, AGENTS.md + per-client rules files.

1.1.4

fix: Rule loading was silently broken when running via npx or node dist/. RULES_DIR now resolves correctly from both src/ and dist/ contexts.
feat: npx codesure install now writes auto-scan rules to Claude Code, Codex, Opencode, and Cursor rule files automatically.

1.1.3

fix: Sanitize all JSON output to strip lone surrogate escape sequences that caused MCP clients to fail parsing scan results.

1.1.2

fix: Use surrogate-safe truncation in snippet output.

1.1.1

docs: Detection pipeline research citations, stdio MCP architecture details, privacy policy.
feat: Auto-install command (npx codesure install) with client detection.

1.0.0

Initial release. 5-stage detection pipeline, 36 YAML rules, 5 MCP tools, 60+ test fixtures, Youden Index 0.854.

V1 Limitations

Taint analysis is limited to 3-hop AST (single file); cross-file taint is not tracked
Tier 3-6 obfuscation (control flow flattening, virtualization) may not be detected

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.claude/rules		.claude/rules
.cursor/rules		.cursor/rules
.windsurf/rules		.windsurf/rules
scripts		scripts
src		src
tests		tests
.codesureignore		.codesureignore
.gitignore		.gitignore
.npmignore		.npmignore
AGENTS.md		AGENTS.md
BENCHMARK.md		BENCHMARK.md
LICENSE		LICENSE
PRIVACY.md		PRIVACY.md
README.md		README.md
RULES.md		RULES.md
bunfig.toml		bunfig.toml
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CodeSure

Install

Detection Pipeline

MCP Tools

What It Detects

Privacy & Data Policy

Accuracy

1.2.3

1.2.2

1.2.1

1.2.0

1.1.4

1.1.3

1.1.2

1.1.1

1.0.0

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CodeSure

Install

Detection Pipeline

MCP Tools

What It Detects

Privacy & Data Policy

Accuracy

1.2.3

1.2.2

1.2.1

1.2.0

1.1.4

1.1.3

1.1.2

1.1.1

1.0.0

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages