token-compression

Here are 42 public repositories matching this topic...

open-compress / claw-compactor

14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.

Updated Apr 1, 2026
Python

cokeshao / Awesome-Multimodal-Token-Compression

Star

[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198

awesome-list model-acceleration long-context mllm efficient-ai token-compression efficient-mllm

Updated May 17, 2026

xuyang-liu16 / Awesome-Token-level-Model-Compression

Star

📚 Collection of token-level model compression resources.

computer-vision model-compression model-acceleration efficient-deep-learning token-pruning token-merging token-compression

Updated Sep 3, 2025

claudioemmanuel / squeez

Sponsor

Star

Hook-based token compressor for 5 AI CLI hosts (Claude Code, Copilot CLI, OpenCode, Gemini CLI, Codex CLI). Up to 95% bash compression, signature-mode for code reads, cross-call dedup, MCP server, self-teaching protocol. Zero runtime deps.

rust opencode zero-dependency signature-extraction bash-hook llm copilot-cli ai-cli llm-tools context-window gemini-cli mcp-server token-compression claude-code session-memory codex-cli context-engineering token-optimizer

Updated May 18, 2026
Rust

HelgeSverre / toon-php

Sponsor

Star

Token-Oriented Object Notation - A compact data format for reducing token consumption when sending structured data to LLMs (PHP implementation)

php serialization ai data-format toon llm token-compression

Updated Dec 6, 2025
PHP

HumanMLLM / LLaVA-Scissor

Star

The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

video-understanding connected-components video-language-understanding mllm multimodal-large-language-models token-compression

Updated Jul 1, 2025
Python

Fanziyang-v / FlashVID

Star

[ICLR 2026 Oral] FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging

efficiency multimodal video-llms token-compression flashvid

Updated Apr 30, 2026
Python

HVision-NKU / GlimpsePrune

Star

Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"

inference-efficiency lvlms mllms visual-token-pruning token-compression

Updated Feb 13, 2026
Python

edgee-ai / edgee

Star

Open-source AI gateway written in Rust, with token compression for Claude Code, Codex... and any other LLM client.

cli cost-optimization coding-assistant agentic edgee llm-gateway token-compression context-optimization

Updated May 21, 2026
Rust

ilang-ai / autocode

Star

You say it. AutoCode ships it. 46 skills. Code to deployment in one session. I-Lang v3.0 powered. Free forever.

developer-tools persistent-memory ai-agents claude prompt-engineering anthropic anthropic-claude ai-memory token-compression claude-code claude-code-plugin claude-code-skills anthropic-skills

Updated May 11, 2026
Shell

YiwengXie / FluxMem

Star

[CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding

streaming-video video-understanding large-multimodal-models token-compression

Updated Mar 16, 2026
Python

hanxunyu / VisionTrim

Star

[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"

efficiency multimodal token-compression lightweight-vlm

Updated Feb 24, 2026
Shell

overseek944 / twotrim

Star

ultra-lightweight, mathematically robust prompt compression middleware

ai compression-algorithm token-compression ai-cost-optimization

Updated Apr 13, 2026
Python

jee599 / contextzip

Star

⚡ Cut Claude Code context 60-90%. Live stdout today, session-history compression coming v0.2.

rust cli ai developer-tools rtk claude llm context-window token-compression

Updated Apr 17, 2026
Rust

JinXins / MergeMix

Star

[ICLR 2026] MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding

image-classification data-augmentation preference-learning mixup multimodal ranking-loss mmcv llava token-merging token-compression iclr2026

Updated Feb 27, 2026
Python

plasmate-labs / plasmate

Star

The browser engine for agents. HTML in, Semantic Object Model out. 10x token compression, V8 JS rendering, CDP compatible. Apache-2.0.

rust mcp som semantic-web web-scraping cdp browser-engine ai-agents web-automation puppeteer headless-browser llm model-context-protocol token-compression agent-web-protocol

Updated May 25, 2026
HTML

sriinnu / clipforge-PAKT

Star

Lossless-first prompt compression for JSON, YAML, CSV, and Markdown. Library, CLI, MCP server, desktop app, and browser extension.

markdown cli yaml json csv mcp developer-tools lossless-compression llm pakt prompt-compression token-compression coding-agent

Updated May 11, 2026
TypeScript

sangminwoo / awesome-token-redundancy-reduction

Star

😎 Awesome papers on token redundancy reduction

token-pruning token-reduction token-merging token-compression token-sparsification token-redundancy-reduction

Updated Mar 12, 2025

mvish7 / dycoke_token_compression

Star

This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3

inference-optimization vlms video-large-language-models token-compression

Updated Nov 11, 2025
Python

MouxiaoHuang / PPE

Star

[ICLR 2026] Official code of PPE: Positional Preservation Embedding for Token Compression in Multimodal Large Language Models.

multimodal positional-encoding large-language-models vision-language-model token-merging token-compression iclr2026 token-clustering

Updated Mar 16, 2026
Python

Improve this page

Add a description, image, and links to the token-compression topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-compression topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

token-compression

Here are 42 public repositories matching this topic...

open-compress / claw-compactor

cokeshao / Awesome-Multimodal-Token-Compression

xuyang-liu16 / Awesome-Token-level-Model-Compression

claudioemmanuel / squeez

HelgeSverre / toon-php

HumanMLLM / LLaVA-Scissor

Fanziyang-v / FlashVID

HVision-NKU / GlimpsePrune

edgee-ai / edgee

ilang-ai / autocode

YiwengXie / FluxMem

hanxunyu / VisionTrim

overseek944 / twotrim

jee599 / contextzip

JinXins / MergeMix

plasmate-labs / plasmate

sriinnu / clipforge-PAKT

sangminwoo / awesome-token-redundancy-reduction

mvish7 / dycoke_token_compression

MouxiaoHuang / PPE

Improve this page

Add this topic to your repo