token-compression

Here are 28 public repositories matching this topic...

open-compress / claw-compactor

14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.

Updated Apr 1, 2026
Python

cokeshao / Awesome-Multimodal-Token-Compression

Star

[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198

awesome-list model-acceleration long-context mllm efficient-ai token-compression efficient-mllm

Updated Feb 22, 2026

xuyang-liu16 / Awesome-Token-level-Model-Compression

Star

📚 Collection of token-level model compression resources.

computer-vision model-compression model-acceleration efficient-deep-learning token-pruning token-merging token-compression

Updated Sep 3, 2025

HelgeSverre / toon-php

Sponsor

Star

Token-Oriented Object Notation - A compact data format for reducing token consumption when sending structured data to LLMs (PHP implementation)

php serialization ai data-format toon llm token-compression

Updated Dec 6, 2025
PHP

HumanMLLM / LLaVA-Scissor

Star

The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

video-understanding connected-components video-language-understanding mllm multimodal-large-language-models token-compression

Updated Jul 1, 2025
Python

HVision-NKU / GlimpsePrune

Star

Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"

inference-efficiency lvlms mllms visual-token-pruning token-compression

Updated Feb 13, 2026
Python

ilang-ai / autocode

Star

You say it. AutoCode builds it. 38 professional skills, persistent memory, 60%+ dev cost savings. Zero dependencies. Free forever.

developer-tools persistent-memory ai-agents claude prompt-engineering anthropic anthropic-claude ai-memory token-compression claude-code claude-code-plugin claude-code-skills anthropic-skills

Updated Mar 20, 2026
Shell

YiwengXie / FluxMem

Star

[CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding

streaming-video video-understanding large-multimodal-models token-compression

Updated Mar 16, 2026
Python

Fanziyang-v / FlashVID

Star

[ICLR 2026 Oral] FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging

efficiency multimodal video-llms token-compression flashvid

Updated Mar 31, 2026
Python

hanxunyu / VisionTrim

Star

[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"

efficiency multimodal token-compression lightweight-vlm

Updated Feb 24, 2026
Shell

edgee-ai / edgee

Star

Open-source AI gateway written in Rust, with token compression for Claude Code, Codex... and any other LLM client.

cli cost-optimization coding-assistant agentic edgee llm-gateway token-compression context-optimization

Updated Apr 2, 2026
Rust

JinXins / MergeMix

Star

[ICLR 2026] MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding

image-classification data-augmentation preference-learning mixup multimodal ranking-loss mmcv llava token-merging token-compression iclr2026

Updated Feb 27, 2026
Python

sangminwoo / awesome-token-redundancy-reduction

Star

😎 Awesome papers on token redundancy reduction

token-pruning token-reduction token-merging token-compression token-sparsification token-redundancy-reduction

Updated Mar 12, 2025

jee599 / contextzip

Star

⚡ Compress Claude Code context by 60-90%. Six noise filters RTK doesn't have.

rust cli ai developer-tools rtk claude llm context-window token-compression

Updated Mar 20, 2026
Rust

mvish7 / dycoke_token_compression

Star

This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3

inference-optimization vlms video-large-language-models token-compression

Updated Nov 11, 2025
Python

plasmate-labs / plasmate

Star

The browser engine for agents. HTML in, Semantic Object Model out. 10x token compression, V8 JS rendering, CDP compatible. Apache-2.0.

rust mcp som semantic-web web-scraping cdp browser-engine ai-agents web-automation puppeteer headless-browser llm token-compression agent-web-protocol

Updated Apr 2, 2026
HTML

MouxiaoHuang / PPE

Star

[ICLR 2026] Official code of PPE: Positional Preservation Embedding for Token Compression in Multimodal Large Language Models.

multimodal positional-encoding large-language-models vision-language-model token-merging token-compression iclr2026 token-clustering

Updated Mar 16, 2026
Python

AP3008 / Janus

Star

Rust Local Token Compression Proxy for coding agents, built solo for GenAI Genesis 2026. 🏆 1st Google Sustainability Hack

rust redis local proxy-server tui tokio deduplication ratatui axum-framework token-compression semantic-caching

Updated Mar 16, 2026
Rust

claudioemmanuel / squeez

Sponsor

Star

Token compression + context memory for Claude Code etc. Runs automatically. No configuration required.

rust opencode developer-tools llm token-compression context-optimization claude-code opencode-ai

Updated Apr 2, 2026
Rust

pzrain / DiViCo

Star

Official implementation of TCSVT 2025 paper: DiViCo: Disentangled Visual Token Compression For Efficient Large Vision-Language Model

multimodal large-vision-language-model token-compression

Updated May 13, 2025
Python

Improve this page

Add a description, image, and links to the token-compression topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-compression topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

token-compression

Here are 28 public repositories matching this topic...

open-compress / claw-compactor

cokeshao / Awesome-Multimodal-Token-Compression

xuyang-liu16 / Awesome-Token-level-Model-Compression

HelgeSverre / toon-php

HumanMLLM / LLaVA-Scissor

HVision-NKU / GlimpsePrune

ilang-ai / autocode

YiwengXie / FluxMem

Fanziyang-v / FlashVID

hanxunyu / VisionTrim

edgee-ai / edgee

JinXins / MergeMix

sangminwoo / awesome-token-redundancy-reduction

jee599 / contextzip

mvish7 / dycoke_token_compression

plasmate-labs / plasmate

MouxiaoHuang / PPE

AP3008 / Janus

claudioemmanuel / squeez

pzrain / DiViCo

Improve this page

Add this topic to your repo