token-optimization

Here are 1,073 public repositories matching this topic...

rtk-ai / rtk

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

rust cli productivity open-source developer-tools command-line-tool llm cost-reduction anthropic ai-coding claude-code token-optimization agentic-coding

Updated Jul 28, 2026
Rust

headroomlabs-ai / headroom

Star

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 20% fewer tokens for coding agents, 60-95% fewer tokens for JSON, same answers. Library, proxy, MCP server.

Updated Jul 28, 2026
Python

Control what your AI can see. LeanCTX (Lean Context) is the context intelligence layer for AI agents — one local Rust binary that decides what they read, remembers what they learn, guards what they touch, and proves what they save. 60–90% fewer tokens as the receipt. 76 MCP tools, 30+ agents, local-first.

rust ai mcp developer-tools cursor copilot ai-agents llm gemini-cli ai-coding mcp-server claude-code token-optimization agentic-coding context-engineering context-layer reduce-token-costs lean-context context-intelligence

Updated Jul 28, 2026
Rust

jgravelle / jcodemunch-mcp

Star

Cut AI token costs 95%+ on code exploration. The leading MCP server for precise, symbol-level GitHub code retrieval via tree-sitter AST. Works with Claude Code, Cursor & any MCP client. 313B+ tokens saved.

Updated Jul 28, 2026
Python

cytostack / openwolf

Star

Sharper context. Fewer tokens. Open-source middleware for Claude Code.

cli open-source middleware opencode openai developer-tools codex anthropic claude-ai claude-code token-optimization

Updated Jul 15, 2026
TypeScript

alexgreensh / token-optimizer

Sponsor

Star

Find the ghost tokens. Fix them. Survive compaction. Avoid context quality decay.

token-usage context-window claude-code token-optimization context-engineering claude-plugin claude-code-skill token-optimizer agentskills ghost-tokens

Updated Jul 28, 2026
Python

lucasrosati / claude-code-memory-setup

Star

Up to 71.5x fewer tokens per session on Claude Code with Obsidian + Graphify. Persistent memory, codebase knowledge graphs, and chat import pipeline. 🇧🇷 PT-BR included.

knowledge-graph obsidian zettelkasten developer-productivity second-brain ai-tools graphify claude-code token-optimization coding-agent

Updated Jun 1, 2026
Python

ojuschugh1 / sqz

Star

Compress LLM context to save tokens and reduce costs

javascript python api rust cli open-source ai extensions context tokens developer-tools token cost-optimization llms agentic-ai token-optimization

Updated Jun 21, 2026
Rust

zdk / lowfat

Star

lowfat - slim your command output. strips noise, saves tokens.

rust cli open-source developer-tools shell-script llm cost-reduction token-optimization agentic-coding-tool token-savings token-saving

Updated Jul 8, 2026
Rust

nadimtuhin / claude-token-optimizer

Star

Optimize token usage for Claude API calls

documentation automation opensource developer-tools ai-assistant claude-code token-optimization setup-template

Updated Jul 27, 2026
JavaScript

gglucass / headroom-desktop

Star

Unlock 2x more Claude Code and Codex usage

react macos rust typescript ai proxy openai developer-tools codex tauri menu-bar-app llm anthropic prompt-compression claude-code token-optimization

Updated Jul 28, 2026
Rust

Tura-AI / tura

Star

Across 348 long-horizon benchmark sessions, Tura used up to 83.1% fewer turns on the rewrite benchmark and improved the DeepSWE pass rate by up to 16.7 percentage points compared with Codex CLI.

agent developer-tools terminal-based llm token-usage agentic-ai token-optimization coding-agent context-engineering developer-tools-ai-agent harness-engineering

Updated Jul 28, 2026
Rust

ooples / token-optimizer-mcp

Sponsor

Star

Intelligent token optimization for Claude Code - achieving 95%+ token reduction through caching, compression, and smart tool intelligence

caching compression ai mcp claude llm mcp-server token-optimization gemini-cli-extension

Updated Jul 25, 2026
TypeScript

GMaN1911 / claude-cognitive

Star

Working memory for Claude Code - persistent context and multi-instance coordination

productivity developer-tools claude-ai context-management claude-code token-optimization

Updated Jan 17, 2026
Python

juyterman1000 / entroly

Star

Compress tool outputs, logs, files, conversations, and RAG context before they reach the model. On measured workloads, Entroly reduces unnecessary tokens by up to 90% while preserving answer-critical evidence through budget-aware selection, content-addressed recovery, and auditable Context Receipt. Works with Claude, Chatgpt/Codex,Openclaw.

Updated Jul 28, 2026
Python

IyadhKhalfallah / clauditor

Star

Stop Claude Code from burning through your quota in 20 minutes. Auto-rotates oversized sessions and preserves context.

cli hooks claude-code token-optimization

Updated Apr 16, 2026
TypeScript

edouard-claude / snip

Star

CLI proxy that reduces LLM token usage by 60-90%. Declarative YAML filters for Claude Code, Cursor, Copilot, Gemini. rtk alternative in Go.

Updated Jul 28, 2026
Go

skibidiskib / ai-codex

Star

Generate a compact codebase index for AI assistants — saves 50K+ tokens per conversation

typescript nextjs developer-tools cursor claude llm-tools ai-coding claude-code token-optimization codebase-index

Updated May 30, 2026
TypeScript

ratel-ai / ratel

Star

Context engineering for AI agents. ~80% fewer tokens. Fix tool overload. Skills and memory with in-process BM25 and semantic retrieval. Progressive Disclosure. No vector DB.

skills memory optimization mcp context accuracy agents harness rag llm tool-selection tool-calling llm-routing mcp-server token-optimization claude-skills

Updated Jul 28, 2026
Rust

Lap-Platform / LAP

Star

Your agents are guessing at APIs. Give them the actual Agent-Native spec. 1500+ API's Ready To-Use skills, Compile any API spec into a lean, agent-native format. 10× smaller. OpenAPI, GraphQL, AsyncAPI, Protobuf, Postman.

Updated Mar 26, 2026
Python

Improve this page

Add a description, image, and links to the token-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

token-optimization

Here are 1,073 public repositories matching this topic...

rtk-ai / rtk

headroomlabs-ai / headroom

yvgude / lean-ctx

jgravelle / jcodemunch-mcp

cytostack / openwolf

alexgreensh / token-optimizer

lucasrosati / claude-code-memory-setup

ojuschugh1 / sqz

zdk / lowfat

nadimtuhin / claude-token-optimizer

gglucass / headroom-desktop

Tura-AI / tura

ooples / token-optimizer-mcp

GMaN1911 / claude-cognitive

juyterman1000 / entroly

IyadhKhalfallah / clauditor

edouard-claude / snip

skibidiskib / ai-codex

ratel-ai / ratel

Lap-Platform / LAP

Improve this page

Add this topic to your repo