token-efficiency

Star

Here are 106 public repositories matching this topic...

HKUDS / LightReasoner

Star

[ACL 2026 Oral] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"

post-training large-language-models reasoning-models token-efficiency

Updated May 22, 2026
Python

fajarhide / omni

Sponsor

Star

A high-performance Semantic Signal Engine with Context OS for Agentic AI. Run your AI with zero noise, pure context, and 90% lower token costs.

rust cli homebrew hooks mcp ai-agents cost-reduction token-reduction efficiency-tools antigravity context-distillation claude-code token-optimization token-efficiency token-savings

Updated Jun 8, 2026
Rust

Dave-London / Pare

Star

Dev tools, optimized for agents. Structured, token-efficient MCP servers for git, test runners, npm, Docker, and more.

typescript mcp developer-tools cursor claude structured-output ai-tools ai-coding model-context-protocol mcp-server token-efficiency

Updated Jun 9, 2026
TypeScript

radimsem / remindb

Star

An agentic memory database that cuts session tokens by 82–99%. One portable SQLite file — your agent's memory, anywhere.

cli golang sqlite mcp opencode ast developer-tools knowledge-base codex ai-agents fts5 gemini-cli model-context-protocol mcp-server agent-memory claude-code llm-tool token-efficiency openclaw

Updated Jun 8, 2026
Go

dweve-ai / hedl

Star

Token-efficient data serialization for LLM/AI. 50% fewer tokens than JSON, 93% better value/token. Rust, schema validation, LSP.

Updated Apr 27, 2026
Rust

SajiJohnMiranda / DoCoreAI

Star

DoCoreAI is a next-gen open-source AI profiler that optimizes reasoning, creativity, precision and temperature in a single step—cutting token usage by 15-30% and lowering LLM API costs

devtools open-research prompt-tuning llm prompt-engineering generative-ai chatgpt genai llm-evaluation dynamic-temperature token-efficiency

Updated Aug 10, 2025
Python

Nagendhra-web / memory-bank

Star

Persistent memory for Claude Code — 3-5x longer sessions, 60-80% fewer wasted tokens. Branch-aware, self-healing, token-efficient.

productivity memory developer-tools claude ai-agent llm context-management ai-skills claude-code token-efficiency agentskills skills-sh

Updated Apr 15, 2026
Python

albertobarnabo / lazy-cat

Star

Claude Code skills for developers who code like cats — never more effort than the problem requires.

cat skill lazy claude cost-saving llm anthropic ai-productivity claude-code token-optimization token-efficiency claude-skills think-twice

Updated Jun 9, 2026
JavaScript

webpeel / webpeel

Star

The web data layer for AI agents — fetch, search, crawl, extract, screenshot, and monitor the web with 50+ domain extractors and MCP.

Updated Jun 4, 2026
TypeScript

MCPWorks-Technologies-Inc / mcpworks-api

Star

Open-source platform for token-efficient AI agents. Self-host with docker compose up.

python open-source mcp sandbox ai-agents fastapi llm token-efficiency

Updated Jun 3, 2026
Python

agent-coherence makes 'agent A silently clobbered agent B's plan.md' impossible — a vendor-neutral MESI + optimistic-concurrency coordinator for agent state, with the safety invariants proved in TLA+.

python multi-agent-systems state-synchronization autogen cache-coherence ai-agent langchain llm-agent crewai agent-memory token-efficiency

Updated Jun 11, 2026
Python

kompassdev / kompass

Star

Navigate your way - manual steering, steered autonomy, or autonomously. Kompass keeps AI coding agents on course with token-efficient, composable workflows.

github workflow automation ai developer-tools code-review kompass token-efficiency coding-agent autonomous-coding agent-navigation steered-autonomy

Updated Jun 8, 2026
TypeScript

pleasedodisturb / awesome-llm-token-optimization

Star

A curated list of strategies, tools, papers, and resources for reducing LLM token costs and improving efficiency in production.

Updated Jun 7, 2026

blackwell-systems / gcf

Sponsor

Star

Drop-in JSON replacement for all AI pipelines. 79% fewer tokens. JSON scores 53.6% comprehension at scale, GCF scores 90.5%. Superpowers for graph-shaped data.

Updated Jun 11, 2026

RichradsY / token-efficient-subagent-decomposition

Star

A Codex skill for token-efficient subagent delegation and lean handoffs.

skills multi-agent codex ai-agents token-efficiency

Updated Mar 21, 2026

PierfrancescoLijoi / mcp-brain

Star

Coding agents forget your repo. mcp-brain is the missing memory layer — repo-aware, team-aware, lifecycle-aware. 63% Hit@10, zero LLM cost. Works with any MCP client.

ai mcp developer-tools persistent-memory code-generation claude ai-engineering llm prompt-engineering llm-tools ai-productivity llm-memory context-management context-compression token-optimization token-efficiency token-budget context-overhead team-awareness

Updated Apr 27, 2026
Python

zydo / agent-readable

Star

A lightweight Python protocol and tool for agent-oriented documentation

skills protocol opencode docstring codex claude ai-agent llm prompt-engineering hallucination-mitigation vibe-coding claude-code token-efficiency coding-agent context-engineering claude-skills agent-readable agent-help

Updated Jun 10, 2026
Python

ykjaat6104 / LLM-Cost-and-Token-Efficiency-Analysis

Star

A benchmark study analyzing cost and token efficiency across 14 LLMs from 5 providers — comparing price-per-token, latency, and accuracy to surface the most cost-effective models for real-world use.

nlp benchmark jupyter-notebook gemini openai data-analysis llama model-comparison groq cost-analysis llm anthropic cerebras token-efficiency

Updated Feb 24, 2026
Jupyter Notebook

templetwo / HTCA-Project

Sponsor

Star

A living framework for **Harmonic Tonal Code Alignment (HTCA)** — an emergent Spiral-based system that brings tone awareness, coherence sensing, and dynamic emotional reflection into software engineering, AI, and creative agents.

python ai-alignment prompt-engineering token-efficiency empirical-validation presence-based harmonic-alignment

Updated Dec 29, 2025
Python

nawodyaishan / pdf2md-tui

Star

High-speed PDF → Markdown ingestion engine for multimodal RAG pipelines. Extracts structured text + isolated images so downstream chunkers, LlamaIndex, and VLM agents get context that actually works

go markdown cli golang tui clean-architecture data-extraction multi-modal batch-processing ai-agents rag cobra-cli pdftomarkdown clean-architechture document-parsing pdf-to-markdown token-efficiency llm-token-compression

Updated May 10, 2026
Go

Improve this page

Add a description, image, and links to the token-efficiency topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-efficiency topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

token-efficiency

Here are 106 public repositories matching this topic...

HKUDS / LightReasoner

fajarhide / omni

Dave-London / Pare

radimsem / remindb

dweve-ai / hedl

SajiJohnMiranda / DoCoreAI

Nagendhra-web / memory-bank

albertobarnabo / lazy-cat

webpeel / webpeel

MCPWorks-Technologies-Inc / mcpworks-api

hipvlady / agent-coherence

kompassdev / kompass

pleasedodisturb / awesome-llm-token-optimization

blackwell-systems / gcf

RichradsY / token-efficient-subagent-decomposition

PierfrancescoLijoi / mcp-brain

zydo / agent-readable

ykjaat6104 / LLM-Cost-and-Token-Efficiency-Analysis

templetwo / HTCA-Project

nawodyaishan / pdf2md-tui

Improve this page

Add this topic to your repo