Skip to content
#

token-efficiency

Here are 106 public repositories matching this topic...

A curated list of strategies, tools, papers, and resources for reducing LLM token costs and improving efficiency in production.

  • Updated Jun 7, 2026
gcf

Drop-in JSON replacement for all AI pipelines. 79% fewer tokens. JSON scores 53.6% comprehension at scale, GCF scores 90.5%. Superpowers for graph-shaped data.

  • Updated Jun 11, 2026

Coding agents forget your repo. mcp-brain is the missing memory layer — repo-aware, team-aware, lifecycle-aware. 63% Hit@10, zero LLM cost. Works with any MCP client.

  • Updated Apr 27, 2026
  • Python

High-speed PDF → Markdown ingestion engine for multimodal RAG pipelines. Extracts structured text + isolated images so downstream chunkers, LlamaIndex, and VLM agents get context that actually works

  • Updated May 10, 2026
  • Go

Improve this page

Add a description, image, and links to the token-efficiency topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the token-efficiency topic, visit your repo's landing page and select "manage topics."

Learn more