Context Compression: Economic Token Pruning

Economics · updated Mon Feb 23

Reducing token costs by summarizing history while keeping logic intact.

Steps

  1. Summarize older turns once context exceeds 50% of the window.
  2. Protect 'Hard Facts' (entities/dates) from being lost in summaries.
  3. Use 'Recursive Summarization' for 100k+ token sessions.
  4. Compare 'Summary Accuracy' against raw history periodically.
  5. Drop 'Low-Utility' turns (e.g., greetings, filler) from the context.

view raw JSON →