Stop paying to resend file reads, build logs, and git diffs that your agent will never reference again.
In a multi-turn coding session, every request resends the entire conversation history. By turn 20, you're paying for 15,000 tokens of old file reads, build logs, test results, and git diffs. The agent won't look at any of it again, but you're billed for every token on every subsequent turn.
Observation Masking detects tool outputs from earlier turns and replaces them with a one-line placeholder before the request is sent. The most recent 2 messages stay intact. Research from JetBrains shows this outperforms LLM-based summarization because agents re-read files when they need to instead of trusting a lossy summary.
A request arrives with a multi-turn conversation history
The proxy identifies tool outputs from earlier turns (file reads, build logs, test results, git diffs)
Large outputs are replaced with compact placeholders preserving metadata
Recent messages (last 2) are always left intact. The optimized request forwards to the provider
Create an account, add your first app, and swap one URL. Takes about 5 minutes.
Get Started Free