46% smaller tool responses on average. One bearer token.
Point your MCP client at https://api.gotcontext.ai/mcp, add your key, and every tool response is compressed before your agent reads it. Works with Claude Code, Cursor, Codex, and Gemini CLI — 150+ tools.
Free tier — 1,000 compressions/month, no credit card.
Compatible with
How a response gets compressed.
Same input, same output, every run. Four steps, no model in the loop. The output is a re-ranking of your own sentences — every token in the compressed response appears in the original.
Why the output is auditable.
Not a summary. A re-ranking of your own sentences. Documents are chunked, embedded, and scored on a semantic graph; only the highest-ranked nodes survive into the output. Typically 46% smaller on production traffic. Methodology & benchmark peak in the measurement section. measurement section.
- Three compression modes: fast / balanced / SBERT
- AST-aware code compression for 7+ languages
- Per-workspace key scoping. Keys cannot read across workspaces.
- Command Palette — Cmd+K navigation, G+D shortcuts, full-text search
- GitHub Integration — auto-compress on push with PR token summaries
- Real-Time Queue Monitor — live SSE streaming for batch jobs
- Roles: Owner, Admin, Member, Viewer. Shared projects, activity feed.
- Stacks with native prompt caching (Anthropic / OpenAI / Gemini). When both apply, total input-cost reduction can reach 95% — see methodology.
150+ MCP tools behind one endpoint.
Works with any MCP-compatible client. Claude Code, Cursor, Windsurf, VS Code — they all get the full compression layer, GitHub webhook integration, and real-time SSE monitoring. Configuration is one JSON block, shown below.
- CWE-22 path traversal prevention on all file I/O
- Async batch ingest — 4× throughput
- Prometheus metrics, OpenTelemetry tracing, health checks
gc_compress_manifestshrinks MCP tool-description bloat
{
"mcpServers": {
"gotcontext": {
"url": "https://api.gotcontext.ai/mcp",
"headers": {
"Authorization": "Bearer gc_your_key_here"
}
}
}
}Try it now
Paste any text and see how much you can save. No signup required.
Text is processed in-memory and is not stored, logged with PII, or used for training. Do not paste secrets or production credentials. Privacy details →
Pay for tool calls. Compression is included.
Every MCP tool response is compressed before it returns to your agent, so each call delivers more context per token. The multiplier scales with the live compression ratio (see hero). Covers solo developers to enterprise teams.
Free
- 1,000 compressions/month
- 100KB max document
- Standard compression
- Command Palette & shortcuts
- Activity Feed
- Dark/Light theme
- Community support
Pro
- 50,000 compressions/month
- All 150+ MCP tools (incl. ACE, knowledge mgmt, multimodal)
- Priority queue: 2 concurrent compression slots
- 1MB max document
- Accelerated compression (3-5x faster)
- Queue Monitor (real-time SSE)
- Usage analytics
- Webhook Notifications
- Priority support
Business
- 500,000+ compressions / month
- All 150+ MCP tools
- Priority queue: 8 concurrent compression slots
- Self-hosted Docker (run in your VPC)
- OIDC federation (Okta, Auth0, Azure AD)
- Audit-log export (NDJSON/CSV) for SOC2
- SBERT embeddings (higher fidelity than the default MiniLM tier)
- SSO / SAML
- Email support · SLA on request (custom MSA)
- DPA / IP indemnity / custom MSA
Start free.
1,000 compressions/month, all 150+ tools, no credit card.