LLM Prompt Compression API
LLM Prompt Compression API compresses verbose prompts and documents into high-signal text for LLM workflows, RAG pipelines, semantic search, and vector database retrieval. Built with native C++, FAISS-backed vector search, and NLTK-based NLP preprocessing, it improves retrieval quality, reduces token cost, and preserves intent, entities, and key facts.
LLM Prompt Compression API endpoints
| Method | Endpoint | Description |
|---|---|---|
| Query Optimization | ||
| POST |
prepareQuery /prepare-query |
|
| POST |
prepareQueryFile /prepare-query-file |
|
LLM Prompt Compression API pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | — |
|