Qwen3.6 35B Long Context API
OpenAI-compatible Qwen3.6 35B A3B inference for long-context workloads. The API is designed for RAG, long documents, coding agents, and structured extraction, with a 262K-token context window and OpenAI-compatible /v1 endpoints. Streaming is disabled at launch so usage can be billed and capped safely through RapidAPI plans. Plan input limits are enforced by the backend: Basic 8K, Pro 64K, Ultra…
Qwen3.6 35B Long Context API endpoints
| Method | Endpoint | Description |
|---|---|---|
| POST |
Chat Completions /v1/chat/completions |
Create OpenAI-compatible chat completions using Qwen3.6 35B A3B with long-context inputs. Streaming is disabled on RapidAPI at launch; responses include usage tokens for billing. |
| GET |
Models /v1/models |
List the available OpenAI-compatible model served by LighterHub. |
Qwen3.6 35B Long Context API pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| PRO Recommended | $9.99 / month | — |
|
| ULTRA | $29.99 / month | — |
|
| MEGA | $99 / month | — |
|