Qwen3.6 35B Long Context API

Artificial Intelligence/Machine Learning Paid View on RapidAPI ↗

OpenAI-compatible Qwen3.6 35B A3B inference for long-context workloads. The API is designed for RAG, long documents, coding agents, and structured extraction, with a 262K-token context window and OpenAI-compatible /v1 endpoints. Streaming is disabled at launch so usage can be billed and capped safely through RapidAPI plans. Plan input limits are enforced by the backend: Basic 8K, Pro 64K, Ultra…

3 subscribers

1.9/10 popularity

486 ms avg latency

25% success rate

2 endpoints

The in-depth APIMemo review for this API hasn't been published yet — the data below comes straight from the public marketplace listing.

Qwen3.6 35B Long Context API endpoints

Method	Endpoint	Description
POST	Chat Completions /v1/chat/completions	Create OpenAI-compatible chat completions using Qwen3.6 35B A3B with long-context inputs. Streaming is disabled on RapidAPI at launch; responses include usage tokens for billing.
GET	Models /v1/models	List the available OpenAI-compatible model served by LighterHub.

Qwen3.6 35B Long Context API pricing

Plan	Price	Rate limit	Quotas
PRO Recommended	$9.99 / month	—	Requests: 500 / monthly
ULTRA	$29.99 / month	—	Requests: 2,000 / monthly
MEGA	$99 / month	—	Requests: 5,000 / monthly

Qwen3.6 35B Long Context API

Qwen3.6 35B Long Context API endpoints

Qwen3.6 35B Long Context API pricing

More Artificial Intelligence/Machine Learning APIs

Low-Cost Image Generate API

OPEN AI

Best Astrology API - Natal Charts, Transits & Synastry

AI Content Detector | AI/GPT

ChatGPT VISION

ChatGPT 4