GPT-4.1 nano
GPT-4.1 Nano is designed for extreme efficiency, offering rapid responses with minimal resource consumption. It is the fastest model in the GPT-4.1 series, delivering the first token in under five seconds for a 128,000 token context, while still supporting a full 1 million token context for long documents and extensive code. It excels in benchmarks like MMLU (80.1%) and GPQA (50.3%), proving…
GPT-4.1 nano endpoints
| Method | Endpoint | Description |
|---|---|---|
| POST |
Chat Completion /chat/completions |
Creates a model response for the given chat conversation. |
GPT-4.1 nano pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | $1 / month | — |
|
| PRO | $5 / month | — |
|
| ULTRA | $25 / month | — |
|
| MEGA Recommended | $75 / month | — |
|