LLaMA 4 70B
Access LLaMA 4 70B via AWS Bedrock — blazing-fast, low-latency API for advanced LLM tasks including text generation, summarization, reasoning, and more. Ideal for developers building intelligent apps or integrating generative AI.
LLaMA 4 70B endpoints
| Method | Endpoint | Description |
|---|---|---|
| default | ||
| POST |
invokeBedrockModel /default/bedrock_proxy |
Sends a POST request to invoke a Bedrock model. The body is editable JSON with modelId, region, and body (containing prompt and max_tokens_to_sample). |
LLaMA 4 70B pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | — |
|
| PRO | $5 / month | — |
|
| ULTRA | $15 / month | — |
|
| MEGA | Free | — |
|