textclf/llama3.1-8b-icq-4bit
Inference endpoint for Llama-3.1-8B-Instruct 4-bit ICQ quantized
6 subscribers
9.2/10 popularity
15274 ms avg latency
88% success rate
2 endpoints
The in-depth APIMemo review for this API hasn't been published yet —
the data below comes straight from the public marketplace listing.
textclf/llama3.1-8b-icq-4bit endpoints
| Method | Endpoint | Description |
|---|---|---|
| Chat | ||
| POST |
/chat/completions /chat/completions |
Same as /v1/chat/completions. |
| POST |
/v1/chat/completions /v1/chat/completions |
Generates a completion for a chat conversation. |
textclf/llama3.1-8b-icq-4bit pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | — |
|