Groq-Inference
Our platform, developed by Devs Do Code (Sreejan), offers fast inference, at 600 to 800 words per second, for all models listed on Groq's official website. We provide these services at lower rates than the official site, making the platform an affordable and efficient option for our clients. Our platform is designed to deliver high-quality…
Groq-Inference endpoints
| Method | Endpoint | Description |
|---|---|---|
| GET | /chat | The /chat route of our API enables users to interact with a large language model by sending queries and receiving responses. The route processes user queries, sends them to the… |
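
As a rough illustration of the GET /chat route described above, the sketch below builds a request without sending it. The listing does not give the host, the query parameter name, or the authentication header, so `BASE_URL`, `QUERY_PARAM`, and `X-API-Key` are placeholders chosen for this example, not documented values.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical values: the listing does not state the host or parameter names.
BASE_URL = "https://example.invalid"
QUERY_PARAM = "query"


def build_chat_request(query: str, api_key: str) -> Request:
    """Build (but do not send) a GET request for the /chat route."""
    # urlencode percent-escapes the query text so it is safe in a URL.
    url = f"{BASE_URL}/chat?{urlencode({QUERY_PARAM: query})}"
    # The header name is an assumption; use whatever the provider documents.
    return Request(url, headers={"X-API-Key": api_key}, method="GET")


req = build_chat_request("What is Groq?", "YOUR_KEY")
print(req.get_method(), req.full_url)
```

Once the real host and parameter names are known, passing the returned object to `urllib.request.urlopen` would send the request.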
Groq-Inference pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | 10 / minute | |
| PRO (Recommended) | $5 / month | 100 / minute | |
| ULTRA | $10 / month | 1000 / minute | |
| MEGA | $20 / month | 5000 / minute | |
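
A client can stay under its plan's rate limit by throttling itself before calling the API. The sketch below is one possible approach, a sliding 60-second window, using the per-minute figures from the table above. It assumes the limits count requests and that a sliding window is an acceptable approximation; the listing does not say how the server measures the minute. `MinuteLimiter` and `PLAN_LIMITS` are names invented for this example.

```python
import time
from collections import deque

# Per-minute limits copied from the pricing table.
PLAN_LIMITS = {"BASIC": 10, "PRO": 100, "ULTRA": 1000, "MEGA": 5000}


class MinuteLimiter:
    """Sliding-window limiter: allows at most `limit` calls per 60 seconds."""

    def __init__(self, plan: str, clock=time.monotonic):
        self.limit = PLAN_LIMITS[plan]
        self.clock = clock    # injectable so the limiter can be tested
        self.calls = deque()  # timestamps of calls still inside the window

    def try_acquire(self) -> bool:
        """Return True and record the call if it fits in the window."""
        now = self.clock()
        # Drop timestamps that have left the 60-second window.
        while self.calls and now - self.calls[0] >= 60.0:
            self.calls.popleft()
        if len(self.calls) < self.limit:
            self.calls.append(now)
            return True
        return False


limiter = MinuteLimiter("BASIC")
allowed = sum(limiter.try_acquire() for _ in range(11))
print(allowed)  # 10: the eleventh call in the same minute is refused
```

When `try_acquire` returns `False`, the caller can wait and retry rather than sending a request that would likely be rejected.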