Yingzhi Innovation
Yingzhi is a production-ready inference provider built on scalable GPU compute infrastructure. We currently operate a high-performance AI compute cluster (including RTX 4090, H100, and H200 nodes) capable of supporting both large-scale and real-time inference workloads. Our platform already supports multiple LLM families (such as DeepSeek, Qwen, GLM, Gemma, and other open-source models), and is…
Yingzhi Innovation endpoints
| Method | Endpoint | Description |
|---|---|---|
| GET | /v1/models | Get models |
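
As a rough illustration of calling the `GET /v1/models` endpoint listed above, here is a minimal Python sketch using only the standard library. The base URL `https://api.example.com` is a placeholder, and the bearer-token `Authorization` header is an assumption; neither is stated in this listing, so check the provider's documentation for the real host and authentication scheme. The function names are hypothetical.

```python
import json
import urllib.request

# Hypothetical base URL: the listing does not state the real host.
BASE_URL = "https://api.example.com"


def build_models_request(base_url: str, api_key: str) -> urllib.request.Request:
    """Build a GET request for the /v1/models endpoint."""
    url = base_url.rstrip("/") + "/v1/models"
    headers = {
        "Accept": "application/json",
        # Assumed bearer-token auth; the listing does not confirm this.
        "Authorization": f"Bearer {api_key}",
    }
    return urllib.request.Request(url, headers=headers, method="GET")


def list_models(base_url: str, api_key: str, timeout: float = 10.0):
    """Send the request and return the decoded JSON body."""
    request = build_models_request(base_url, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `list_models(BASE_URL, "your-key")` would then return whatever JSON the endpoint responds with; the response schema is not described here, so the sketch does not assume one.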
Yingzhi Innovation pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | 64 / second | |
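
To stay under the BASIC plan's rate limit, a client can space its own calls. The sketch below assumes "64 / second" means 64 requests per second, which the listing does not spell out. `MinIntervalLimiter` is a hypothetical helper, not part of any Yingzhi SDK, and it only throttles on the client side; the server's enforcement may differ.

```python
import time


class MinIntervalLimiter:
    """Space calls at least 1/rate seconds apart (client-side only)."""

    def __init__(self, rate_per_second: float, clock=time.monotonic, sleep=time.sleep):
        self.interval = 1.0 / rate_per_second
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = 0.0

    def wait(self) -> float:
        """Block until the next call is allowed; return the seconds slept."""
        now = self._clock()
        delay = max(0.0, self._next_allowed - now)
        if delay > 0:
            self._sleep(delay)
        # Schedule the next slot one interval after this call's slot.
        self._next_allowed = max(now, self._next_allowed) + self.interval
        return delay


# Assumed interpretation of the BASIC plan: 64 requests per second.
limiter = MinIntervalLimiter(rate_per_second=64)
```

Calling `limiter.wait()` before each request keeps a single-threaded client at or below the assumed limit; a multi-threaded client would need a lock around `wait()`.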