Distributed NVIDIA CUDA inference API with OpenAI-compatible endpoints. Run AI models (Llama, Mistral, SDXL) on GPU-accelerated infrastructure at scale.

2 subscribers
The in-depth APIMemo review for this API hasn't been published yet — the data below comes straight from the public marketplace listing.

exo-cuda-inference pricing

PlanPriceRate limitQuotas
BASIC Free
  • Requests: 1,000 / monthly
PRO $19 / month
  • Requests: 50,000 / monthly
ULTRA $49 / month
  • Requests: 200,000 / monthly
MEGA $149 / month
  • Requests: 1,000,000 / monthly

More Artificial Intelligence/Machine Learning APIs

View all →
  • An almost free AI image generation API for cost-conscious developers. including text to image, object…

    Artificial Intelligence/Machine LearningFreemium56 subscribers
  • Harness the potential (100x affordable) of OPEN AI ( with internet access ), Claude 3 , GPT-4 (at…

    Artificial Intelligence/Machine LearningFreemium8.9k subscribers
  • Professional astrology API with natal charts, transits, synastry analysis. 23 house systems, fixed stars,…

    Artificial Intelligence/Machine LearningFreemium186 subscribers
  • Detects ChatGPT, GPT4 & Gemini Content: Simple Way & High Accuracy; OpenAI Detection API; AI Essay Detector…

    Artificial Intelligence/Machine LearningFreemium1.7k subscribers
  • 100x affordable than OpenAI same AI, with Chatgpt Vision, GPT4o vision , GPT 3.5. image processing ,Text to…

    Artificial Intelligence/Machine LearningFreemium1.8k subscribers
  • The ChatGPT 4 API from PR Labs is a multi-model AI gateway hosted on RapidAPI that bundles access to GPT-4o,…

    ReviewedArtificial Intelligence/Machine LearningFreemium21.1k subscribers