## Welcome to Voice API A single endpoint for everything voice — **text-to-speech**, **speech-to-text**, and **voice cloning** — built for production workloads. ### Highlights - 🎙️ **Studio-grade TTS** with multi-speaker scripting, inline emotion cues (e.g. `[excited]`, `[whisper]`, `[sigh]`), and per-speaker model binding. Long scripts within the per-call character cap are auto-chunked and…

8 subscribers
9.1/10 popularity
256 ms avg latency
96% success rate
10 endpoints
The in-depth APIMemo review for this API hasn't been published yet — the data below comes straight from the public marketplace listing.

Voice API endpoints

MethodEndpointDescription
models
GET models_list
/v1/models
Filter by title (fuzzy match), language, tag and sort order. **Frequently requested pages are cached**; pass `refresh=true` to bypass the cache. ### Query parameters | Name |…
tts
POST tts_submit
/v1/tts
Synthesise text into MP3 audio. - **Multi-speaker**: prefix text with `` / `` / … — each marker maps to the N-th model in `model_ids` - **Emotion hints**: content inside square…
GET tts_status
/v1/tts/{task_id}
Returns the task's current `status` and progress. When `status=succeeded`, download the audio from `/v1/tts/{task_id}/audio`.
GET tts_audio
/v1/tts/{task_id}/audio
Call after `status=succeeded`. Returns `Content-Type: audio/mpeg` with `Content-Disposition` defaulting to `{task_id}.mp3`.
stt
POST stt_submit
/v1/stt
Upload audio (mp3 / wav / m4a, etc.). The backend handles the full "upload → create remote task → poll → fetch result" pipeline; the client only needs to poll once. Optional…
GET stt_status
/v1/stt/{task_id}
`progress` is updated to 50 / 80 during the polling phase and 100 on completion. `result` is always `null` here; fetch the transcript from `/v1/stt/{task_id}/result` to avoid…
GET stt_result
/v1/stt/{task_id}/result
When `plain_text=true`, returns only the merged transcript. When `false` (default), includes segments (timestamps + speaker labels).
clone
GET clone_status
/v1/clone/{task_id}
Once `status=succeeded`, read `result.model_id` for the trained model ID.
POST clone_submit
/v1/clone
Upload a reference audio clip to train a private TTS model. **Reference audio requirements**: duration **≥ 10 s**, recommended 10–90 s. Clips that are too long will be rejected.…
health
GET health
/v1/health
Public endpoint, no authentication required. Suitable for load balancers and Kubernetes health checks.

Voice API pricing

PlanPriceRate limitQuotas
BASIC Free
  • Requests: 1,000 / monthly
  • Seconds: 1,500 / monthly
  • Characters: 30,000 / monthly
PRO Recommended $15 / month 2000 / hour
  • Requests: 10,000 / monthly
  • Seconds: 15,000 / monthly
  • Characters: 300,000 / monthly
ULTRA $30 / month 3000 / hour
  • Requests: 30,000 / monthly
  • Seconds: 45,000 / monthly
  • Characters: 1,000,000 / monthly

More Artificial Intelligence/Machine Learning APIs

View all →
  • An almost free AI image generation API for cost-conscious developers. including text to image, object…

    Artificial Intelligence/Machine LearningFreemium56 subscribers
  • Harness the potential (100x affordable) of OPEN AI ( with internet access ), Claude 3 , GPT-4 (at…

    Artificial Intelligence/Machine LearningFreemium8.9k subscribers
  • Professional astrology API with natal charts, transits, synastry analysis. 23 house systems, fixed stars,…

    Artificial Intelligence/Machine LearningFreemium186 subscribers
  • Detects ChatGPT, GPT4 & Gemini Content: Simple Way & High Accuracy; OpenAI Detection API; AI Essay Detector…

    Artificial Intelligence/Machine LearningFreemium1.7k subscribers
  • 100x affordable than OpenAI same AI, with Chatgpt Vision, GPT4o vision , GPT 3.5. image processing ,Text to…

    Artificial Intelligence/Machine LearningFreemium1.8k subscribers
  • Harness the potential of alternatives oof GPT-5, ChatGPT 4 (100x affordable), o3-mini, Deepseek R1, GPT-4…

    Artificial Intelligence/Machine LearningFreemium21.1k subscribers