Speech To Text API
Transcribe audio files to text using AI. Supports MP3, WAV, OGG, FLAC, M4A, WebM formats. Detect language and get time-stamped segments.
Speech To Text API endpoints
| Method | Endpoint | Description |
|---|---|---|
| transcribe | ||
| POST |
transcribeAudio /transcribe |
Transcribe audio file to text using Azure or Whisper engine |
| detect-language | ||
| POST |
detectLanguage /detect-language |
Detect the language of an audio file |
| languages | ||
| GET |
listLanguages /languages |
List all supported languages for transcription |
| models | ||
| GET |
listModels /models |
List available transcription models |
| engines | ||
| GET |
listEngines /engines |
List available speech-to-text engines |
Speech To Text API pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | — |
|