API Real time Speech Processing
Real-Time Speech Processing API A cloud API that processes audio instantly to convert speech to text or analyze spoken content in real time. It enables live transcription, voice commands, speaker detection, and supports multiple languages. Key Features: Live speech-to-text Instant voice analysis Low latency streaming Multi-language and accent support Use Cases: Live captions, virtual assistants,…
API Real time Speech Processing endpoints
| Method | Endpoint | Description |
|---|---|---|
| Endpoints | ||
| POST |
detect_language_detect_language_post /detect-language |
Identifies spoken languages in audio content with confidence scoring. Supports multi-language detection and dialect identification. |
| POST |
asr_asr_post /asr |
Converts audio to text with real-time processing capabilities. Supports streaming input, multiple audio formats, and configurable output options. |
API Real time Speech Processing pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | — |
|
| PRO | $4.99 / month | 100 / hour |
|
| ULTRA | $9.99 / month | 1000 / hour |
|
| MEGA Recommended | $19.99 / month | 10000 / hour |
|