Speech Segmentation API
Speech Segmentation API divides audio speech into meaningful segments or units (words, sentences, speakers, or topics). Key Features: Word-level segmentation: Splits audio into individual words. Sentence or phrase segmentation: Detects sentence boundaries. Speaker diarization: Separates audio segments by speaker. Topic segmentation: Identifies topic-based sections in audio content. Common Use…
Speech Segmentation API endpoints
| Method | Endpoint | Description |
|---|---|---|
| Endpoints | ||
| POST |
asr_asr_post /asr |
Convert audio content to text with enterprise-grade accuracy. Features include multi-language support, timestamp generation, and multiple output formats including subtitles. |
| POST |
detect_language_detect_language_post /detect-language |
Automatically identify the primary language in audio content using advanced linguistic models. Supports detection across 90+ languages with high accuracy. |
Speech Segmentation API pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | — |
|
| PRO | $4.99 / month | 100 / hour |
|
| ULTRA | $9.99 / month | 1000 / hour |
|
| MEGA Recommended | $19.99 / month | 10000 / hour |
|