Realtime ASR

Realtime ASR supports live captions, voice input, and meeting-style transcription. On the overseas Open API, realtime ASR availability depends on your account and on rollout status. Treat the current GET /api/openapi.json schema and your account configuration as the source of truth.
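One way to check availability is to download the schema and look for realtime ASR paths. The sketch below is illustrative only: the base URL is a placeholder, the keyword match on path names is a guess at how such endpoints might be named, and it assumes the schema can be fetched without extra headers. Confirm all three against your own account.

```python
import json
import urllib.request

# Hypothetical base URL: replace with the host used by your account.
BASE_URL = "https://api.example.com"


def fetch_schema(base_url: str = BASE_URL) -> dict:
    """Download and parse the schema served at GET /api/openapi.json."""
    with urllib.request.urlopen(f"{base_url}/api/openapi.json", timeout=10) as resp:
        return json.load(resp)


def find_realtime_asr_paths(schema: dict, keywords=("asr", "realtime")) -> list:
    """Return schema paths whose name contains any keyword (case-insensitive).

    The keywords are an assumption; actual path names may differ.
    """
    paths = schema.get("paths", {})
    return sorted(p for p in paths if any(k in p.lower() for k in keywords))
```

An empty result from `find_realtime_asr_paths(fetch_schema())` suggests that realtime ASR is not exposed for the account, although the account configuration remains the final word.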

Request

When enabled, realtime ASR uses a WebSocket-style streaming model. The connection authenticates with the standard bearer token:

Authorization: Bearer YOUR_API_TOKEN
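As a minimal sketch, the header above can be built on the server side from a token held in an environment variable. The variable name `ASR_API_TOKEN` and the helper `auth_headers` are hypothetical; how the headers are attached to the streaming connection depends on the WebSocket client you use.

```python
import os


def auth_headers(token: str) -> dict:
    """Build the Authorization header for the streaming connection."""
    if not token or not token.strip():
        raise ValueError("API token is empty")
    return {"Authorization": f"Bearer {token.strip()}"}


def headers_from_env(var_name: str = "ASR_API_TOKEN") -> dict:
    """Read the token from an environment variable (hypothetical name)."""
    return auth_headers(os.environ.get(var_name, ""))
```

Failing fast on an empty token keeps a misconfigured deployment from opening a connection that would be rejected anyway.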

Send audio in the encoding and chunk size required by the enabled model. If your product records audio in the browser, route it through your backend or an approved token exchange flow rather than embedding the API token in client code.
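The chunking step might look like the sketch below, which runs on the backend and splits raw audio into fixed-size frames before sending. The format values (16 kHz, 16-bit mono PCM, 100 ms per chunk) are assumptions chosen for illustration; use whatever the enabled model requires.

```python
from typing import Iterator

# Assumed audio format, for illustration only.
SAMPLE_RATE_HZ = 16_000   # samples per second
BYTES_PER_SAMPLE = 2      # 16-bit mono PCM
CHUNK_MS = 100            # duration of each chunk in milliseconds


def chunk_size_bytes(sample_rate_hz: int = SAMPLE_RATE_HZ,
                     bytes_per_sample: int = BYTES_PER_SAMPLE,
                     chunk_ms: int = CHUNK_MS) -> int:
    """Bytes needed to hold chunk_ms of audio in the given format."""
    return sample_rate_hz * bytes_per_sample * chunk_ms // 1000


def iter_chunks(pcm: bytes, size: int) -> Iterator[bytes]:
    """Yield consecutive slices of pcm; the last one may be shorter."""
    if size <= 0:
        raise ValueError("chunk size must be positive")
    for start in range(0, len(pcm), size):
        yield pcm[start:start + size]
```

With the assumed format, each chunk holds 16,000 × 2 × 0.1 = 3,200 bytes. Each yielded chunk would then be sent as one binary message over the streaming connection.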
