Endpoint
For SambaStack, developers should check with their system administrator for the correct URL.
Request parameters
The following tables list the parameters for a transcription request, along with each parameter's type, description, and default value. For improved accuracy, we strongly recommend specifying the language parameter when using any audio model.
Whisper Large v3
Parameter | Type | Description | Default |
---|---|---|---|
model | String | The ID of the model to use. | Required |
file | File | Audio file in FLAC, MP3, MP4, MPEG, MPGA, M4A, Ogg, WAV, or WebM format. File size limit is 25MB. | Required |
prompt | String | Prompt provided to influence transcription style or vocabulary. Example: “Please transcribe carefully, including pauses and hesitations.” | Optional |
response_format | String | Output format: JSON or text. | json |
language | String | The language of the input audio. Supplying it as an ISO 639-1 code (e.g., en) improves accuracy and latency. | Optional |
stream | Boolean | Enables streaming responses. | false |
stream_options | Object | Additional streaming configuration (e.g., {"include_usage": true}). | Optional |
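
As an illustration of how the parameters in the table fit together, here is a minimal Python sketch that validates them and assembles the non-file form fields of a request. The helper name build_transcription_fields, the model ID "Whisper-Large-v3", the reading of 25MB as 25 × 1024 × 1024 bytes, and the encoding of stream_options as a JSON string are all assumptions made for this example; they are not confirmed by this page.

```python
import json
from pathlib import Path

# Limits and formats taken from the parameter table above.
# Assumption: "25MB" is interpreted as 25 * 1024 * 1024 bytes.
MAX_FILE_BYTES = 25 * 1024 * 1024
ALLOWED_EXTENSIONS = {
    ".flac", ".mp3", ".mp4", ".mpeg", ".mpga", ".m4a", ".ogg", ".wav", ".webm",
}


def build_transcription_fields(model, file_name, file_size, *, prompt=None,
                               response_format="json", language=None,
                               stream=False, stream_options=None):
    """Hypothetical helper: validate parameters and return form fields as strings."""
    if not model:
        raise ValueError("model is required")
    if Path(file_name).suffix.lower() not in ALLOWED_EXTENSIONS:
        raise ValueError(f"unsupported audio format: {file_name}")
    if file_size > MAX_FILE_BYTES:
        raise ValueError("file exceeds the 25 MB limit")
    if response_format not in ("json", "text"):
        raise ValueError("response_format must be 'json' or 'text'")

    # Required field plus the documented default for response_format.
    fields = {"model": model, "response_format": response_format}
    # Optional fields are only sent when the caller sets them.
    if prompt is not None:
        fields["prompt"] = prompt
    if language is not None:
        fields["language"] = language  # ISO 639-1 code, e.g. "en"
    if stream:
        fields["stream"] = "true"
        if stream_options is not None:
            # Assumption: the object is sent as a JSON-encoded string.
            fields["stream_options"] = json.dumps(stream_options)
    return fields


fields = build_transcription_fields(
    "Whisper-Large-v3",  # placeholder model ID, not confirmed by this page
    "meeting.wav",
    1_000_000,
    language="en",
)
print(fields)
```

The resulting dictionary could then be sent as form data, together with the audio file itself, in a multipart POST to the endpoint URL supplied by your system administrator.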