Overview
Transcribe audio to text using OpenAI. Supports various models and optional timestamps.- Output Type: string
- Estimated Cost: 5 credits
- Handler: modal
Parameters
Required Parameters
Audio to transcribe (e.g., mp3, mp4, wav, m4a). Max filesize is 25MB.
- Label: Input audio
Optional Parameters
The OpenAI model to use for transcription. If timestamps are enabled, ‘whisper-1’ will be used regardless of this setting.
For the absolute best results, use gpt-4o-transcribe.
- Label: Transcription Model
- Options:
gpt-4o-transcribe
,gpt-4o-mini-transcribe
,whisper-1
Whether to include timestamps. If true, ‘whisper-1’ model will be used automatically.
- Label: Enable Timestamps
The granularity of the timestamps to include in the transcription.
- Label: Timestamp Granularity
- Options:
segment
,word
Optional text to guide the model’s style or improve recognition of specific words/acronyms.
- Label: Prompt