Transcription

Overview
Parameters
Required Parameters
Optional Parameters

Overview

Transcribe audio to text using OpenAI. Supports various models and optional timestamps.

Output Type: string
Estimated Cost: 5 credits
Handler: modal

Parameters

Required Parameters

audio

required

Audio to transcribe (e.g., mp3, mp4, wav, m4a). Max filesize is 25MB.

Label: Input audio

Optional Parameters

model

string

default:"gpt-4o-mini-transcribe"

The OpenAI model to use for transcription. If timestamps are enabled, ‘whisper-1’ will be used regardless of this setting.

For the absolute best results, use gpt-4o-transcribe.

Label: Transcription Model
Options: gpt-4o-transcribe, gpt-4o-mini-transcribe, whisper-1

use_timestamps

boolean

default:false

Whether to include timestamps. If true, ‘whisper-1’ model will be used automatically.

Label: Enable Timestamps

timestamp_granularity

string

default:"segment"

The granularity of the timestamps to include in the transcription.

Label: Timestamp Granularity
Options: segment, word

prompt

string

Optional text to guide the model’s style or improve recognition of specific words/acronyms.

Label: Prompt

Zonos Weather

Image Generation

Video Generation

Audio Generation

Text & Data Tools

Model Training

Other Tools

Overview

Parameters

Required Parameters

Optional Parameters

Image Generation

Video Generation

Audio Generation

Text & Data Tools

Model Training

Other Tools

​Overview

​Parameters

​Required Parameters

​Optional Parameters

Overview

Parameters

Required Parameters

Optional Parameters