Overview

Generate music from text prompt using Ace Step model.
  • Output Type: audio
  • Base Model: musicgen
  • Estimated Cost: 0.2 * duration * (number_of_steps/60) credits
  • Handler: replicate

Parameters

Required Parameters

tags
string
required
Text prompts to guide music generation (e.g., genre, mood, instruments, BPM). Supports tags, descriptions, and scenes. Use commas to separate different tags. E.g., ‘synth-pop, electronic, pop, synthesizer, drums, bass, piano, 128 BPM, energetic, uplifting, modern’
  • Label: Tags (Prompt)

Optional Parameters

lyrics
string
default:"[inst]"
Optional lyrics for the music. Can be multi-line. Use [verse], [chorus], and [bridge] to separate different parts of the lyrics. Use [instrumental] or [inst] to generate instrumental music.
  • Label: Lyrics
duration
integer
default:30
Desired duration of the generated audio in seconds.
  • Label: Duration
  • Minimum: 1
  • Maximum: 240
seed
integer
default:"random"
Seed for the generation. Set for reproducibility. If blank, a random value will be used.
  • Label: Seed
  • Minimum: 0
  • Maximum: 2147483647
number_of_steps
integer
default:60
Number of inference steps. (More can improve quality but will increase cost/latency.)
  • Label: Number Of Steps
  • Minimum: 10
  • Maximum: 200
granularity_scale
float
default:10
Omega scale for APG guidance (granularity scale), or similar for other CFG types. Higher values can reduce artifacts.
  • Label: Granularity Scale
  • Minimum: -100.0
  • Maximum: 100.0
guidance_interval
float
default:0.5
Guidance interval.
  • Label: Guidance Interval
  • Minimum: 0.0
  • Maximum: 1.0
guidance_interval_decay
float
default:0
Guidance interval decay.
  • Label: Guidance Interval Decay
  • Minimum: 0.0
  • Maximum: 1.0
guidance_scale
float
default:15
Overall guidance scale. Controls how much the generation adheres to the prompt. Note that when lyric_guidance_scale > 1 and tag_guidance_scale > 1, this global guidance scale might not be applied.
  • Label: Guidance Scale
  • Minimum: 0.0
  • Maximum: 30.0
min_guidance_scale
float
default:3
Minimum guidance scale.
  • Label: Min Guidance Scale
  • Minimum: 0.0
  • Maximum: 100.0
tag_guidance_scale
float
default:0
Guidance scale for tags (text prompt). This is the specific guidance scale for the text component and typically applies to CFG. A suggested starting point could be around 5.0 (especially if lyric guidance is around 1.5).
  • Label: Tag Guidance Scale
  • Minimum: 0.0
  • Maximum: 10.0
lyric_guidance_scale
float
default:0
Specific guidance scale for the lyric component.
  • Label: Lyric Guidance Scale
  • Minimum: 0.0
  • Maximum: 10.0