Lesson 54 of 1570
Making Music with Suno and Udio
Type a prompt, get a full song — vocals, drums, mix, even in Portuguese. Here's how Suno v5, Udio, and ElevenMusic work — and what they can't yet do.
Lesson map
What this lesson covers
Learning path
The main moves in order
- 1From a prompt to a song
- 2music generation
- 3Suno
- 4Udio
Concept cluster
Terms to connect while reading
Section 1
From a prompt to a song
Suno v5 (released late 2025) produces full songs — vocals, instruments, arrangement — from a short text prompt or lyrics in seconds. Udio competes at 48 kHz higher audio fidelity. ElevenLabs entered the market April 2026 with ElevenMusic, licensed-data-only, commercial-use-friendly. Stable Audio 2 is the open-source contender.
A minimal Suno-style prompt
A typical Suno prompt with style, tempo, mood, and structured lyrics using section tags.
STYLE: indie folk, acoustic guitar, soft harmonies, female lead vocal
TEMPO: 90 BPM
MOOD: bittersweet, nostalgic
LYRICS:
[Verse 1]
The light in your kitchen still looks the same / but you aren't there to turn it on
[Chorus]
I carry the summer / in a jar on my sill / the fireflies are sleeping still
LENGTH: 2:30Section tags that work
- [Intro], [Verse 1], [Pre-Chorus], [Chorus], [Bridge], [Outro].
- [Instrumental Break], [Guitar Solo], [Drum Fill].
- [Fade Out], [Key Change].
- Emotional tags inside lyrics: (whispered), (screamed), (backing vocals).
The three-model comparison
Compare the options
| Model | Strength | Weakness |
|---|---|---|
| Suno v5 | Largest platform, ~2M subscribers; fastest iteration; best vocal clarity. | Copyright claims pending (major labels lawsuit). |
| Udio v2 | 48 kHz audio fidelity; crisper mixes. | Smaller community; less iteration. |
| ElevenMusic (Apr 2026) | Licensed data only — safe for commercial use; great vocals. | Newer; fewer genres proven. |
| Stable Audio 2 | Open source; run locally; non-commercial license for the model. | Less polished; generally instrumental. |
What the tools do well
- Demos for songwriters — hear how your lyrics might sound.
- Custom jingles for ads, podcasts, YouTube intros.
- Royalty-friendly background music (especially ElevenMusic).
- Genre parody, karaoke versions, language dubs of existing lyrics.
- Mood music for games, films, meditation apps.
What they struggle with
- Long-form pieces with clear song structure (8+ minute prog rock).
- Specific named-artist mimicry — increasingly blocked.
- Perfect lyrical diction in rare languages.
- Complex production nuance — dynamic mixing, mastering.
Key terms in this lesson
End-of-lesson quiz
Check what stuck
15 questions · Score saves to your progress.
Tutor
Curious about “Making Music with Suno and Udio”?
Ask anything about this lesson. I’ll answer using just what you’re reading — short, friendly, grounded.
Progress saved locally in this browser. Sign in to sync across devices.
Related lessons
Keep going
Builders · 28 min
How Diffusion Models Actually Work
An AI that paints starts with pure noise and removes it, one step at a time, until a picture appears. Here's the surprisingly beautiful math behind it.
Builders · 32 min
Video AI — Sora, Veo, Runway, Kling
Text-to-video became practical in 2025 and cinematic in 2026. Here's the state of the art and how to choose.
Builders · 28 min
Voice Cloning — Power and Ethics
ElevenLabs can clone a voice from 30 seconds of audio. That's useful for accessibility — and dangerous in the wrong hands. Here's how to use it well.
