Music Creation
Create original music, beats, and sounds from text descriptions using AI. Any genre, any style.
POST a prompt to /generation, poll the job, download the finished MP3 or WAV. Everything the TwoShot web app can do, exposed as REST.
The song-creation endpoint is the single most-used tool in the TwoShot API. You pass a prompt (plus optional lyrics, genre, tempo, key, style references) and get back a full song — vocals, instrumentation, arrangement, intro and outro included — typically in 60 to 120 seconds. It's the same pipeline the web app uses, so output quality matches what users see in Coproducer. Most integrations use it as the backend for a consumer app: a user types a description in your UI, you POST to TwoShot, poll until done, and hand back a pre-signed URL for playback. The shape is intentionally simple — one endpoint, one job object, one result URL — so your app layer stays thin. Advanced parameters (reference tracks, voice cloning, duration overrides) are available but optional. For high-volume integrations, we run per-tenant capacity so your jobs don't queue behind other users. Pricing is per-credit, and credits map to compute-time. A typical 3-minute song costs roughly the same whether you run 1 a day or 10,000 — the system scales horizontally.
Read docs.twoshot.app for the full REST reference. POST /generation with {modelId, inputs} and poll /generation/{jobId}.
First tool call opens a browser to sign you in with your TwoShot account — no separate API key.
Free-form text. 'Upbeat pop song about summer road trips, 130 BPM, female vocals' works fine. Advanced: pass structured fields (genre, tempo, key, mood, duration) as separate inputs for more predictable output.
Yes — include a lyrics parameter in the inputs. The model will compose melody and arrangement around your words.
Typical 2-3 minute song: 60-120 seconds from POST to finished file. Longer songs scale roughly linearly.
Not yet — jobs return the complete file when done. For most consumer apps the wait is short enough to show a simple progress bar. Streaming is on the roadmap.
MP3 320 by default; WAV 24-bit on request via the outputFormat parameter. Stems (vocals, instrumental) available as a separate request against the finished job ID.
Don't use the TwoShot API regularly? The full make a song experience is a free page on TwoShot — same models, no setup.
Everything you need to create, transform, and perfect your audio, images, and video
Create original music, beats, and sounds from text descriptions using AI. Any genre, any style.
Create stunning visuals, album covers, thumbnails, and art from text descriptions. Edit and upscale existing images.
Create videos from text or images. Animate photos, create music videos, and produce motion content for social media.
Text-to-speech, voice enhancement, and vocal transformation.
Isolate vocals, drums, bass, and instruments from any track in seconds.
Remove background noise, upscale images, enhance video quality, and polish your media.
Generate custom SFX and foley for games, videos, and podcasts.
Remix tracks in new styles or extend songs seamlessly with AI.
Cowrite lyrics and scripts - draft, refine, and iterate together until every line is right.
Arrange, compose, and produce directly in your browser. Audio, video, images — all in one workspace.
200,000+ royalty-free sounds and samples ready for commercial use.
AI tools for music, video, images, and voice
Turn ideas into tracks faster. Create beats, sounds, and full productions with AI assistance.
Complete video production with AI. Generate videos, images, music, and voiceovers.
Studio-quality audio from any recording. Clean up interviews, enhance voices, and add music.
Production-ready AI for audio, video, and visuals. Full rights clearance, API access, team collaboration.
Transform your creative ideas into tangible sounds with our AI powered tools. Simply describe what you want - "fast drum & bass jungle-style drum loop" or "layered flutes inspired by nature" - and see the magic unfold.
Create stunning visuals from text descriptions. Design album covers, thumbnails, portraits, and art — all through conversation.
Create videos from text or images. Animate photos, produce music videos, and make motion content for social media.
Upload any photo and watch it move. AI-powered motion control turns still images into dynamic dance videos and animations.
Change backgrounds, remove objects, upscale resolution, and edit images through simple conversation. No Photoshop needed.
A creative partner for lyrics and scripts. Get a draft, then go back and forth - refine lines, try new angles, iterate together until it's exactly what you envisioned.
Text-to-speech, voice enhancement, and vocal transformation. Create professional voiceovers in any style or voice.
Isolate vocals, drums, bass, and instruments from any track in seconds. Perfect for remixing, sampling, or creating karaoke versions.
Remove background noise, upscale images, enhance video quality, and polish your media.
Generate custom sound effects and foley for games, videos, and podcasts. From explosions to footsteps, create exactly what you need.
Leverage the power of our AI to reimagine existing samples. Extract particular elements from a sample, or create a completely new sample based on a reference.
Arrange, compose, and produce directly in your browser with our online DAW. Drag and drop samples, add effects, and export your creations.




Explore our library of 200,000+ royalty-free samples. From old-school chops to hyper-pop melodies - chat naturally with stem to find exactly what you need.
From Grammy-winning producers to major labels, see who's creating with TwoShot



