Call the TwoShot REST API with an admin bearer token: stem-separate any audio source URL, poll the job, and download all four stems. It's the same infrastructure TwoShot uses internally.
If you're building something that processes audio at scale (a transcription pipeline, a karaoke app, a music-search product, a podcast editor), you can call the same stem separation endpoint TwoShot uses internally. You authenticate with an admin bearer token, POST the audio URL plus the model ID, and get back a job that you poll until it's done. Every stem lives on CDN-backed storage, and downloads are pre-signed URLs you can hand straight to your user. The infrastructure handles concurrency, retries, and GPU scheduling, so your code stays thin.
The quality target is broadcast-usable: vocals clean enough to drop into a karaoke app without noticeable bleed, drums isolated sharply enough to sample, and bass kept coherent even on tracks with layered 808s. The API runs the same model tier as the web app, so if the browser output works for your use case, the API output will be identical.
Most integrations follow the same shape: receive audio from your user, POST it to TwoShot, and return download links for the stems about 30 seconds later. The API returns per-stem URLs for vocals, drums, bass, and 'other', plus an instrumental-only pass if you request it. For high-volume operations (thousands of concurrent jobs), ping us before you ship so we can pre-warm capacity on your tenant.
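The submit-and-poll flow described above could be sketched roughly as follows. This is a minimal illustration, not the documented API: the base URL, the `/stems` path, and the `audioUrl`, `status`, `id`, and `stems` field names are all assumptions made for the example. Only `modelId` and the completed/failed job states are named on this page. Check docs.twoshot.app for the real request shape.

```python
import json
import time
import urllib.request

# Placeholder host: NOT the real TwoShot API base URL.
API_BASE = "https://api.example.invalid"


def http_json(method, url, token, body=None):
    """Send a JSON request with a bearer token and decode the JSON reply."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    req = urllib.request.Request(url, data=data, method=method)
    req.add_header("Authorization", f"Bearer {token}")
    req.add_header("Content-Type", "application/json")
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def separate_stems(audio_url, model_id, token, *, request=http_json,
                   poll_interval=2.0, timeout=300.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Submit a stem-separation job and poll until it completes or fails.

    Endpoint paths and JSON field names here are hypothetical.
    Returns the mapping of stem name to download URL.
    """
    job = request("POST", f"{API_BASE}/stems", token,
                  {"audioUrl": audio_url, "modelId": model_id})
    deadline = clock() + timeout
    while True:
        status = job.get("status")
        if status == "completed":
            return job["stems"]
        if status == "failed":
            raise RuntimeError(f"stem job {job.get('id')} failed")
        if clock() >= deadline:
            raise TimeoutError(f"stem job {job.get('id')} did not finish in time")
        sleep(poll_interval)
        job = request("GET", f"{API_BASE}/stems/{job['id']}", token)
```

The `request`, `sleep`, and `clock` parameters exist only so the loop can be exercised without network access. A real integration would call `separate_stems(url, model_id, token)` from server-side code and pass the returned URLs on to the user.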
There isn't a hard limit on how many jobs you can run: pricing is per-credit, and the system scales horizontally. If you're doing high volume (thousands of concurrent jobs), ping the team before you ship so we can pre-warm capacity.
The input is either an HTTPS URL we can fetch, or a pre-uploaded audio ID from /audio/upload. URL-based jobs are faster to start; upload-based jobs avoid failures when the source domain rate-limits or blocks server-to-server fetches.
Authentication uses admin bearer tokens only. Request one from the team with your intended use case. Do not hand out admin tokens to clients: use them server-side and expose your own scoped surface.
Typical 3-4 minute track: 20-40 seconds end-to-end from POST to finished files. Longer tracks (10 minutes) take proportionally longer. The job is queued if GPU capacity is saturated; you can subscribe to webhooks for async notification.
Stems are delivered as 44.1 kHz/16-bit WAV by default. Request MP3 or FLAC via the output format parameter. Original bit depth is preserved where the source allows it.
The default model returns vocals/drums/bass/other (the classic 4-stem split). For a finer breakdown into guitar/keys/strings, use the advanced multi-stem model by setting modelId to the premium variant. It costs more credits and runs slightly slower.
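The input, output-format, and model choices above all end up in the job request body. A small payload builder might look like the sketch below. The field names `audioUrl`, `audioId`, and `outputFormat` are hypothetical, since this page only names `modelId` and "the output format parameter". The model ID strings in the usage line are placeholders too.

```python
# Formats named on this page; WAV is the stated default.
ALLOWED_FORMATS = {"wav", "mp3", "flac"}


def build_job_payload(model_id, *, audio_url=None, audio_id=None,
                      output_format=None):
    """Build a stem-job request body (field names are assumptions).

    Exactly one of audio_url (an HTTPS URL the service can fetch) or
    audio_id (an ID returned by /audio/upload) must be given.
    """
    if (audio_url is None) == (audio_id is None):
        raise ValueError("provide exactly one of audio_url or audio_id")

    payload = {"modelId": model_id}
    if audio_url is not None:
        if not audio_url.startswith("https://"):
            raise ValueError("audio_url must be an HTTPS URL")
        payload["audioUrl"] = audio_url
    else:
        payload["audioId"] = audio_id

    # Leave the format out entirely to get the default WAV stems.
    if output_format is not None:
        fmt = output_format.lower()
        if fmt not in ALLOWED_FORMATS:
            raise ValueError(f"unsupported output format: {output_format}")
        payload["outputFormat"] = fmt
    return payload
```

For example, `build_job_payload("premium-model-id", audio_id="abc123", output_format="FLAC")` would request FLAC stems from a pre-uploaded file using whatever the premium model's real ID turns out to be.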
Per-credit pricing scales with duration. A 3-minute track costs roughly X credits (see docs.twoshot.app for current rates). Bulk commitments available for enterprise volume.
Official SDKs in TypeScript and Python are published in the docs; community wrappers exist for Go and Ruby. The REST API is straightforward enough that most integrations just use fetch/httpx directly.
Pass a callback URL in the job POST; we'll hit it when the job transitions to completed or failed. Each callback carries an HMAC signature, so you can verify it came from us.
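Verifying a webhook signature on your side could look something like this. It is a sketch under assumptions: the page does not say which hash, encoding, or header TwoShot uses, so HMAC-SHA256 with a hex digest over the raw request body is a guess here, and the shared secret is whatever the team issues you.

```python
import hashlib
import hmac


def verify_webhook(raw_body: bytes, signature_hex: str, secret: bytes) -> bool:
    """Return True if signature_hex matches HMAC-SHA256(secret, raw_body).

    Assumes a hex-encoded SHA-256 HMAC over the raw body. Confirm the
    actual scheme in the TwoShot docs before relying on this.
    """
    expected = hmac.new(secret, raw_body, hashlib.sha256).hexdigest()
    received = signature_hex.strip().lower()
    # Constant-time comparison avoids leaking how many characters matched.
    return hmac.compare_digest(expected.encode("utf-8"),
                               received.encode("utf-8"))
```

Compute the check over the exact bytes received, before any JSON parsing, and reject the callback if it returns False.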
The full stem-separation experience is also available as a free page on TwoShot: same models, no setup.
Everything you need to create, transform, and perfect your audio, images, and video
Create original music, beats, and sounds from text descriptions using AI. Any genre, any style.
Create stunning visuals, album covers, thumbnails, and art from text descriptions. Edit and upscale existing images.
Create videos from text or images. Animate photos, create music videos, and produce motion content for social media.
Text-to-speech, voice enhancement, and vocal transformation.
Isolate vocals, drums, bass, and instruments from any track in seconds.
Remove background noise, upscale images, enhance video quality, and polish your media.
Generate custom SFX and foley for games, videos, and podcasts.
Remix tracks in new styles or extend songs seamlessly with AI.
Cowrite lyrics and scripts - draft, refine, and iterate together until every line is right.
Arrange, compose, and produce directly in your browser. Audio, video, images — all in one workspace.
200,000+ royalty-free sounds and samples ready for commercial use.
AI tools for music, video, images, and voice
Turn ideas into tracks faster. Create beats, sounds, and full productions with AI assistance.
Complete video production with AI. Generate videos, images, music, and voiceovers.
Studio-quality audio from any recording. Clean up interviews, enhance voices, and add music.
Production-ready AI for audio, video, and visuals. Full rights clearance, API access, team collaboration.
Transform your creative ideas into tangible sounds with our AI-powered tools. Simply describe what you want - "fast drum & bass jungle-style drum loop" or "layered flutes inspired by nature" - and see the magic unfold.
Create stunning visuals from text descriptions. Design album covers, thumbnails, portraits, and art — all through conversation.
Create videos from text or images. Animate photos, produce music videos, and make motion content for social media.
Upload any photo and watch it move. AI-powered motion control turns still images into dynamic dance videos and animations.
Change backgrounds, remove objects, upscale resolution, and edit images through simple conversation. No Photoshop needed.
A creative partner for lyrics and scripts. Get a draft, then go back and forth - refine lines, try new angles, iterate together until it's exactly what you envisioned.
Text-to-speech, voice enhancement, and vocal transformation. Create professional voiceovers in any style or voice.
Isolate vocals, drums, bass, and instruments from any track in seconds. Perfect for remixing, sampling, or creating karaoke versions.
Generate custom sound effects and foley for games, videos, and podcasts. From explosions to footsteps, create exactly what you need.
Leverage the power of our AI to reimagine existing samples. Extract particular elements from a sample, or create a completely new sample based on a reference.
Arrange, compose, and produce directly in your browser with our online DAW. Drag and drop samples, add effects, and export your creations.
Explore our library of 200,000+ royalty-free samples. From old-school chops to hyper-pop melodies - chat naturally with stem to find exactly what you need.
From Grammy-winning producers to major labels, see who's creating with TwoShot