Question 1

Is my audio uploaded to a server?

Accepted Answer

No. All processing happens locally in your browser using WebAssembly. Your files never leave your device.

Question 2

What does "2-stems" mean?

Accepted Answer

Spleeter 2-stems separates audio into two tracks: vocals (singing, speech) and accompaniment (instruments, music). This is ideal for creating karaoke tracks or isolating vocals.

Question 3

How large can the input file be?

Accepted Answer

The model needs to process the entire file in memory. Files up to ~10 minutes work well on most devices. Longer files may be slower or require more RAM.

Question 4

What AI model is used?

Accepted Answer

We use Spleeter 2-stems (fp16), originally from Deezer, running via sherpa-onnx WASM. It is one of the most popular source separation models.

Question 5

Does it work with video files?

Accepted Answer

Yes! Upload a video and we automatically extract the audio, separate it, and provide two downloadable tracks.

🎶 Free AI Audio Source Separator

📖 How to Use

❓ Frequently Asked Questions

🔗 Related Tools

🚀 Need More Power?