We have a lot of updates. First of all we redid the site from scratch. It has new features like user registration, more informative pages, better design etc. But also we added set of new algorithms:
1) We have released MDX23C models and made update for them. One of models reached 10 SDR on multisong dataset. Currently it's best single models for separation of vocals/instrumental.
2) We added new algorithm Demucs4 Vocals 2023. It's algorithm demucsht_ft but finetuned on big dataset. Metrics are better than for original, but slightly worse than MDX23C. On some melodies it can give more cleaner results.
3) We added new Ensemble algorithms. First is "Ensemble 4 models (vocals, instrum)". It includes: UVR-MDX-NET-Voc_FT, Demucs4 Vocals 2023 and two MDX23C models. Algorithm gives the highest possible quality for vocal and instrumental stems. Also if you need more detailed separation including 3 more stems "bass", "drums", "other" you can use "Ensemble 8 models (vocals, bass, drums, other)". This ensemble gives state of art results for 4 stems separation.
You can find comparison tables below (larger SDR is better).
Algorithm name | Multisong dataset | Synth dataset | MDX23 Leaderboard | ||
SDR Vocals | SDR Instrumental | SDR Vocals | SDR Instrumental | SDR Vocals | |
Ensemble of 4 models | 10.18 | 16.48 | 12.25 | 11.95 | 10.95 |
MDX23C, 8K FFT, Full Band | 10.01 | 16.32 | 12.07 | 11.77 | 10.85 |
UVR-MDX-NET-Voc_FT | 9.64 | 15.95 | 11.40 | 11.10 | 10.50 |
Demucs4 HT Vocals 2023 | 9.04 | 15.35 | 11.59 | 11.29 | 9.61 |
Demucs4 HT default (htdemucs_ft) | 8.33 | 14.63 | 10.23 | 9.94 | 9.08 |
Algorithm name | Multisong dataset | ||||
SDR Bass | SDR Drums | SDR Other | SDR Vocals | SDR Instrumental | |
Ensemble of 8 models | 12.52 | 11.73 | 6.93 | 10.17 | 16.48 |
Demucs 4 HT default (htdemucs_ft) | 12.05 | 11.24 | 5.74 | 8.33 | 14.63 |