Ensemble of best vocal models. Algorithm gives the highest possible quality for vocal and instrumental stems.
Quality table
Algorithm name | Multisong dataset | Synth dataset | MDX23 Leaderboard | ||
SDR Vocals | SDR Instrumental | SDR Vocals | SDR Instrumental | SDR Vocals | |
Ensemble (2023.09) (UVR-MDX-NET-Voc_FT, Demucs4 Vocals 2023, MDX23C, VitLarge23) |
10.44 | 16.74 | 12.76 | 12.46 | 11.17 |
Ensemble (2024.02) (BS Roformer (v1), MDX23C, VitLarge23) |
10.75 | 17.06 | 12.72 | 12.42 | --- |
Ensemble (2024.03) (BS Roformer (viperx), MDX23C) |
11.06 | 17.37 | 13.00 | 12.70 | --- |
Ensemble (2024.04) (BS Roformer (finetuned), MDX23C) |
11.33 | 17.63 | 13.57 | 13.27 | --- |
Ensemble (2024.08) (BS Roformer (finetuned), MelBand Roformer) |
11.50 | 17.81 | 13.79 | 13.50 | --- |