Ensemble of best vocal models. Algorithm gives the highest possible quality for vocal and instrumental stems. The latest ensemble consists of BSRoformer, MelRoformer and SCNet XL vocal models.
Quality table
Algorithm name | Multisong dataset | Synth dataset | MDX23 Leaderboard | ||
SDR Vocals | SDR Instrumental | SDR Vocals | SDR Instrumental | SDR Vocals | |
Ensemble (2023.09) (UVR-MDX-NET-Voc_FT, Demucs4 Vocals 2023, MDX23C, VitLarge23) |
10.44 | 16.74 | 12.76 | 12.46 | 11.17 |
Ensemble (2024.02) (BS Roformer (v1), MDX23C, VitLarge23) |
10.75 | 17.06 | 12.72 | 12.42 | --- |
Ensemble (2024.03) (BS Roformer (viperx), MDX23C) |
11.06 | 17.37 | 13.00 | 12.70 | --- |
Ensemble (2024.04) (BS Roformer (finetuned), MDX23C) |
11.33 | 17.63 | 13.57 | 13.27 | --- |
Ensemble (2024.08) (BS Roformer (finetuned), MelBand Roformer) |
11.50 | 17.81 | 13.79 | 13.50 | --- |
Ensemble (2024.12) (BS Roformer (finetuned), MelBand Roformer, SCNet XL) |
11.61 | 17.92 | 14.09 | 13.79 | --- |