1) ViperX released his weights for the BS Roformer model, which separates a track into vocal and instrumental parts. The separation quality is currently the best available in the world. We added these weights to MVSep. SDR metrics improved compared to our own BS Roformer model.
Multisong dataset:
SDR vocals changed: 10.43 -> 10.87
SDR instrumental changed: 16.73 -> 17.17
Synth dataset:
SDR vocals changed: 12.45 -> 12.76
SDR instrumental changed: 12.16 -> 12.46
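For readers unfamiliar with the metric, here is a minimal sketch of how an SDR value (in dB, higher is better) can be computed. It assumes the common definition, 10 * log10 of reference energy over error energy. The exact evaluation code used on MVSep is not given in this post and may differ. The function name `sdr` and the sample values are illustrative only.

```python
import math


def sdr(reference, estimate, eps=1e-9):
    """Signal-to-distortion ratio in dB.

    Assumed definition (not taken from this post):
        10 * log10(sum(ref^2) / sum((ref - est)^2))
    `eps` avoids division by zero for silent or perfectly matched signals.
    """
    if len(reference) != len(estimate):
        raise ValueError("reference and estimate must have the same length")
    signal_energy = sum(r * r for r in reference)
    error_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    return 10.0 * math.log10((signal_energy + eps) / (error_energy + eps))


# Toy mono signals (illustrative values, not real audio).
reference = [0.5, -0.5, 0.25, -0.25]
estimate = [0.45, -0.55, 0.30, -0.20]
print(round(sdr(reference, estimate), 2))
```

Because SDR is logarithmic, the change from 10.43 to 10.87 on Multisong vocals is a gain of 0.44 dB.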
2) Based on the new ViperX model, we updated our Ensemble algorithms:
Ensemble (vocals, instrum) on Multisong dataset:
SDR vocals: 10.75 -> 11.06
SDR instrum: 17.06 -> 17.37
Ensemble (vocals, instrum) on Synth dataset:
SDR vocals: 12.76 -> 13.00
SDR instrum: 12.46 -> 12.70
Ensemble (vocals, instrum, bass, drums, other):
SDR vocals: 10.75 -> 11.06
SDR instrum: 17.06 -> 17.37
SDR bass: 12.53 -> 12.57
SDR drums: 11.84 -> 11.94
SDR other: 7.15 -> 7.22
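This post does not say how the Ensembles combine their models. As a rough illustration only, one common approach is a weighted average of the waveforms that several models predict for the same stem. The sketch below assumes that approach. The function name `weighted_average` and the weights are hypothetical, not MVSep's actual settings.

```python
def weighted_average(estimates, weights):
    """Blend several estimates of one stem, sample by sample.

    `estimates` is a list of equal-length sample lists (one per model);
    `weights` holds one non-negative weight per model.
    This is an assumed ensembling scheme, not MVSep's documented method.
    """
    if len(estimates) != len(weights):
        raise ValueError("need exactly one weight per estimate")
    if len({len(e) for e in estimates}) != 1:
        raise ValueError("all estimates must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return [
        sum(w * e[i] for w, e in zip(weights, estimates)) / total
        for i in range(len(estimates[0]))
    ]


# Two hypothetical vocal estimates; the first model is trusted 3x more.
model_a = [1.0, 0.0]
model_b = [0.0, 1.0]
blended = weighted_average([model_a, model_b], weights=[3, 1])
```

Averaging tends to help when the models make different errors, which is one plausible reason an ensemble can score a higher SDR than its best single member.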
3) We added more functionality to our MVSep API for developers.