We had to move to a new server due to lack of space on the old one. Positive effect - the video card has been changed to a more powerful one with more memory. As a result, the waiting queues have decreased and there are fewer errors associated with a lack of GPU memory. The downside is that server costs have doubled.
A new algorithm has been added Ultimate Vocal Remover (UVR). It splits the track into two parts, music and vocals. UVR usually does it better than spleeter. There are a lot of models and different settings in the original UVR. We have chosen one of the best models and optimal settings. Perhaps later, a flexible choice of settings for the algorithm will be added.
The winner of the Music Demuxing Challenge has finally released his code. We added its models to the site under the names Demux3 Model A and Demux3 Model B. Demux3 Model B gives a better result, and works better for bass and drums comparing to other models, but is slightly inferior in vocals to the MDX-B algorithm.
Below is an updated table comparing the quality of algorithms (data for UVR are not available). The values in the table are calculated on private Music Demuxing Challenge dataset (available only to organizers). The higher the value, the better the algorithm works.
|Algorithm||Quality (Bass)||Quality (Drums)||Quality (Other)||Quality (Vocals)||Example|
|Spleeter (4 stems)||5.774||5.845||4.321||6.939||Example|
|MDX B (Orig)||7.495||7.554||5.533||8.896||---|
|MDX B (UVR)||7.495||7.554||5.533||9.482||Example|
|Ultimate Vocal Remover HQ||---||---||---||---||Example|
|Demucs 3 Model A||8.115||8.037||5.193||7.968||Example|
|Demucs 3 Model B||8.856||8.850||5.978||8.756||Example|