An ensemble of the best vocal models. The algorithm gives the highest possible quality for vocal and instrumental stems. The latest ensemble consists of BSRoformer, MelRoformer and SCNet XL vocal models.
Monthly usage: 4 962, Monthly rating: 3.7059 (17 votes)
This ensemble is based on the algorithm that took 2nd place in the Music Demixing Track of the Sound Demixing Challenge 2023. The main change compared to the contest version is much better individual stem models.
Monthly usage: 1 509, Monthly rating: 3.6667 (6 votes)
This is the Ensemble (vocals, instrum, bass, drums, other) with more models included, such as guitars, piano, back/lead vocals and drumsep.
Monthly usage: 2 493, Monthly rating: 3.8000 (10 votes)
The Demucs4 HT algorithm. It is fast and gives relatively good quality for bass/drums/other stems.
Monthly usage: 49 480, Monthly rating: 4.4765 (170 votes)
BS Roformer model. Excellent quality for vocals/instrumental separation.
Monthly usage: 51 034, Monthly rating: 4.6329 (158 votes)
Algorithm for separating tracks into vocal and instrumental parts based on the MelBand Roformer neural network.
Monthly usage: 31 592, Monthly rating: 4.7914 (187 votes)
A set of MDX23C models based on code released by kuielab for the Sound Demixing Challenge 2023. Very good for vocals/instrumental separation.
Monthly usage: 12 268, Monthly rating: 4.5610 (41 votes)
Algorithm for separating tracks into vocal and instrumental parts based on the SCNet neural network.
Monthly usage: 1 684, Monthly rating: 4.0000 (5 votes)
MDX B models are based on kuielab code from the Music Demixing Challenge 2021. The models were retrained by the UVR team on a large dataset. For a long time, these models were the best for vocals/instrumental separation.
Monthly usage: 2 821, Monthly rating: 4.5000 (8 votes)
A set of models from the Ultimate Vocal Remover program, which are based on the old VR architecture. Most of the models are vocal, but there are also special models for karaoke, piano, removing reverberation effects, etc.
Monthly usage: 13 056, Monthly rating: 4.7065 (92 votes)
Demucs4 Vocals 2023 model: a Demucs4 HT model fine-tuned on a large vocals dataset.
Monthly usage: 1 214, Monthly rating: 4.6667 (6 votes)
The MDX-B Karaoke model was prepared as part of the Ultimate Vocal Remover project. The model produces high-quality lead vocal extraction from a music track.
Monthly usage: 11 517, Monthly rating: 4.4048 (42 votes)
No data found
Monthly usage: 14 924, Monthly rating: 4.4700 (100 votes)
The MVSep Piano model is based on the MDX23C, MelRoformer and SCNet Large architectures. It produces high-quality separation into piano and other stems.
Monthly usage: 5 121, Monthly rating: 4.6875 (64 votes)
The MVSep Guitar model produces high-quality separation of music into a guitar part (including acoustic and electric) and everything else.
Monthly usage: 9 892, Monthly rating: 4.6235 (85 votes)
The MVSep Bass model produces high-quality separation of music into a bass part and everything else.
Monthly usage: 6 886, Monthly rating: 4.7736 (53 votes)
The MVSep Drums model produces high-quality separation of music into a drums part and everything else.
Monthly usage: 10 535, Monthly rating: 4.8947 (19 votes)
The MVSep Strings model is based on the MDX23C architecture and separates music into bowed string instruments and everything else.
Monthly usage: 3 613, Monthly rating: 4.4545 (22 votes)
The MVSep Wind model produces high-quality separation of music into a wind part and everything else.
Monthly usage: 4 012, Monthly rating: 4.4250 (40 votes)
The MVSep Organ model produces high-quality separation of music into an organ part and everything else.
Monthly usage: 284, Monthly rating: 4.0000 (2 votes)
The experimental VitLarge23 model is based on Vision Transformers. In terms of metrics, it is slightly inferior to MDX23C, but it may work better in some cases.
Monthly usage: 381, Monthly rating: 0 (0 votes)
A set of different models for removing reverberation effects from music.
Monthly usage: 7 310, Monthly rating: 4.5517 (29 votes)
A unique model for removing crowd sounds from music recordings (applause, clapping, whistling, noise, laughter, etc.).
Monthly usage: 5 004, Monthly rating: 4.5652 (46 votes)
No data found
Monthly usage: 1 793, Monthly rating: 5.0000 (1 votes)
BandIt Plus model for separating tracks into speech, music and effects.
Monthly usage: 1 844, Monthly rating: 2.2000 (5 votes)
Bandit v2 is a model for cinematic audio source separation into 3 stems: speech, music, effects/sfx. It was trained on the DnR v3 dataset.
Monthly usage: 1 226, Monthly rating: 3.2000 (5 votes)
MVSep DnR v3 is a cinematic model for splitting tracks into 3 stems: music, sfx and speech.
Monthly usage: 2 956, Monthly rating: 3.5000 (2 votes)
The DrumSep model divides the drums stem into 4 types: 'kick', 'snare', 'cymbals', 'toms'.
Monthly usage: 3 241, Monthly rating: 4.5714 (7 votes)
No data found
Monthly usage: 5 352, Monthly rating: 2.9362 (47 votes)
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation.
Monthly usage: 794, Monthly rating: 3.5000 (2 votes)
Medley Vox is an algorithm for separating multiple singers within a single music track, together with an evaluation dataset for this task.
Monthly usage: 4 535, Monthly rating: 3.3200 (25 votes)
MVSep Multichannel BS uses the best vocal model to extract sound from multi-channel audio (5.1, 7.1, etc.).
Monthly usage: 2 213, Monthly rating: 4.6667 (9 votes)
A model for separating male and female voices within a single vocal track. The track should contain only voices, no music.
Monthly usage: 3 577, Monthly rating: 2.6471 (17 votes)
The Demucs3 algorithm (A and B versions).
Monthly usage: 712, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 240, Monthly rating: 5.0000 (1 votes)
No data found
Monthly usage: 160, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 214, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 156, Monthly rating: 4.5000 (2 votes)
No data found
Monthly usage: 61, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 42, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 74, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 155, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 129, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 380, Monthly rating: 0 (0 votes)
The LarsNet model divides the drums stem into 5 types: 'kick', 'snare', 'cymbals', 'toms', 'hihat'.
Monthly usage: 493, Monthly rating: 5.0000 (20 votes)
MVSep MultiSpeaker (MDX23C): this model tries to isolate the loudest voice from all other voices.
Monthly usage: 608, Monthly rating: 3.0000 (4 votes)
The algorithm restores audio quality, for example for MP3 files compressed to 128 kbps or lower and for other types of compressed audio.
Monthly usage: 9 054, Monthly rating: 4.6667 (27 votes)
The algorithm adds a "whispering" effect to vocals.
Monthly usage: 588, Monthly rating: 0 (0 votes)
No data found
Monthly usage: 2 665, Monthly rating: 5.0000 (3 votes)
No data found
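The ensemble entries at the top of this list combine several vocal models into one result. The list does not say how the outputs are combined, so the following is only a minimal sketch of one common approach: a weighted average of each model's waveform estimate, with the instrumental taken as the mixture minus the averaged vocals. The function names, the weights, and the subtraction step are all assumptions made for illustration, not the method used by the ensembles described here.

```python
from typing import List, Sequence


def ensemble_average(estimates: Sequence[Sequence[float]],
                     weights: Sequence[float]) -> List[float]:
    """Weighted average of per-model waveform estimates, sample by sample.

    `estimates` holds one mono waveform per model (hypothetical inputs);
    `weights` holds one non-negative weight per model (assumed values).
    """
    if not estimates or len(estimates) != len(weights):
        raise ValueError("need exactly one weight per model estimate")
    length = len(estimates[0])
    if any(len(e) != length for e in estimates):
        raise ValueError("all estimates must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return [
        sum(w * e[i] for w, e in zip(weights, estimates)) / total
        for i in range(length)
    ]


def instrumental_from_vocals(mixture: Sequence[float],
                             vocals: Sequence[float]) -> List[float]:
    """Assumed convention: instrumental = mixture - vocals, per sample."""
    if len(mixture) != len(vocals):
        raise ValueError("mixture and vocals must have the same length")
    return [m - v for m, v in zip(mixture, vocals)]
```

In this sketch a model with a larger weight contributes more to the final vocal stem; equal weights reduce to a plain mean of the model outputs.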