Algorithms

MVSep Pedal Steel Guitar

Pedal steel guitar is a type of electric guitar mounted on a special horizontal stand (console). The musician plays it while seated, using not only their hands, but also their feet and knees.

Key features of the instrument:

Pedal and lever mechanism: This is the instrument’s main distinguishing feature. A system of foot pedals and knee levers mechanically changes the tension (and pitch) of specific strings during performance. This allows the player to perform complex harmonic transitions and change chords without moving the hand along the neck.
Playing technique: A pedal steel guitar has no traditional frets to press with the fingers. With the left hand, the guitarist smoothly glides a heavy metal bar (slide or “steel”) across the strings, creating the characteristic sliding effect (glissando). The right hand plucks the strings using a plastic thumb pick and metal finger picks on the remaining fingers.
Necks and strings: Professional models often feature two parallel necks with different tunings (for example, E9 for country music and C6 for jazz and swing). Each neck usually has 10 or 12 strings.

The instrument is famous for its sustained, singing, and often “crying” tone with very long sustain. Historically and culturally, the pedal steel guitar is strongly associated with country music and western swing. However, thanks to its unique expressive capabilities, today it can be heard in a wide variety of genres, from jazz and Hawaiian music to ambient, indie rock, and pop music.

Algorithm name	Wind dataset
Algorithm name	SDR Wind	SDR Other
MelBand Roformer	6.73	16.10
SCNet Large	6.76	16.13
MelBand + SCNet Ensemble	7.22	16.59
MelBand + SCNet Ensemble (+extract from Instrumental)	---	---
BS Roformer	9.82	19.19

Algorithm name	DnR dataset (test)
Algorithm name	SDR Speech	SDR Music	SDR Effects
BandIt Plus	15.62	9.21	9.69

Algorithm name	SDR Metric on DnR v3 leaderboard
	music (SDR)	sfx (SDR)	speech (SDR)
SCNet Large	9.94	11.35	12.59
Mel Band Roformer	9.45	11.24	12.27
Ensemble (Mel + SCNet)	10.15	11.67	12.81
Bandit v2 (for reference)	9.06	10.82	12.29

Author	Architecture	Works with	SDR (no independent testing yet)	Link
FoxJoy	MDX-B	Full track	~6.50
aufr33 and jarredou	MDX23C	Full track	---	Github
anvuew	MelRoformer	Only vocals	7.56
anvuew	BSRoformer	Only vocals	8.07
anvuew v2	MelRoformer	Only vocals	---
Sucial	MelRoformer	Only vocals	10.01
anvuew	BSRoformer	Only vocals (Room)	13.74	HF Link
anvuew	BSRoformer	Only vocals (Stereo)	22.50	HF Link

Algorithms

MVSep Pedal Steel Guitar

MVSep Plucked Strings (plucked-strings, other)

MVSep Bowed Strings (strings, other)

MVSep Wind (wind, other)

MVSep Brass (brass, other)

MVSep Woodwind (woodwind, other)

MVSep Bagpipes (bagpipes , other)

MVSep Percussion (percussion, other)

BandIt Plus (speech, music, effects)

BandIt v2 (speech, music, effects)

MVSep DnR v3 (speech, music, effects)

MVSep Braam

MVSep Risers

Apollo Enhancers (by JusperLee, Lew, baicai1145)

Reverb Removal (noreverb)

Validation dataset 80 tracks: reverb for all stems

Validation dataset 27 tracks: reverb for vocals only

What does reverberation consist of?

Main parameters in reverb plugins

Why is reverb necessary in music mixing?

Why is it necessary to remove the reverb effect?

AudioSR (Super Resolution)

FlashSR (Super Resolution)

Stable Audio Open Gen

Whisper (extract text from audio)

Parakeet (extract text from audio)

Parakeet v2 (Parakeet TDT 0.6B v2)

Parakeet v3 (Parakeet TDT 0.6B v3)

VibeVoice (Voice Cloning)

Key features:

How to use the model

How to generate a reference track?

Option 1: Universal (Balanced & Clear)

Option 2: Conversational (Vlog & Social Media)

Option 3: Professional (Business & Narration)

Tips for recording:

VibeVoice (TTS)

Key Features:

How to use the model

Correct format:

Incorrect format:

Example scenarios:

Qwen3-TTS (Custom Voice)

Qwen3-TTS (Voice Design)

Qwen3-TTS (Voice Cloning)

Mega 53-stem Model

Bark (Speech Gen)

Instructions for coding emotions and sounds in Bark

1. Basic Principle

2. List of supported tags (Non-speech sounds)

3. Generating singing and music

4. Pauses and Intonation (Prosody)

5. Important nuances of operation (Disclaimer)

MVSep MultiSpeaker (MDX23C)

Aspiration (by Sucial)

Phantom Centre extraction

What is the Phantom Center in audio engineering?

Why is the phantom center so important in mixing?

Site information

Company

Extra