You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Add support for MDXC models
* Updated poetry lockfile to match dependencies
* fix err: CLI does not work
* Fixed MDXC config YAML download, formatted mdxc separator class, bumped version ready for release
* Added progress bar for file downloads
* Added error handling for failed model load due to incomplete/corrupt download
* Fixed outstanding issues with YAML config loading and file download, added todo list for integration tests to write
* Moved load model into own method for consistency with mdxc class
* Refactored MDXC class to use more descriptive variable names, removed dead code, added debug logging and clearer parameters etc.
* Fixed and tested pitch shift logic for MDXC, added CLI params for other MDXC config parameters and tested these
* Added MDXC to readme
* Added thanks!
---------
Co-authored-by: Andrew Beveridge <andrew@beveridge.uk>
Summary: Easy to use audio stem separation from the command line or as a dependency in your own Python project, using the amazing MDX-Net and VR Arch models available in UVR by @Anjok07 & @aufr33.
8
+
Summary: Easy to use audio stem separation from the command line or as a dependency in your own Python project, using the amazing MDX-Net, VR Arch, Demucs and MDXC models available in UVR by @Anjok07 & @aufr33.
9
9
10
10
Audio Separator is a Python package that allows you to separate an audio file into various stems, using models trained by @Anjok07 for use with UVR (https://github.com/Anjok07/ultimatevocalremovergui).
11
11
@@ -136,8 +136,9 @@ Any file listed in the list models output can be specified (with file extension)
--demucs_stem DEMUCS_STEM stem to extract from audio file, e.g. Vocals, Drums, Bass, Other (default: All Stems). Example: --demucs_stem=vocals
188
188
--demucs_segment_size DEMUCS_SEGMENT_SIZE size of segments into which the audio is split, 1-100. higher = slower but better quality (default: Default). Example: --demucs_segment_size=256
189
189
--demucs_shifts DEMUCS_SHIFTS number of predictions with random shifts, higher = slower but better quality (default: 2). Example: --demucs_shifts=4
190
190
--demucs_overlap DEMUCS_OVERLAP overlap between prediction windows, 0.001-0.999. higher = slower but better quality (default: 0.25). Example: --demucs_overlap=0.25
--mdxc_segment_size MDXC_SEGMENT_SIZE larger consumes more resources, but may give better results (default: 256). Example: --mdxc_segment_size=256
195
+
--mdxc_use_model_segment_size use model default segment size instead of the value from the config file. Example: --mdxc_use_model_segment_size
196
+
--mdxc_overlap MDXC_OVERLAP amount of overlap between prediction windows, 2-50. higher is better but slower (default: 8). Example: --mdxc_overlap=8
197
+
--mdxc_batch_size MDXC_BATCH_SIZE larger consumes more RAM but may process slightly faster (default: 1). Example: --mdxc_batch_size=4
198
+
--mdxc_pitch_shift MDXC_PITCH_SHIFT shift audio pitch by a number of semitones while processing. may improve output for deep/high vocals. (default: 0). Example: --mdxc_pitch_shift=2
192
199
```
193
200
194
201
### As a Dependency in a Python Project
@@ -348,6 +355,7 @@ This project is licensed under the MIT [License](LICENSE).
348
355
- [Kuielab & Woosung Choi](https://github.com/kuielab) - Developed the original MDX-Net AI code.
349
356
- [KimberleyJSN](https://github.com/KimberleyJensen) - Advised and aided the implementation of the training scripts for MDX-Net and Demucs. Thank you!
350
357
- [Hv](https://github.com/NaJeongMo/Colab-for-MDX_B) - Helped implement chunks into the MDX-Net AI code. Thank you!
0 commit comments