Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding feature to exclude group of information during export #28

Open
HarikalarKutusu opened this issue May 7, 2022 · 1 comment
Open

Comments

@HarikalarKutusu
Copy link
Contributor

HarikalarKutusu commented May 7, 2022

Is it possible to implement optional "--exclude-xxx fn" flags to exclude recordings during cv export?

--exclude-voices voices.txt            // E.g. to measure the effect of a single person recording too much
--exclude-sentences sentences.txt             // E.g. to exclude reported sentences
--exclude-gender [male|female|other|empty]             // E.g. to train with male voices and test with female voices
etc

That would very much ease any experiments on biasing effects.

PS: The correct place to implement these would be CorporaCreator but it is not actively maintained as you know.

Similar can be implemented for opus corpora.

Bülent

@ftyers
Copy link
Owner

ftyers commented Sep 2, 2022

That's an interesting idea. I guess I would probably try and do it in a different way and have some kind of configuration file to avoid overloading the commandline. If you give some concrete examples I can think how to implement it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants