You can use an input audio with multiple speakers and multiple genders. Each speaker will be separated into a fragment and from that fragment the gender will be identified. There is an example audio in this project with this case. It is in ./assets/example_audio_1.mp3 You can try it wi...
Morphosyntactic Features (n = 18), the proportion of pronouns and verb tenses were extracted using the stanza python library48, and customs NER models were trained using the same procedure as described in lexical features, in order to extract the values of present tense and third personal p...
librispeech is in beta version. --datasets metavoice librispeech Examples: Input from a file: #No enhancer is used python speech_dataset_generator/main.py --input_file_path /path/to/audio/file.mp3 --output_directory /output/directory --range_times 5-10 --datasets metavoice #Using deep...