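A transcription is provided for each clip. Clips range from 1 to 20 seconds in length, and the total length is shown in the list. The texts were published between 1884 and 1964 and are in the public domain. The audio was recorded by the LibriVox project and is also in the public domain, except for Ukrainian. The Ukrainian audio was provided by Nash Format or Gwara Media and is for machine learning use only. Dataset address: M-AILABS Speech Dataset | speech recognition dataset | speech synthesis dataset...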
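The stated 1 to 20 second clip range can be checked on a local copy of the data. The following is a minimal sketch, assuming the clips are stored as uncompressed WAV files (the description does not say so); the function and constant names are hypothetical and not part of the dataset or any tooling it ships with.

```python
import wave

# Bounds taken from the description: clips run from 1 to 20 seconds.
MIN_CLIP_SECONDS = 1.0
MAX_CLIP_SECONDS = 20.0


def clip_duration_seconds(source):
    """Return the duration of a WAV clip in seconds.

    `source` may be a file path or a binary file object.
    Assumption: the clip is an uncompressed WAV file.
    """
    with wave.open(source, "rb") as wav:
        # Duration = number of audio frames / frames per second.
        return wav.getnframes() / wav.getframerate()


def is_expected_length(seconds, min_s=MIN_CLIP_SECONDS, max_s=MAX_CLIP_SECONDS):
    """Return True when a duration falls inside the stated range (inclusive)."""
    return min_s <= seconds <= max_s
```

Running `is_expected_length(clip_duration_seconds(path))` over every clip would flag files that fall outside the documented range, for example truncated downloads.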
Size: 2.8 GiB Source: https://www.caito.de/2019/01/03/the-m-ailabs-speech-dataset/ Description: German phrases pronounced by native speakers, mainly from Librivox.org. The data is ready to be used on GoldenDict PC (not Android) and the “Search Bar” needs to be visibl...
I replaced the broken link with the updated one that I found on the same website here: http://www.caito.de/2019/01/the-m-ailabs-speech-dataset/ (branch master, mozilla/DeepSpeech#3703). Daniel Tinazzi authored Nov 17, 2021. 1 parent 73e1e4f, commit 4fa8dd3. Showing 1 changed file with 1 additi...