M-AILABS 语音数据集是我们提供的首个大型免费数据集,可自由用于语音识别和语音合成的训练数据。数据主要基于LibriVox和Project Gutenberg,包含近千小时的音频和准备好的文本文件。每个片段都提供了转录,片段长度从1到20秒不等,总长度在列表中显示。文本发表于1884至1964年间,属于公共领域。音频由LibriVox项目录制,也...
Size: 2.8 GiB Source: [size=2]https://www.caito.de/2019/01/03/the-m-ailabs-speech-dataset/[/size] Description:German phrases pronounced by native speakers mainly fromLibrivox.org. The data is ready to be used on GoldenDict PC (not Android) and the “Search Bar” needs to be visibl...
I replaced the broken link with the updated one that I found on the same website here: http://www.caito.de/2019/01/the-m-ailabs-speech-dataset/master (mozilla/DeepSpeech#3703) Daniel Tinazzi authored Nov 17, 2021 1 parent 73e1e4f commit 4fa8dd3 Showing 1 changed file with 1 additi...
Video curation pipeline for building your own video dataset. [Coming soon] Post-training scripts via NeMo Framework to post-train the pre-trained world foundation models for various Physical AI setup. Pre-training scripts via NeMo Framework for building your own world foundation model. [Diffusion]...