https://keithito.com/LJ-Speech-Dataset/ https://github.com/keithito/tacotron
LJSpeech公司 3625.02M 320 浏览 0 喜欢 0 次下载 0 条讨论 Others Classification 分享 Share 收藏 Favorite 0 0 数据介绍 文件预览 相关论文 Code 分享讨论(0) 使用声明 启动Notebook开发 数据结构 ? 3625.02M * 以上分析是由系统提取分析形成的结果,具体实际数据为准。 README.md # Datas...
[Russian] This script will split audio file on silence, transcript it with google recognition and save it in LJSpeech-1.1 dataset manner. pythongoogle-cloudspeech-to-texttranscriptorrussian-languageljspeechaudio-transcription UpdatedMar 29, 2021 ...
What should I do ? The reason for such warnings is that there is '͡' in metadata.csv in ljspeech, but '͡' is not declared in characters. If you want to remove the warning, add '͡' to the characters or punctuations in your characters_class. But if '͡' is not used a l...
The LJ Speech 喜爱 2 这是一个公共领域的语音数据集,包含来自单个演讲者的13,100个简短音频片段,这些片段来自7部非小说类书籍。 为每个剪辑提供了转录。 剪辑的长度从1到10秒不等,总长度约为24小时。 缘梦枫华 4枚 CC0 4 21 2020-11-12 详情 相关项目 评论(0) 创建项目 数据集介绍 这是一个公共领域...
This is a checkpoint for the Tacotron 2 model that was trained in NeMo on LJspeech for 1200 epochs. It was trained with Apex/Amp optimization level O0, with 8 * 16GB V100, and with a batch size of 48 per GPU for a total batch size of 384. It contains the checkpoints for the Tac...
This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours. Data Collecti...
This repository provides all the necessary tools for using a HiFIGAN vocoder trained with LJSpeech. The pre-trained model takes in input a spectrogram and produces a waveform in output. Typically, a vocoder is used after a TTS model that converts an input text into a spectrogram. ...
import numpy as np import librosa import librosa.display import matplotlib.pyplot as plt import glob # 设置文件目录和格式 file_path_pattern = './wavs/LJ050-*.wav' # 替换为实际LJSpeech文件所在的路径 # 收集所有LJ050的音频文件 audio_files = glob.glob(file_path_pattern) # 只加载前30个音频...
Breadcrumbs hifi-gan /LJSpeech-1.1 / validation.txtTop File metadata and controls Code Blame 150 lines (150 loc) · 29.8 KB Raw LJ050-0269|The essential terms of such memoranda might well be embodied in an Executive order.|The essential terms of such memoranda might well be embodied in ...