test_csv = "/tsdata/data/data_537h/aishell/test/together.txt.csv" dataset = load_dataset('csv', data_files={'train': train_csv, 'test': test_csv, 'validation':dev_csv }) print(dataset) aishell dataset 之后想使用aishell数据来finetune XLSR[3](跨语言预训练模型),待续。。。 参考 ^wa...
Dataset Loaders Edit huggingface/datasets 19,801 Tasks Edit Text-To-Speech Synthesis Similar Datasets JVS JVS AISHELL-1 Usage Number of Papers20222024202120232025024681012AISHELL-3JVSAISHELL-1 License Edit Custom (non-commercial) Modalities Edit Texts Speech Languages Edit Chinese Mandari...
The AISHELL-5 dataset contains more than 100 hours of speech data, divided into 94 hours of training data(Train), 3.3 hours of validation data (Dev), and two test sets(Eval1 and Eval2), with durations of 3.3 and 3.58 hours. Each dataset includes far-field audio from 4 channels, wit...
Source:AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline Homepage Benchmarks Edit Add a new resultLink an existing benchmark TrendTaskDataset VariantBest ModelPaperCode Speech Recognition AISHELL-1 FireRedASR-AED ...
Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen Interspeech 2021|August 2021 In this paper, we present AISHELL-4, a sizable real-recorded Mandarin speech dataset collected by 8-channel circular microphone array for speech proce...
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, ...
MusicEval 生成式音乐评分数据集 A Generative Music Dataset with Expert Ratings for Automatic Text-to-Music Evaluation* This dataset was jointly developed and constructed by the HLT Laboratory of the College of Computer Science at Nankai University and AISHELL....
AISHELL联合西北工业大学、中国科学技术大学、微软合著的论文《AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario》已被语音研究顶级会议INTERSPEECH 2021接收。 论文地址 https://arxiv.org/abs/2104.03603 ...
pds用 y yearyyfan 2枚 CC0 语音识别 0 8 2024-11-12 详情 相关项目 评论(0) 创建项目 文件列表 data_aishell.tgz data_aishell.tgz (14861.02M) 下载 File Name Size Update Time data_aishell/wav/S0724.tar.gz 42977549 2017-06-13 02:08:54 data_aishell/wav/S0725.tar.gz 51900074 2017-06...
训练验证测试集 蛋 蛋宝哒哒 1枚 GPL 2 语音识别 8 41 2024-05-07 详情 相关项目 评论(0) 创建项目 文件列表 wav_-15_to_15db_test_4noise.zip avg_10.pdparams wav_-15_to_15db_train_dev_4noise.zip wav_-15_to_15db_test_4noise.zip (929.53M) 下载 File Name Size Update Time test/S...