Source: AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline. Benchmarks: Task: Speech Recognition; Dataset Variant: AISHELL-1; Best Model: FireRedASR-AED ...
test_csv = "/tsdata/data/data_537h/aishell/test/together.txt.csv"
dataset = load_dataset('csv', data_files={'train': train_csv, 'test': test_csv, 'validation': dev_csv})
print(dataset)
AISHELL dataset. Next, I plan to use the AISHELL data to fine-tune XLSR[3] (a cross-lingual pretrained model). To be continued... References ^wa...
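Before passing a split-to-path mapping like the one above to `load_dataset`, it can help to confirm that every CSV opens and that all splits share one header. The sketch below does this with the standard library only. The helper names `check_split_files` and `write_demo_csv`, and the `path`/`text` column names, are hypothetical and not taken from the original post.

```python
import csv
import os
import tempfile


def check_split_files(data_files):
    """Return {split: (header, row_count)} for a split -> CSV path mapping.

    Raises ValueError if a file is empty or the splits disagree on columns.
    """
    summary = {}
    for split, path in data_files.items():
        with open(path, newline="", encoding="utf-8") as handle:
            reader = csv.reader(handle)
            header = next(reader, None)
            if header is None:
                raise ValueError(f"{split}: {path} is empty")
            row_count = sum(1 for _ in reader)
        summary[split] = (tuple(header), row_count)
    headers = {header for header, _ in summary.values()}
    if len(headers) > 1:
        raise ValueError(f"splits disagree on columns: {sorted(headers)}")
    return summary


def write_demo_csv(directory, name, rows):
    """Write a tiny CSV with hypothetical 'path' and 'text' columns."""
    path = os.path.join(directory, name)
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["path", "text"])
        writer.writerows(rows)
    return path


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        data_files = {
            "train": write_demo_csv(tmp, "train.csv", [["a.wav", "你好"], ["b.wav", "谢谢"]]),
            "test": write_demo_csv(tmp, "test.csv", [["c.wav", "再见"]]),
            "validation": write_demo_csv(tmp, "dev.csv", [["d.wav", "早上好"]]),
        }
        print(check_split_files(data_files))
```

A mapping that passes this check has the same shape as the `data_files` argument in the snippet above, so it can be handed to `load_dataset('csv', data_files=...)` unchanged.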
AISHELL-1 mixed with four types of aircraft cabin noise: training, validation, and test sets. Uploader: 蛋宝哒哒. License: GPL 2. Task: speech recognition. 2024-05-07. File list: wav_-15_to_15db_test_4noise.zip (929.53M), avg_10.pdparams, wav_-15_to_15db_train_dev_4noise.zip. File...
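AISHELL-4 open-source address 1: http://www.openslr.org/111/ AISHELL-4 open-source address 2: http://www.aishelltech.com/aishell_4 The paper "AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario", co-authored by AISHELL, Northwestern Polytechnical University, the University of Science and Technology of China, and Microsoft, has...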
Dataset, baseline system recipe, paper (arXiv). License: CC BY NC 4.0. The LRDWWS Challenge is designed to tackle the wake-up word spotting task for individuals with dysarthria, with the ultimate goal of facilitating broader integration in real-world applications. The challenge data uses the MDSC dat...
The AISHELL-5 dataset contains more than 100 hours of speech data, divided into 94 hours of training data (Train), 3.3 hours of validation data (Dev), and two test sets (Eval1 and Eval2) with durations of 3.3 and 3.58 hours, respectively. Each dataset includes far-field audio from 4 channels, wit...
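AISHELL-NER is a Chinese speech named-entity recognition dataset built on the widely used AISHELL-1. It is released under the same [Apache License v.2.0](https://www.apache.org/licenses/LICENSE-2.0) and aims to advance Chinese speech named-entity recognition technology. Paper: https://arxiv.org/pdf/2202.08533.pdf ...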
For pds use. Uploader: yearyyfan. License: CC0. Task: speech recognition. 2024-11-12. File list: data_aishell.tgz (14861.02M).
File Name | Size | Update Time
data_aishell/wav/S0724.tar.gz | 42977549 | 2017-06-13 02:08:54
data_aishell/wav/S0725.tar.gz | 51900074 | 2017...
WERs of 3.8%/9.1% on Librispeech test clean/other dataset without an external LM, and a CER of 5.8% on Aishell1 Mandarin corpus, respectively ... R Fan, W Chu, P Chang, ... - IEEE International Conference on Acoustics. Cited by: 0. Published: 2021. Improving Transformer-Based Speech Recogniti...
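For readers unfamiliar with the CER metric quoted above, here is a minimal sketch of how a character error rate is commonly computed: the Levenshtein distance between reference and hypothesis characters, summed over utterances and divided by the total number of reference characters. This is a generic illustration and not the cited paper's scoring script. The function names are hypothetical, and stripping spaces before scoring is an assumption.

```python
def edit_distance(reference, hypothesis):
    """Levenshtein distance between two sequences (row-by-row DP)."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_item in enumerate(reference, start=1):
        current = [i]
        for j, hyp_item in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_item != hyp_item)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def character_error_rate(references, hypotheses):
    """Corpus-level CER: total character edits / total reference characters."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have equal length")
    total_errors = 0
    total_chars = 0
    for ref, hyp in zip(references, hypotheses):
        # Assumption: spaces are not scored, as is usual for Mandarin text.
        ref_chars = ref.replace(" ", "")
        hyp_chars = hyp.replace(" ", "")
        total_errors += edit_distance(ref_chars, hyp_chars)
        total_chars += len(ref_chars)
    if total_chars == 0:
        raise ValueError("reference text is empty")
    return total_errors / total_chars


if __name__ == "__main__":
    refs = ["今天天气很好"]
    hyps = ["今天天汽很好"]  # one substituted character out of six
    print(f"CER = {character_error_rate(refs, hyps):.3f}")
```

WER follows the same recipe with word tokens in place of characters, which is why it is reported for LibriSpeech while CER is reported for Mandarin corpora, where word boundaries are not marked in the text.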
Dataset Loaders: huggingface/datasets. Tasks: Text-To-Speech Synthesis. Similar Datasets: JVS, AISHELL-1. Usage: [Figure: number of papers per year, 2021 to 2025, for AISHELL-3, JVS, and AISHELL-1]. License: Custom (non-commercial). Modalities: Texts, Speech. Languages: Chinese Mandari...