nlpnewswikitext-classificationword2veccorpusdatasetquestion-answeringchinesechinese-nlplanguage-modelbertchinese-corpuspretrainchinese-dataset UpdatedMay 23, 2024 chaoswork/sft_datasets Star503 Code Issues Pull requests 开源SFT数据集整理,随时补充 datasetschinese-datasetlarge-language-modelsllmssupervised-finetunin...
Datasetfilenotes alpaca-chinesealpaca-chinese-52k.json包含了52k英文和中文的数据全集 alpaca-chinese./data/alpaca_chinese_part*.json分拆数据文件 Case1成语:有一些sample,直译后需要进行二次改写,例如成语类的 { "en_instruction": "What is the meaning of the following idiom?", "instruction": "以下成语...
Chinese_dataset5w.zip (144.34M) 下载 File Name Size Update Time Chinese_dataset5w_rec_test_win.txt 178792 2023-03-28 21:29:10 Chinese_dataset5w_rec_test.txt 168792 2023-03-28 21:29:10 Chinese_dataset5w.txt 4203619 2023-03-28 21:29:10 Chinese_dataset5w/img_0000001.jpg 3531 2023-03-...
In this paper, a deep-learning based face editing approach, StyleGAN, is used to synthesize a Chinese face dataset, namely SZU-EmoDage, where faces with different expressions and ages are synthesized. Leverage on the interpolations of latent vectors, continuously dynamic expressions with different ...
Meanwhile, DCU (Deep Computing Unit), a new Chinese domestic accelerator with high acceleration performance, exhibits tremendous adaptability in transplanting the work of TADOC. Therefore, this paper proposes D-TADOC, a compressed data direct computing technology for Chinese dataset on DCU, which can...
alpaca_chinese_datasetJe**ff 上传16.89 MB 文件格式 zip alpaca chatglm dataset 人工精调的中文对话数据集和一段chatglm的微调代码 点赞(0) 踩踩(0) 反馈 所需:1 积分 电信网络下载 zzyl-end-nursing 2025-03-22 03:46:41 积分:1 zzyl01 2025-03-22 03:45:54 积分:1 ...
数据集链接(CHEF Dataset),论文链接,欢迎大家使用CHEF! 1. 介绍 先来看看任务的定义,举一个相对比较简单的例子: 比如上海封控期间,某自媒体就声称“李立群偷下楼买肉被抓”。单凭这个声明(Claim)本身,我们其实没法判断他有没有偷偷下楼买肉然后被抓。为了验证这个声明的真实性,最直观的思路就是要寻找证据(...
Chinese_book_dataset故事**已淡 上传 data-mining dataset informatics library-management machine-learning natural-language-processing text-classification 中文图书数据集是自然语言处理领域的宝贵资源,它涵盖了大量的中文图书信息。这些数据经过精心整理和分类,包括了图书的基本信息、作者信息、出版日期、出版社等。通过...
Chinese. The brain’s processing mechanisms differ for various languages. For example, the brain exhibits specificity in response to Chinese compared to English25. Therefore, it is important to create an EEG dataset based on other language stimuli. Chinese, being distinct from English in both ...
一个不错的中文数据集 :Emotional first aid dataset, github.com/chatopera/ef,对话形式,但内容有些杂。 从心理健康领域的角度来看,之前的大多数工作都集中在单一领域,如抑郁症、自杀意念和饮食失调(Harrigian等人,2020年)。相反,PsyQA包含各种一般的心理健康障碍,涉及9个由求助者标记的主题,包括自我成长、情感、...