RoBERTa is a transformers model pretrained on a large corpus in a self-supervised fashion. This means it was pretrained on the raw texts only, with no humans labelling them in any way (which is why it can use lots of publicly available data) with an automatic process to generate inputs ...
xlm-roberta-large / model.safetensors model.safetensors 135 Bytes 一键复制 编辑 原始数据 按行查看 历史 Sylvain Gugger 提交于 2年前 . Adding safetensors variant of this model (#13) 文件存储在 Git LFS,不支持在线预览。 查看操作指南 下载(2.09 GB) Loading... 跳转 举报 举报成功...
xlm-roberta-large-ner-hrl / sentencepiece.bpe.model sentencepiece.bpe.model 132 Bytes 一键复制 编辑 原始数据 按行查看 历史 Davlanade 提交于 3年前 . add xlm-large ner hrl 文件存储在 Git LFS,不支持在线预览。 查看操作指南 下载(4.83 MB) Loading... 跳转 举报 举报成功 我们将于...
The appearance of complex attention‐based language models such as BERT, RoBERTa or GPT‐3 has allowed to address highly complex tasks in a plethora of sce... J Huertas‐Tato,A Martín,D Camacho - 《Expert Systems》 被引量: 0发表: 2023年 Multi-SimLex: A Large-Scale Evaluation of Multilin...
改进,首先借鉴RoBERTa(ARobustlyOptimizedBERT)[54]的思想在更大规模多语 言平行语料库(在100种语言上使用过滤后的超过2.5TBCommonCrawl大型数 据集上)使用掩码语言模型(MaskedLanguageModeling,MLM)进行自监督预 [55] 训练;另外通过基于一元文法语言模型子词切分方法对跨语言文本进行分词, ...
1 https://gitee.com/hf-models/xlm-roberta-large.git git@gitee.com:hf-models/xlm-roberta-large.git hf-models xlm-roberta-large xlm-roberta-large深圳市奥思网络科技有限公司版权所有 Git 大全 Git 命令学习 CopyCat 代码克隆检测 APP与插件下载 Gitee Reward Gitee 封面人物 GVP 项目 Gitee 博客...
Parent Model:XLM-RoBERTa-large Resources for more information:-GitHub Repo-Associated Paper Uses Direct Use The model is a language model. The model can be used for token classification, a natural language understanding task in which a label is assigned to some tokens in a text. ...
xlm-roberta-large chevron_right config.json703 B insert_drive_file pytorch_model.bin2.24 GB insert_drive_file sentencepiece.bpe.model5.07 MB special_tokens_map.json239 B tokenizer_config.json399 B tokenizer.json17.08 MB Outputmore_vert arrow_drop_down folder xlm-roberta-large config.json insert_...
more_vert Data CardCode (2)Discussion (0)Suggestions (0) About Dataset No description available Usability info License Unknown Tags Pre-Trained ModelArts and Entertainment xlm-roberta-large-bn.pt(2.31 GB) get_app fullscreen chevron_right
Something went wrong and this page crashed! If the issue persists, it's likely a problem on our side. Unexpected token '<', "<!doctype "... is not valid JSON SyntaxError: Unexpected token '<', "<!doctype "... is not valid JSON...