[2] X. Wang, J. Wu, J. Chen, L. Li, Y. Wang, and W. Y. Wang. Vatex: A large-scale, high-quality multilingual dataset for video-and-language research. In ICCV, 2019. [3] H. Xu, Q. Ye, M. Yan, Y. Shi, J. Ye, Y. Xu, C. Li, B. Bi, Q. Qian, W. Wang, G....
and V. Lal. Improving video retrieval using multilingual knowledge transfer. In European Conference on Information Retrieval, 2022. [2] X. Wang, J. Wu, J. Chen, L. Li, Y. Wang, and W. Y. Wang. Vatex: A large-scale, high-quality multilingual dataset for video-and-language research. ...
[2] X. Wang, J. Wu, J. Chen, L. Li, Y. Wang, and W. Y. Wang. Vatex: A large-scale, high-quality multilingual dataset for video-and-language research. In ICCV, 2019. [3] H. Xu, Q. Ye, M. Yan, Y...
Youku-mPLUG: Chinese Large-scale Video-Text Dataset (Youku-mPLUG中文视频文本大规模数据集) MiraMo 2023-06-21 17:22:22 281 0 发布于上海 举报飞天免费试用计划 领取免费云资源,开启云上实践第一步 NLP 自学习平台 3个模型定制额度 1个月 额度1个月内有效 立即试用 NLP自然语言处理_基础版 每接口每天5...
该数据集提供了三个不同的多模态视频基准任务,用于评估预训练模型的能力,包括视频分类预测、视频文本检索和视频字幕生成】'Youku-mPLUG 10M Chinese Large-Scale Video Text Dataset - Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks' X-PLUG GitHub: github....
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks - X-PLUG/Youku-mPLUG
youku-mplug是一个基于Youku视频网站的多模态视频理解数据集,包含大量的视频和与之对应的文本描述、音频...
To promote the development of Vision-Language Pre-training (VLP) and multimodal Large Language Model (LLM) in the Chinese community, we firstly release the largest public Chinese high-quality video-language dataset named Youku-mPLUG, which is collected from Youku, a well-known Chinese video-shari...
- webdataset==0.2.30 - werkzeug==2.2.2 - wheel==0.37.1 - xxhash==3.2.0 - yacs==0.1.8 - yapf==0.33.0 - yarg==0.1.9 - yarl==1.9.2 - zhconv==1.4.3 196 changes: 196 additions & 0 deletions 196 initialize.py Original file line numberDiff line numberDiff line change @@ -0...
[2] X. Wang, J. Wu, J. Chen, L. Li, Y. Wang, and W. Y. Wang. Vatex: A large-scale, high-quality multilingual dataset for video-and-language research. In ICCV, 2019. [3] H. Xu, Q. Ye, M. Yan, Y. Shi, J. Ye, Y. Xu, C. Li, B. Bi, Q. Qian, W. Wang, G....