Fine-grained Audible Video Description ⋆Xuyang Shen2, ⋆Dong Li1, ⋆Jinxing Zhou3, Zhen Yuchao Dai4, Lingpeng Kong5, Meng Qin2, Bowen He2, Wang3, Yu Qiao1, XiYaoirdaonnZghHoanng21, Aixuan Li4, 1Shanghai Artificial Intelligence Laboratory, 2OpenNLPLab, 3Hefei University of ...
Fine-Grained Audible Video Description Xuyang Shen, Dong Li, Jinxing Zhou, Zhen Qin, Bowen He, Xiaodong Han, Aixuan Li, Yuchao Dai, Lingpeng Kong, Meng Wang, Yu Qiao, Yiran Zhong; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 10585-...