Xiong. Audio-Visual Event Detection Based on Mining of Semantic Audio-Visual Labels, In Proc. SPIE Conference on Storage and Retrieval for Multimedia Databases, pp. 292-299, 2004. King-Shy Goh, Koji Miyahara, Regunathan Radhakrishnan, Ziyou Xiong, Ajay Divakaran, "Audio-Visual Event Detection...
Keywords: 3D localization; audio-visual fusion; event detection; scene analysis. In this paper we address the problem of detecting and localizing objects that can be both seen... X Alameda-Pineda, V Khalidov, R Horaud, ... - ACM. Cited by: 45. Published: 2011. DOI: 10.1145/2070481.2070527 Finding Audio-Visual Events in ...
Chua, "The fusion of audio-visual features and external knowledge for event detection in team sports video," In Proc. of Workshop on Multimedia ... Z Bo, B Feng, X Bo. Cited by: 0. Published: 2013. Integrated analysis of audiovisual signals and external information sources for event detection in te...
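Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing. This paper proposes the Audio-Visual Video Parsing task. The earlier video localization task only requires a model to understand scenes in which the modalities co-occur, whereas this new task requires a multimodal model to have some understanding of each individual modality: within a complex temporal scene, it must distinguish which events are visual and which are audio...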
Keywords: Audio-visual processing; Multiple cameras; Synchronization; Event detection. DOI: 10.1007/s11042-014-1872-y. Cited by: 19. Year: 2015.
Event boundary detection using audio-visual features and web-casting texts with imprecise time information. We propose a method to detect events and event boundaries in soccer videos by using web-casting texts and audio-visual features. The events and their inacc... M Boyar, Özgür Alan, S Akpinar...
Video event detection & summarization using audio, visual & text saliency. Detection of perceptually important video events is formulated here on the basis of saliency models for the audio, visual and textual information conveyed ... G Evangelopoulos, A Zlatintsi, G Skoumas, ... Cited by: 62. Published: ...
The mission of the Audiovisual Communications laboratory is to perform basic and applied research in signal processing for communications. - Audiovisual Communications Laboratory
Audio-Visual Event Localization in Unconstrained Videos. Yapeng Tian, Jing Shi, Bochen Li, Zhiyao Duan, and Chenliang Xu. University of Rochester, United States. In this material, firstly, we show how we gather the Audio-Visual Event (AVE) dataset in Sec. 1. Then we describe the implementation details of ...
In this paper, an effective system consisting of four subsystems is proposed for bowling game video indexing by integrating visual and auditory information. The lane boundary information is extracted first to assist in the detection of all events. For the throwing clip event, the auditory temporal...