Streri, Arlette and Edouard Gentaz, "Cross-modal recognition of shape from hand to eyes in human newborns", Somatosensory & Motor Research, 2003; 20 (1): pp. 11-16Streri, A. & Gentaz, E. (2003) Cross-modal recognition of shape from hand to eyes in human newborns. Somatosensory and...
(ECCV'22) Expanding Language-Image Pretrained Models for General Video Recognition (AAAI'23) Revisiting Classifier: Transferring Vision-Language Models for Video Recognition (CVPR'23) Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models (简称BIKE) 3...
We describe a patient (CD), with a right fronto-temporal degeneration, who showed massive defects in the recognition of familiar people and severe behavioural disorders. CD scored in the normal range on tests of episodic memory, attention and visual–spatial abilities, and obtained mildly abnormal...
本篇文章提出的方法针对的问题是音视频的语音识别,以及多模态的合成和转换,也即标题里的manipulation。相对于传统的方法,本文的特点是提出了一个统一的多模态多任务模型,经过训练后,可以同时完成多个模态任务。对多模态的表征在训练中,按照模态分离成了模态相关的话者表征,然后对语言学内容,即文本信息,采用了一个多...
Our procedures were based on previous studies of the cross-modal recognition of human individuals by horses23. The trials were conducted in the stables by two experimenters. Experimenter 1 (E1) was hidden from the subject horses behind the screen, and he/she manipulated the laptop (SVD13238EJB...
Currently, multimodal metaphor recognition is in an active exploration stage as an emerging direction in natural language processing. Although preliminary models [6,7] have emerged, these models still have some roughness. First, there is a lack of effective fusion algorithms to integrate multi-source...
Recently, it has become a popular strategy in multi-label image recognition to predict those labels that co-occur in a picture. Previous work has concentrated on capturing label correlation but has neglected to correctly fuse picture features and label embeddings, which has a substantial influence ...
CROSS-MODAL KNOWLEDGE DISTILLATION FOR ACTION RECOGNITION(2019 IEEE International Conference on Image Processing (ICIP)) 1.介绍 问题:完全监督训练网络有效,虽然数据集的收集不是问题,但是数据集的标注(定label)是一个比较繁琐的事情。 本文就利用知识蒸馏方法来解决在没有类标签的情况下,如何利用已经训练好的一...
Cross-modal Hallucination for Few-shot Fine-grained Recognition 最先进的深度学习算法需要大量的数据用于模型训练,缺乏会导致性能恶化,尤其是在不同类别之间具有细粒度的边界的时候。 Introduction 方法背后的直觉是生成额外训练的样本,这些样本适用于文本描述,有助于在低数据场景中学习分类模型。
In addition, cross-modal matching was only possible under the same conditions that yielded robust word recognition performance. The results are consistent with the hypothesis that acoustic and optical displays of speech simultaneously carry articulatory information about both the underlying linguistic message...