This paper presents a new method for video and image categorization based on a database of over 50,000 videos collected from YouTube and down-sampled to tiny size. The categorization results achieved by tiny videos are compared with the tiny images framework for a variety of recognition tasks....
The objective of this paper is large scale object instance retrieval, given a query image. A starting point of such systems is feature detection and descri... R Arandjelovic, A Zisserman. Cited by: 378. Published: 2013. MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition In this...
FSIR methods perform differently across datasets; datasets can be characterized by image complexity, intra-concept visual consistency, and inter-concept visual similarity. Different FSIR methods perform differently on different datasets, which depends on both dataset structures and method ability; method ability is closely tied to the characteristics of the dataset structures ...
In: 2018 IEEE/CVF Conference on computer vision and pattern recognition, pp 4510–4520, DOI https://doi.org/10.1109/CVPR.2018.00474 Scavaco S, Henriques JT, Mengucci M, Correia N, Medeiros F (2013) Color sonification for the visually impaired. In: Cruz-Cunha MM, Varajão HKJ, Martinho...
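Dataset Bias in Few-shot Image Recognition (Shuqiang Jiang) mainly studies how dataset bias affects model performance in few-shot image recognition (FSIR) and examines the performance differences across dataset structures and few-shot learning methods. (1) Research background: the goal of FSIR is to recognize novel categories using transferable knowledge learned from the training data (base categories), usually requiring only...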
Artificial intelligence models play a crucial role in monitoring and maintaining railroad infrastructure by analyzing image data of foreign objects on power transmission lines. However, the availability of publicly accessible datasets for railroad foreig
With the popularity of dual cameras in recently released smart phones, a growing number of super-resolution (SR) methods have been proposed to enhance the resolution of stereo image pairs. However, the lack of high-quality stereo datasets has limited the research in this area. To facilitate the...
Abstract: We introduce a 120 class Stanford Dogs dataset, a challenging and large-scale dataset aimed at fine-grained image categorization. Stanford Dogs includes over 22,000 annotated images of dogs belonging to 120 species. Each image... Cited by: 38 ...
Face recognition can identify a person with near-precise detection; one of its primary problems is achieving an accurate recognition rate. Local datasets, rather than public datasets, are used to implement this research. A median filter is used to remo...
To advance object detection research in Earth Vision, also known as Earth Observation and Remote Sensing, we introduce a large-scale Dataset for Object deTection in Aerial images (DOTA). To this end, we collect $2806$ aerial images from different sensors and platforms. Each image is of the ...