content-based image retrievalimage databasequalitative spatial relationcolorshapeThis paper presents an intelligent method of retrieving images with Chinese captions from an image database. We combine color, shape and spatial features of the image to index and measure the similarity of images. As a ...
This is a python (Flask Application) based Automated Image Caption and Image Retrieval model which makes use of deep learning image caption generator. It uses a merge model comprising of Convolutional Neural Network (CNN) and a Long Short Term Memory Network (LSTM) . The dataset used here is...
Jiang D, Ye M (2023) Cross-modal implicit relation reasoning and aligning for text-to-image person retrieval. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2787–2797 Cao M, Bai Y, Zeng Z, Ye M, Zhang M (2023) An empirical study of clip fo...
Johnson J, Krishna R, Stark M, Li L-J, Shamma D, Bernstein M, Fei-Fei L (2015) Image retrieval using scene graphs. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3668–3678. https://doi.org/10.1109/CVPR.2015.7298990 Gupta N, Jalal A (2020) Int...
Image captioning is a multi-modal transduction task, translating the source image into the target language. Numerous dominant approaches primarily employed the generation-based or the retrieval-based method. These two kinds of frameworks have their advantages and disadvantages. In this work, we make ...
which has been helped by the increase in internet bandwidths and storage spaces. The increase in video data has led to an interest in the understanding of video for different applications such as video retrieval, surveillance, and online advertisements. Video retrieval is a significant task in the...
Model library: Including multi-modal fusion, cross-modal retrieval, image caption, and multi-task algorithms. Trainer: Set up a unified training process and related score calculations for each task. Use Download the toolkit: git clone https://github.com/njustkmg/OMML.git Data construction instru...
including Indicator, Reagent, or Diagnostic Aid; Organic Chemical; Laboratory Procedure; Spatial Concept; Qualitative Concept; and Quantitative Concept.Discussion: The findings suggest that caption-based descriptors can complement title or abstract-based literature indexing for figure image retrieval in ...
image captionImage caption technology aims to convert visual features of images, extracted by computers, into meaningful semantic information. Therefore, the computers can generate text descriptions that resemble human perception, enabling tasks such as image classification, retri...
Faceretrieval,Constrained clusteringThis research is partially funded by the Cognitive-Level Annotation using Latent Statistical Structure(CLASS) project of the European Union Information Society Technologies unit E5 (Cognition).We wouldalso like to thank Tamara Berg,Mark Everingham,and Gary Huang for ...