6639 LAF-Net: Locally Adaptive Fusion Networks for Stereo Confidence Estimation Sunok Kim (Yonsei University); Seungryong Kim (Yonsei University); Dongbo Min (Ewha Womans University); Kwanghoon Sohn (Yonsei Univ.)* 3D from Multiview and Sensors Others Oral 1.1.2 3 23 3522 NM-Net: Mining...
D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry Paper: https://arxiv.org/abs/2003.01060 Multi-Modal Domain Adaptation for Fine-Grained Action Recognition Paper: https://arxiv.org/abs/2001.09691 Distribution Aware Coordinate Representation for Human Pose Estimation ...
LHRS-Bot-Nova features an enhanced vision encoder and a novel bridge layer, enabling efficient visual compression and better language-vision alignment. To further enhance RS-oriented vision-language alignment, we propose a large-scale RS image-caption dataset, generated through feature-guided image ...
Multi-Modal and Multi-Temporal Data Fusion: Outcome of the 2012 GRSS Data Fusion Contest The 2012 Data Fusion Contest organized by the Data Fusion Technical Committee (DFTC) of the IEEE Geoscience and Remote Sensing Society (GRSS) aimed at inv... C Berger, M Voltersen, R Eckardt, ... - ...
In this work, research on remote sensing imagery intelligent interpretation combined with multimodal data and multitask learning is reviewed regarding basic concepts, research methods, and application scenarios. Moreover, a separate-domain extraction- and cross-domain fusion-based foundation model is ...
Keywords multimodal, remote sensing, image interpretation, feature fusion, co-learning Citation Sun X, Tian Y, Lu W X, et al. From single- to multi-modal remote sensing imagery interpretation: a survey and taxonomy. Sci China Inf Sci, 2023, 66(4): 140301, https://doi.org/10.1007/s1...
(DKS) module to initially focus on point cloud data on language-related objects. Furthermore, we design a target-oriented progressive relation mining (TPM) module to finely focus on target objects through multi-layer intra-modal relation modeling and inter-modal target mining. 3D-SPS avoids the...
Characterization of complex fluvio–deltaic deposits in Northeast China using multi-modal machine learning fusion Article Open access 07 August 2020 Machine learning and AVO class II workflow for hydrocarbon prospectivity in the Messinian offshore Nile Delta Egypt Article Open access 28 January 2025...
[1] Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion paper Anomaly Detection in Image [7] Anomaly Detection in Video via Self-Supervised and Multi-Task Learning ...
## Multi-Modal Learning

[5] Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning [paper](https://arxiv.org/abs/2303.05952)

[4] Multimodal Prompting with Missing Modalities for Visual Recognition [paper](https://arxiv.org/abs/2303.03369...