- 方法:HiTVideo,层次化标记器,3D因果VAE- 效果:压缩率提升,重建质量,文本指导 多目标跟踪 (Multi-object Tracking) 状态英文标题中文标题作者PDF链接代码/贡献 发布 Cognitive Disentanglement for Referring Multi-Object Tracking 认知解耦用于多对象跟踪的指称 Shaofeng Liang, Runwei Guan, Wangwang Lian, Daizong...
Video object tracking based on Segment-Anything-2:Installation Guide Object proposal generation based on UPN or open_vision model:Installation Guide Interactive visual-text prompting for generic vision tasks:Installation Guide Important updates 🚀 feat(model_zoo): Add YOLOv8-SAM2.1 instance segmentation...
- 效果:效率提升,分辨率优化,CNN与ViT比较 发布 An interpretable approach to automating the assessment of biofouling in video footage 一种用于自动评估视频素材中生物污损的可解释方法 Evelyn J. Mannix, Bartholomew A. Woodham arxiv.org/pdf/2503.1287 - 问题:生物污损评估,自动化,计算机视觉- 方法:ComFe,...
"Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track." ArXiv (2024). [paper] [2024.08] 💥Surgical SAM 2:Haofeng Liu, Erli Zhang, Junde Wu, Mingxuan Hong, Yueming Jin. "Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pr...
- 问题:ViT架构,效率低,冗余层- 方法:相似性引导,层自适应,SGLATrack- 效果:实时速度,精度保持 发布 GroMo: Plant Growth Modeling with Multiview Images GroMo:基于多视角图像的植物生长建模 Ruchi Bhatt, Shreya Bansal, Amanpreet Chander, Rupinder Kaur, Malya Singh, Mohan Kankanhalli, Abdulmotaleb El...
2024-11-01 Event-guided Low-light Video Semantic Segmentation Zhen Yao et.al. 2411.00639 null 2024-11-01 Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data Hairuo Hu et.al. 2411.00499 null 2024-11-01 Cityscape-Adverse: Benchmarkin...
- 方法:DE-ViT模型,Few-Shot学习,数据集- 效果:性能提升,域偏移 发布 Superpowering Open-Vocabulary Object Detectors for X-ray Vision 超级赋能开放词汇X射线视觉目标检测器 Pablo Garcia-Fernandez, Lorenzo Vaquero, Mingxuan Liu, Feng Xue, Daniel Cores, Nicu Sebe, Manuel Mucientes, Elisa Ricci arxiv....
Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline Towards Artistic Image Aesthetics Assessment: A Large-Scale Dataset and a New Method ScaleDet: A Scalable Multi-Dataset Object Detector JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking🌻dataset ...
- 方法:轻量级CNN,状态空间模型,BSTM模块,EgoEvGesture数据集- 效果:高精度,低参数,泛化能力强 多目标跟踪 (Multi-object Tracking) 状态英文标题中文标题作者PDF链接代码/贡献 更新 TAPNext: Tracking Any Point (TAP) as Next Token Prediction TAPNext:作为下一个标记预测的任意点(TAP)跟踪 Artem Zholus, ...
目标检测识别 (Object Detection & Recognition) 其他 状态英文标题中文标题作者PDF链接代码/贡献 更新 AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction AdaCM^2:关于理解极长期视频的自适应跨模态记忆缩减 Yuanbin Man, Ying Huang, Chengming Zhang, Bingzhe Li...