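Paper title: Seeing Beyond the Patch: Scale-Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery based on Reinforcement Learning. Paper link: openaccess.thecvf.com/c Published: 2023.9.27. Abstract: In remote sensing image analysis, patch-based methods are limited in capturing information beyond the sliding window. To address this problem, this paper proposes a...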
Scene image generation can be run with python scripts/make_scene_samples.py --outdir=/some/outdir -r /path/to/pretrained/model --resolution=512,512 Training on custom data Training on your own dataset can be beneficial to get better tokens and hence better images for your domain. Those ar...
Realistic image synthesis based on deep learning is an invaluable technique for developing high-performance computer-aided diagnosis systems while protecting patient privacy. However, training a generative adversarial network (GAN) for image synthesis remains challenging because of the large amounts of data...
OpenImages Super-resolution LDM-VQ-4 N/A N/A N/A N/A https://ommer-lab.com/files/latent-diffusion/sr_bsr.zip BSR image degradation OpenImages Layout-to-Image Synthesis LDM-VQ-4 (200 DDIM steps, eta=0) 32.02 15.92 N/A N/A https://ommer-lab.com/files/latent-diffusion/layout2img...
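A Paddle-based reproduction of "Restormer: Efficient Transformer for High-Resolution Image Restoration". 1. Introduction. Because CNNs perform well at learning generalizable image priors from large-scale data, these models have been widely applied to image restoration and related tasks. Recently, another class of neural architectures, Transformers, has shown significant performance gains on natural language and high-level vision tasks. Although Transformer models mitigate the CNN...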
Managing imagery using a mosaic dataset configured for a specific type of high-resolution satellite imagery makes it straightforward to visualize, query, and analyze data. The mosaic dataset is the recommended data model for managing, accessing, processing, and visualizing imagery in ArcGIS. With a ...
Here, we report on the development of dark-field X-ray ptychography, which combines X-ray ptychography and X-ray in-line holography, to observe weak-phase objects with a phase resolution better than 0.01 rad, a spatial resolution better than 15 nm, and a field of view larger than 5 ...
1 JiLin-1 image dataset. Data URL: https://pan.baidu.com/s/1_yFbJ6nX1ovOK0_9BZ5Lrg?pwd=1234 Extraction code: 1234. 2 Whole Pipeline: Hybrid Task Cascade (HTC). Backbone: ResNeXt-101-64x4d and Deformable ConvNets v2 (DCN). Weight initialization: model pretrained for 20 epochs on the COCO dataset ...
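HLVID DataSet. Identities recorded: 200, with 50,656 images and an average sequence length of 126 frames. Cameras: 2; Camera A: 1920*1080, Camera B: 640*480. Pedestrian frame sizes: high-resolution (HR) frames range from 44*120 to 173*258, averaging 105*203; low-resolution (LR) frames range from 8*19 to 19*31, averaging 11*21. The number of high-resolution frames is about 91 times the number of low-resolution frames. ...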
HRSID: a high-resolution SAR images dataset for ship detection, semantic segmentation, and instance segmentation tasks. - zhangfx123/HRSID