This is the official implementation of the paper "SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution". Abstract Conventional diffusion models perform noise sampling from a single distribution, constraining their ability to handle real-world scenes and complex textures across semanti...
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024) - zsyOAOA/ResShift
logs = run(model["model"], lq_path, "superresolution", custom_steps) # lq.path 是低分辨率图片路径, custom steps 是ddim采样步骤,run在notebook_helpers.py中 sample = logs["sample"].detach().cpu() sample = torch.clamp(sample, -1., 1.) sample = (sample + 1.) / 2. * 255 sample...
论文链接:ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting 代码链接:github.com/zsyOAOA/ResS 论文概述: 基于扩散的图像超分辨率(SR)方法主要受限于低推理速度,这是由于需要数百甚至数千个采样步骤。现有的加速采样技术不可避免地在一定程度上牺牲了性能,导致过于模糊的SR结果。为...
Diffusion modelVariance attentionAerial imagerySuper-resolution relative fidelity indexImage super-resolution (SR) can significantly improve the resolution and quality of aerial imagery. Emerging diffusion models (DM) have shown superior image generation capabilities through multistep refinement. To explore ...
In image inpainting (Saharia et al., 2021a) and super resolution (Saharia et al., 2021b), the recently proposed and increasingly popular diffusion models (Sohl-Dickstein et al., 2015; Song and Ermon, 2019; Ho et al., 2020) have been shown to achieve state-of-the-art and competitive...
OpenImages Super-resolution LDM-VQ-4 N/A N/A N/A N/A https://ommer-lab.com/files/latent-diffusion/sr_bsr.zip BSR image degradation OpenImages Layout-to-Image Synthesis LDM-VQ-4 (200 DDIM steps, eta=0) 32.02 15.92 N/A N/A https://ommer-lab.com/files/latent-diffusion/layout2img...
1、HSR-Diff: Hyperspectral Image Super-Resolution via Conditional Diffusion Models 尽管高光谱图像(hyperspectral image,HSIs)在执行各种计算机视觉任务中的重要性已被证明,但由于在空间域中具有低分辨率(LR)属性,其潜力受到不利影响,这是由多种物理因素引起的。
https://emotion-diffusion.github.io/ 28、CosmicMan: A Text-to-Image Foundation Model for Humans 提出CosmicMan,一种用于生成高保真人体图像的文本到图像基础模型。与当前困在人体图像质量和文本-图像不对齐困境中的通用基础模型不同,CosmicMan能够生成具有细致外貌、合理结构和精确文本-图像对齐的逼真人体图像,...
OpenImagesSuper-resolutionLDM-VQ-4N/AN/AN/AN/Ahttps://ommer-lab.com/files/latent-diffusion/sr_bsr.zipBSR image degradation OpenImagesLayout-to-Image SynthesisLDM-VQ-4 (200 DDIM steps, eta=0)32.0215.92N/AN/Ahttps://ommer-lab.com/files/latent-diffusion/layout2img_model.zip ...