23.Text Guided Person Image Synthesis 文本控制人像的image-to-image生成。用VQA Perceptual Score评估。(效果看起来不是很好) 24.Text-Guided Neural Image Inpainting 文本控制图像修复。有代码。 25.TivGAN Text to Image to VIdeo Generation with Step by Step Evolutionary Generator 文本生成图片再生成视频。分...
6.Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation Approach 使用文本控制image-to-image特定部分的改变,比如“把头发的颜色变成红色”。 7. Development of a New Image-to-text Conversion System for Pashto Farsi and Traditional Chinese 这是ocr,已删。 8.DF-GAN: Deep Fus...
Image variations: Inversion 能重建更精细的特征: Text-guided synthesis: 将原始主题推广到新场景,psedu-word 封装了语义概念 Style transfer: 利用 prompt “A painting in the style ofS_*” 优化 psudo-word, 抽取风格并将其应用于任意场景: Concept compositions: 任意概念的组合。虽然可以实现简单的风格包含...
^TFLCGhttps://silent-chen.github.io/layout-guidance/ ^Guided Image Synthesis via Initial Image Editing in Diffusion Modelhttps://dl.acm.org/doi/abs/10.1145/3581783.3612191
24.Text-Guided Neural Image Inpainting 文本控制图像修复。有代码。 25.TivGAN Text to Image to VIdeo Generation with Step by Step Evolutionary Generator 文本生成图片再生成视频。分两步训练,先根据文本生成高质量的单帧图片,再生成连续帧。 26.Text-to-Image Synthesis Based on Machine Generated Captions ...
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models2022-11-18 最近,扩散模型改进了生成图像生成,从而在各种任务中获得了出色的视觉质量。随着强大的多模态模型(如CLIP)的出现,“AI-Art”领域领域获得了前所未有的增长。通过将语音合成模型与图像合成模型相结合,建立了所谓的“提示工程...
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models2022-11-18 最近,扩散模型改进了生成图像生成,从而在各种任务中获得了出色的视觉质量。随着强大的多模态模型(如CLIP)的出现,“AI-Art”领域领域获得了前所未有的增长。通过将语音合成模型与图像合成模型相结合,建立了所谓的“提示工程...
In this paper, we propose a layer-collaborative diffusion model, named LayerDiff , specifically designed for text-guided, multi-layered, composable image synthesis. The composable image consists of a background layer, a set of foreground layers, and associated mask layers for each foreground ...
MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis 来自 arXiv.org 喜欢 0 阅读量: 14 作者:J Cho,C Zakka,D Kaur,R Shad,R Wightman,A Chaudhari,W Hiesinger 摘要: Diffusion models have recently gained significant traction due to their ability to generate high-...
SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Integrated Character-Level Diffusion and Contextual Consistency CONFORM: Contrast is All You Need for High-Fidelity Text-to-Image Diffusion Models Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing Residual Learning in Di...