11.14.3.4.1 Diffusion for Description to or Text to Image Generation

Chapter Contents (Back)
Diffusion Models. Synthesis. Image Synthesis. Text to Image.
See also Diffusion Process, Diffusion Operators, Mechanism, or Technique.
See also Adversarial Networks for Image Synthesis, Image Generation.

Sun, G.[Gan], Liang, W.Q.[Wen-Qi], Dong, J.H.[Jia-Hua], Li, J.[Jun], Ding, Z.M.[Zheng-Ming], Cong, Y.[Yang],
Create Your World: Lifelong Text-to-Image Diffusion,
PAMI(46), No. 9, September 2024, pp. 6454-6470.
IEEE DOI 2408
Task analysis, Dogs, Computational modeling, Semantics, Training, Neural networks, Continual learning, image generation, stable diffusion BibRef

Chen, H.[Hong], Zhang, Y.P.[Yi-Peng], Wang, X.[Xin], Duan, X.G.[Xu-Guang], Zhou, Y.W.[Yu-Wei], Zhu, W.W.[Wen-Wu],
DisenDreamer: Subject-Driven Text-to-Image Generation With Sample-Aware Disentangled Tuning,
CirSysVideo(34), No. 8, August 2024, pp. 6860-6873.
IEEE DOI 2408
Noise reduction, Visualization, Tuning, Controllability, Circuits and systems, Image synthesis, Training, Diffusion model, disentangled finetuning BibRef

Verma, A.[Ayushi], Badal, T.[Tapas], Bansal, A.[Abhay],
Advancing Image Generation with Denoising Diffusion Probabilistic Model and ConvNeXt-V2: A novel approach for enhanced diversity and quality,
CVIU(247), 2024, pp. 104077.
Elsevier DOI 2408
Deep learning, Diffusion model, Generative model, Image generation BibRef

Xu, Y.F.[Yi-Fei], Xu, X.L.[Xiao-Long], Gao, H.H.[Hong-Hao], Xiao, F.[Fu],
SGDM: An Adaptive Style-Guided Diffusion Model for Personalized Text to Image Generation,
MultMed(26), 2024, pp. 9804-9813.
IEEE DOI 2410
Feature extraction, Adaptation models, Image synthesis, Computational modeling, Training, Task analysis, Noise reduction, image style similarity assessment BibRef

Ramasinghe, S.[Sameera], Shevchenko, V.[Violetta], Avraham, G.[Gil], Thalaiyasingam, A.[Ajanthan],
Accept the Modality Gap: An Exploration in the Hyperbolic Space,
CVPR24(27253-27262)
IEEE DOI 2410
Text to image, Machine learning, Linear programming, multimodal learning, modality gap BibRef

Luo, Y.M.[Yi-Min], Yang, Q.[Qinyu], Fan, Y.H.[Yu-Heng], Qi, H.K.[Hai-Kun], Xia, M.[Menghan],
Measurement Guidance in Diffusion Models: Insight from Medical Image Synthesis,
PAMI(46), No. 12, December 2024, pp. 7983-7997.
IEEE DOI 2411
Task analysis, Medical diagnostic imaging, Uncertainty, Image synthesis, Training, Reliability, Data models, controllable generation BibRef

Cao, J.H.[Jing-Hao], Liu, S.[Sheng], Yang, X.[Xiong], Li, Y.[Yang], Du, S.[Sidan],
ARES: Text-Driven Automatic Realistic Simulator for Autonomous Traffic,
SPLetters(31), 2024, pp. 3049-3053.
IEEE DOI 2411
Trajectory, Rendering (computer graphics), Training, Diffusion models, Accuracy, Logic, Turning, Predictive models BibRef


Zhao, Y.P.[Ya-Ping], Zhang, P.[Pei], Wang, C.[Chutian], Lam, E.Y.[Edmund Y.],
Controllable Unsupervised Event-Based Video Generation,
ICIP24(2278-2284)
IEEE DOI Code:
WWW Link. 2411
Training, Codes, Image edge detection, Cameras, Diffusion models, neuromorphic imaging, computational imaging BibRef

Qazi, T.[Tayeba], Lall, B.[Brejesh],
Thermal Videodiff (TVD): A Diffusion Architecture for Thermal Video Synthesis,
ICIP24(2438-2444)
IEEE DOI Code:
WWW Link. 2411
Deep learning, Temperature distribution, Costs, Infrared imaging, Thermal sensors, Diffusion models, Synthetic Video Generation, Visible Spectrum Context BibRef

Maung-Maung, A.P.[April-Pyone], Nguyen, H.H.[Huy H.], Kiya, H.[Hitoshi], Echizen, I.[Isao],
Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation,
ICIP24(3910-3916)
IEEE DOI 2411
Text to image, Flowering plants, Diffusion models, Feature extraction, Information filters, Internet, Testing, finetuning BibRef

Hudson, D.A.[Drew A.], Zoran, D.[Daniel], Malinowski, M.[Mateusz], Lampinen, A.K.[Andrew K.], Jaegle, A.[Andrew], McClelland, J.L.[James L.], Matthey, L.[Loic], Hill, F.[Felix], Lerchner, A.[Alexander],
SODA: Bottleneck Diffusion Models for Representation Learning,
CVPR24(23115-23127)
IEEE DOI 2410
Representation learning, Training, Visualization, Image synthesis, Semantics, Noise reduction, Self-supervised learning, classification BibRef

Karras, T.[Tero], Aittala, M.[Miika], Lehtinen, J.[Jaakko], Hellsten, J.[Janne], Aila, T.[Timo], Laine, S.[Samuli],
Analyzing and Improving the Training Dynamics of Diffusion Models,
CVPR24(24174-24184)
IEEE DOI 2410
Training, Systematics, Costs, Image synthesis, Computer architecture, Network architecture BibRef

Gu, Y.M.[Yu-Ming], Xu, H.Y.[Hong-Yi], Xie, Y.[You], Song, G.X.[Guo-Xian], Shi, Y.C.[Yi-Chun], Chang, D.[Di], Yang, J.[Jing], Luo, L.J.[Lin-Jie],
DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis,
CVPR24(10456-10465)
IEEE DOI 2410
Training, Visualization, Noise reduction, Noise, Cameras, Diffusion models, diffusion model, generative model, single to 3D BibRef

Li, J.[Jing], Wang, Z.[Zigan], Li, J.L.[Jin-Liang],
AdvDenoise: Fast Generation Framework of Universal and Robust Adversarial Patches Using Denoise,
SAIAD24(3481-3490)
IEEE DOI Code:
WWW Link. 2410
Visualization, Computational modeling, Noise reduction, Diffusion models, Transformers, Robustness BibRef

Wang, C.[Changyuan], Wang, Z.W.[Zi-Wei], Xu, X.W.[Xiu-Wei], Tang, Y.S.[Yan-Song], Zhou, J.[Jie], Lu, J.W.[Ji-Wen],
Towards Accurate Post-Training Quantization for Diffusion Models,
CVPR24(16026-16035)
IEEE DOI Code:
WWW Link. 2410
Quantization (signal), Risk minimization, Accuracy, Tensors, Image synthesis, Diffusion models, Minimization, diffusion model, network quantization BibRef

Islam, K.[Khawar], Zaheer, M.Z.[Muhammad Zaigham], Mahmood, A.[Arif], Nandakumar, K.[Karthik],
Diffusemix: Label-Preserving Data Augmentation with Diffusion Models,
CVPR24(27611-27620)
IEEE DOI Code:
WWW Link. 2410
Training, Performance gain, Diffusion models, Data augmentation, Robustness, Image augmentation, Fractals, data augmentation, cutmix BibRef

Miao, Z.C.[Zi-Chen], Wang, J.[Jiang], Wang, Z.[Ze], Yang, Z.Y.[Zheng-Yuan], Wang, L.J.[Li-Juan], Qiu, Q.[Qiang], Liu, Z.C.[Zi-Cheng],
Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning,
CVPR24(10844-10853)
IEEE DOI 2410
Training, Gradient methods, Limiting, Image synthesis, Estimation, Diffusion processes, Reinforcement learning BibRef

Shabani, M.A.[Mohammad Amin], Wang, Z.W.[Zhao-Wen], Liu, D.[Difan], Zhao, N.X.[Nan-Xuan], Yang, J.[Jimei], Furukawa, Y.[Yasutaka],
Visual Layout Composer: Image-Vector Dual Diffusion Model for Design Layout Generation,
CVPR24(9222-9231)
IEEE DOI Code:
WWW Link. 2410
Visualization, Computational modeling, Layout, Diffusion models, Controllability, Vectors BibRef

Qian, Y.R.[Yu-Rui], Cai, Q.[Qi], Pan, Y.W.[Ying-Wei], Li, Y.[Yehao], Yao, T.[Ting], Sun, Q.[Qibin], Mei, T.[Tao],
Boosting Diffusion Models with Moving Average Sampling in Frequency Domain,
CVPR24(8911-8920)
IEEE DOI 2410
Schedules, Image synthesis, Frequency-domain analysis, Noise reduction, Diffusion processes, Diffusion models, image generation BibRef

Yang, K.[Kai], Tao, J.[Jian], Lyu, J.[Jiafei], Ge, C.J.[Chun-Jiang], Chen, J.X.[Jia-Xin], Shen, W.H.[Wei-Han], Zhu, X.L.[Xiao-Long], Li, X.[Xiu],
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model,
CVPR24(8941-8951)
IEEE DOI Code:
WWW Link. 2410
Training, Analytical models, Image coding, Computational modeling, Noise reduction, Graphics processing units, Diffusion models, Human feedback BibRef

Zhu, R.[Rui], Pan, Y.W.[Ying-Wei], Li, Y.[Yehao], Yao, T.[Ting], Sun, Z.L.[Zheng-Long], Mei, T.[Tao], Chen, C.W.[Chang Wen],
SD-DiT: Unleashing the Power of Self-Supervised Discrimination in Diffusion Transformer*,
CVPR24(8435-8445)
IEEE DOI 2410
Training, Image synthesis, Noise, Diffusion processes, Ordinary differential equations, Transformers, self-supervised learning BibRef

Zhou, Z.Y.[Zhen-Yu], Chen, D.[Defang], Wang, C.[Can], Chen, C.[Chun],
Fast ODE-based Sampling for Diffusion Models in Around 5 Steps,
CVPR24(7777-7786)
IEEE DOI Code:
WWW Link. 2410
Degradation, Image resolution, Image synthesis, Ordinary differential equations, Diffusion models, Fast Sampling BibRef

Lee, H.Y.[Hsin-Ying], Tseng, H.Y.[Hung-Yu], Lee, H.Y.[Hsin-Ying], Yang, M.H.[Ming-Hsuan],
Exploiting Diffusion Prior for Generalizable Dense Prediction,
CVPR24(7861-7871)
IEEE DOI Code:
WWW Link. 2410
Adaptation models, Visualization, Training data, Stochastic processes, Estimation, Diffusion processes, image generation BibRef

Zhang, K.W.[Kai-Wen], Zhou, Y.F.[Yi-Fan], Xu, X.D.[Xu-Dong], Dai, B.[Bo], Pan, X.G.[Xin-Gang],
DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing,
CVPR24(7912-7921)
IEEE DOI 2410
Interpolation, Schedules, Image synthesis, Semantics, Image morphing, Noise, Fitting, Diffusion models, Image morphing, video generation BibRef

Li, M.Y.[Mu-Yang], Cai, T.[Tianle], Cao, J.X.[Jia-Xin], Zhang, Q.S.[Qin-Sheng], Cai, H.[Han], Bai, J.J.[Jun-Jie], Jia, Y.Q.[Yang-Qing], Li, K.[Kai], Han, S.[Song],
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models,
CVPR24(7183-7193)
IEEE DOI 2410
Degradation, Computational modeling, Graphics processing units, Diffusion processes, Parallel processing, Diffusion models, generative-ai BibRef

Koley, S.[Subhadeep], Bhunia, A.K.[Ayan Kumar], Sekhri, D.[Deeptanshu], Sain, A.[Aneeshan], Chowdhury, P.N.[Pinaki Nath], Xiang, T.[Tao], Song, Y.Z.[Yi-Zhe],
It's All About Your Sketch: Democratising Sketch Control in Diffusion Models,
CVPR24(7204-7214)
IEEE DOI 2410
Adaptation models, Adaptive systems, Navigation, Generative AI, Image retrieval, Process control, Streaming media BibRef

Wang, Y.[Yibo], Gao, R.[Ruiyuan], Chen, K.[Kai], Zhou, K.Q.[Kai-Qiang], Cai, Y.J.[Ying-Jie], Hong, L.[Lanqing], Li, Z.G.[Zhen-Guo], Jiang, L.H.[Li-Hui], Yeung, D.Y.[Dit-Yan], Xu, Q.[Qiang], Zhang, K.[Kai],
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception,
CVPR24(7246-7255)
IEEE DOI 2410
Image segmentation, Image recognition, Image synthesis, Training data, Object detection, Diffusion models, Data augmentation BibRef

Zhang, P.Z.[Peng-Ze], Yin, H.[Hubery], Li, C.[Chen], Xie, X.H.[Xiao-Hua],
Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models,
CVPR24(6945-6954)
IEEE DOI 2410
Training, Brightness, Gaussian distribution, Diffusion models, Diffusion Model, Generative Model, Singularity BibRef

Hong, S.[Seongmin], Lee, K.[Kyeonghyun], Jeon, S.Y.[Suh Yoon], Bae, H.[Hyewon], Chun, S.Y.[Se Young],
On Exact Inversion of DPM-Solvers,
CVPR24(7069-7078)
IEEE DOI 2410
Noise, Noise reduction, Watermarking, Diffusion models, Robustness, Diffusion, Inversion, DPM-Solver BibRef

Yang, J.[Jiayu], Cheng, Z.[Ziang], Duan, Y.F.[Yun-Fei], Ji, P.[Pan], Li, H.D.[Hong-Dong],
ConsistNet: Enforcing 3D Consistency for Multi-View Images Diffusion,
CVPR24(7079-7088)
IEEE DOI Code:
WWW Link. 2410
Solid modeling, Image synthesis, Computational modeling, Graphics processing units, Diffusion models, latent diffusion model BibRef

Fu, B.[Bin], Yu, F.[Fanghua], Liu, A.[Anran], Wang, Z.X.[Zi-Xuan], Wen, J.[Jie], He, J.J.[Jun-Jun], Qiao, Y.[Yu],
Generate Like Experts: Multi-Stage Font Generation by Incorporating Font Transfer Process into Diffusion Models,
CVPR24(6892-6901)
IEEE DOI Code:
WWW Link. 2410
Costs, Noise, Diffusion processes, Transforms, Manuals, Diffusion models, Generative adversarial networks, Probabilistic Generative Model BibRef

Deng, F.[Fei], Wang, Q.F.[Qi-Fei], Wei, W.[Wei], Hou, T.B.[Ting-Bo], Grundmann, M.[Matthias],
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models,
CVPR24(7423-7433)
IEEE DOI 2410
Training, Technological innovation, Closed box, Reinforcement learning, Diffusion models, RLHF BibRef

Du, R.[Ruoyi], Chang, D.L.[Dong-Liang], Hospedales, T.[Timothy], Song, Y.Z.[Yi-Zhe], Ma, Z.Y.[Zhan-Yu],
DemoFusion: Democratising High-Resolution Image Generation With No $$,
CVPR24(6159-6168)
IEEE DOI 2410
Training, Image resolution, Image synthesis, Generative AI, Semantics, Memory management, Image Generation, Diffusion Model, High-resolution BibRef

Wang, H.J.[Hong-Jie], Liu, D.[Difan], Kang, Y.[Yan], Li, Y.J.[Yi-Jun], Lin, Z.[Zhe], Jha, N.K.[Niraj K.], Liu, Y.C.[Yu-Chen],
Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models,
CVPR24(16080-16089)
IEEE DOI Code:
WWW Link. 2410
Image quality, Schedules, Costs, Convolution, Computational modeling, Noise reduction, diffusion model, training-free, efficiency, attention map BibRef

Chen, H.X.[Hao-Xin], Zhang, Y.[Yong], Cun, X.D.[Xiao-Dong], Xia, M.H.[Meng-Han], Wang, X.[Xintao], Weng, C.[Chao], Shan, Y.[Ying],
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models,
CVPR24(7310-7320)
IEEE DOI 2410
Training, Couplings, Degradation, Analytical models, Noise, Diffusion models BibRef

Kang, J.[Junoh], Choi, J.[Jinyoung], Choi, S.[Sungik], Han, B.H.[Bo-Hyung],
Observation-Guided Diffusion Probabilistic Models,
CVPR24(8323-8331)
IEEE DOI Code:
WWW Link. 2410
Training, Accuracy, Computational modeling, Noise reduction, Quality control, Diffusion models, Robustness, generative models, diffusion models BibRef

Zhou, J.X.[Jin-Xin], Ding, T.Y.[Tian-Yu], Chen, T.Y.[Tian-Yi], Jiang, J.C.[Jia-Chen], Zharkov, I.[Ilya], Zhu, Z.H.[Zhi-Hui], Liang, L.[Luming],
DREAM: Diffusion Rectification and Estimation-Adaptive Models,
CVPR24(8342-8351)
IEEE DOI 2410
Training, Image quality, Navigation, Source coding, Superresolution, Estimation, Distortion BibRef

Chen, C.[Chen], Liu, D.[Daochang], Xu, C.[Chang],
Towards Memorization-Free Diffusion Models,
CVPR24(8425-8434)
IEEE DOI 2410
Image quality, Training, Measurement, Refining, Noise reduction, Training data, Reliability theory, Diffusion Models, Memorization BibRef

Qi, L.[Lu], Yang, L.[Lehan], Guo, W.D.[Wei-Dong], Xu, Y.[Yu], Du, B.[Bo], Jampani, V.[Varun], Yang, M.H.[Ming-Hsuan],
UniGS: Unified Representation for Image Generation and Segmentation,
CVPR24(6305-6315)
IEEE DOI 2410
Training, Image segmentation, Image synthesis, Image color analysis, Pipelines, Training data, Transforms, diffusion BibRef

Wan, Z.Y.[Zi-Yu], Paschalidou, D.[Despoina], Huang, I.[Ian], Liu, H.Y.[Hong-Yu], Shen, B.[Bokui], Xiang, X.Y.[Xiao-Yu], Liao, J.[Jing], Guibas, L.J.[Leonidas J.],
CAD: Photorealistic 3D Generation via Adversarial Distillation,
CVPR24(10194-10207)
IEEE DOI 2410
Training, Solid modeling, Interpolation, Pipelines, Diffusion models, Rendering (computer graphics) BibRef

Wang, L.Z.[Le-Zhong], Frisvad, J.R.[Jeppe Revall], Jensen, M.B.[Mark Bo], Bigdeli, S.A.[Siavash Arjomand],
StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models,
GCV24(7416-7425)
IEEE DOI 2410
Image quality, Image synthesis, Extended reality, Pipelines, Noise reduction, Diffusion models, Deep Image/Video Synthesis, Stable Diffusion BibRef

Sharma, N.[Nakul], Tripathi, A.[Aditay], Chakraborty, A.[Anirban], Mishra, A.[Anand],
Sketch-guided Image Inpainting with Partial Discrete Diffusion Process,
NTIRE24(6024-6034)
IEEE DOI Code:
WWW Link. 2410
Visualization, Shape, Semantics, Diffusion processes, Text to image, Transformers BibRef

Shi, F.Y.[Feng-Yuan], Gu, J.X.[Jia-Xi], Xu, H.[Hang], Xu, S.[Songcen], Zhang, W.[Wei], Wang, L.M.[Li-Min],
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models,
CVPR24(7393-7402)
IEEE DOI 2410
Training, Smoothing methods, Image synthesis, Memory management, Text to image, Diffusion models, Video Synthesis, Diffusion models, General Framework BibRef

Guo, J.Y.[Jia-Yi], Xu, X.Q.[Xing-Qian], Pu, Y.F.[Yi-Fan], Ni, Z.[Zanlin], Wang, C.F.[Chao-Fei], Vasu, M.[Manushree], Song, S.[Shiji], Huang, G.[Gao], Shi, H.[Humphrey],
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models,
CVPR24(7548-7558)
IEEE DOI Code:
WWW Link. 2410
Training, Measurement, Interpolation, Visualization, Fluctuations, Perturbation methods, Text to image BibRef

Lyu, M.Y.[Meng-Yao], Yang, Y.H.[Yu-Hong], Hong, H.[Haiwen], Chen, H.[Hui], Jin, X.[Xuan], He, Y.[Yuan], Xue, H.[Hui], Han, J.G.[Jun-Gong], Ding, G.[Guiguang],
One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications,
CVPR24(7559-7568)
IEEE DOI Code:
WWW Link. 2410
Deformable models, Adaptation models, Costs, Deformation, Text to image, Diffusion models, Permeability, Diffusion Models, Concept Erasing BibRef

Yang, L.[Ling], Qian, H.T.[Hao-Tian], Zhang, Z.L.[Zhi-Ling], Liu, J.W.[Jing-Wei], Cui, B.[Bin],
Structure-Guided Adversarial Training of Diffusion Models,
CVPR24(7256-7266)
IEEE DOI 2410
Training, Manifolds, Image synthesis, Noise reduction, Text to image, Diffusion models, Data models, Diffusion models, generative models, Image generation BibRef

Yu, Y.Y.[Yu-Yang], Liu, B.Z.[Bang-Zhen], Zheng, C.X.[Chen-Xi], Xu, X.M.[Xue-Miao], He, S.F.[Sheng-Feng], Zhang, H.D.[Huai-Dong],
Beyond Textual Constraints: Learning Novel Diffusion Conditions with Fewer Examples,
CVPR24(7109-7118)
IEEE DOI Code:
WWW Link. 2410
Training, Adaptation models, Codes, Text to image, Diffusion processes, Diffusion models, diffusion model BibRef

Xing, X.[Ximing], Zhou, H.T.[Hai-Tao], Wang, C.[Chuang], Zhang, J.[Jing], Xu, D.[Dong], Yu, Q.[Qian],
SVGDreamer: Text Guided SVG Generation with Diffusion Model,
CVPR24(4546-4555)
IEEE DOI Code:
WWW Link. 2410
Visualization, Image color analysis, Shape, Text to image, Process control, Diffusion models, vector graphics, SVG, text-to-svg, Diffusion BibRef

Huang, X.[Xin], Shao, R.Z.[Rui-Zhi], Zhang, Q.[Qi], Zhang, H.W.[Hong-Wen], Feng, Y.[Ying], Liu, Y.B.[Ye-Bin], Wang, Q.[Qing],
HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation,
CVPR24(4568-4577)
IEEE DOI Code:
WWW Link. 2410
Geometry, Solid modeling, Text to image, Color, Diffusion models, Diffusion Model, 3D Human, 3D Generation BibRef

Parihar, R.[Rishubh], Bhat, A.[Abhijnya], Basu, A.[Abhipsa], Mallick, S.[Saswat], Kundu, J.N.[Jogendra Nath], Babu, R.V.[R. Venkatesh],
Balancing Act: Distribution-Guided Debiasing in Diffusion Models,
CVPR24(6668-6678)
IEEE DOI 2410
Training, Image synthesis, Semantics, Noise reduction, Text to image, Diffusion models, Data augmentation, Debiasing, diffusion models, generative models BibRef

Ren, J.W.[Jia-Wei], Xu, M.M.[Meng-Meng], Wu, J.C.[Jui-Chieh], Liu, Z.W.[Zi-Wei], Xiang, T.[Tao], Toisoul, A.[Antoine],
Move Anything with Layered Scene Diffusion,
CVPR24(6380-6389)
IEEE DOI 2410
Codes, Layout, Noise reduction, Memory management, Text to image, Process control BibRef

Lu, Y.Z.[Yan-Zuo], Zhang, M.[Manlin], Ma, A.J.[Andy J.], Xie, X.H.[Xiao-Hua], Lai, J.H.[Jian-Huang],
Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis,
CVPR24(6420-6429)
IEEE DOI Code:
WWW Link. 2410
Training, Image synthesis, Semantics, Text to image, Process control, Diffusion models, Generators, Diffusion Model, Person Image Synthesis BibRef

Liu, C.[Chang], Wu, H.[Haoning], Zhong, Y.J.[Yu-Jie], Zhang, X.Y.[Xiao-Yun], Wang, Y.F.[Yan-Feng], Xie, W.[Weidi],
Intelligent Grimm: Open-ended Visual Storytelling via Latent Diffusion Models,
CVPR24(6190-6200)
IEEE DOI Code:
WWW Link. 2410
Visualization, Electronic publishing, Computational modeling, Pipelines, Text to image, Image sequences, Diffusion Models BibRef

Wimbauer, F.[Felix], Wu, B.[Bichen], Schoenfeld, E.[Edgar], Dai, X.L.[Xiao-Liang], Hou, J.[Ji], He, Z.J.[Zi-Jian], Sanakoyeu, A.[Artsiom], Zhang, P.Z.[Pei-Zhao], Tsai, S.[Sam], Kohler, J.[Jonas], Rupprecht, C.[Christian], Cremers, D.[Daniel], Vajda, P.[Peter], Wang, J.L.[Jia-Liang],
Cache Me if You Can: Accelerating Diffusion Models through Block Caching,
CVPR24(6211-6220)
IEEE DOI 2410
Image quality, Visualization, Schedules, Image synthesis, Computational modeling, Noise reduction, Noise, diffusion, fid BibRef

Dalva, Y.[Yusuf], Yanardag, P.[Pinar],
NoiseCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models,
CVPR24(24209-24218)
IEEE DOI 2410
Image synthesis, Computational modeling, Semantics, Text to image, Contrastive learning, Aerospace electronics, Diffusion models, semantic discovery BibRef

Sun, H.[Haoze], Li, W.B.[Wen-Bo], Liu, J.Z.[Jian-Zhuang], Chen, H.Y.[Hao-Yu], Pei, R.[Renjing], Zou, X.[Xueyi], Yan, Y.[Youliang], Yang, Y.[Yujiu],
CoSeR: Bridging Image and Language for Cognitive Super-Resolution,
CVPR24(25868-25878)
IEEE DOI Code:
WWW Link. 2410
Computational modeling, Superresolution, Semantics, Text to image, Benchmark testing, Diffusion models BibRef

Wang, Z.C.[Zhi-Cai], Wei, L.[Longhui], Wang, T.[Tan], Chen, H.[Heyu], Hao, Y.[Yanbin], Wang, X.[Xiang], He, X.N.[Xiang-Nan], Tian, Q.[Qi],
Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model,
CVPR24(17223-17233)
IEEE DOI Code:
WWW Link. 2410
Training, Computational modeling, Text to image, Data augmentation, Diffusion models, diffusion model, data augmentation BibRef

Hsiao, Y.T.[Yi-Ting], Khodadadeh, S.[Siavash], Duarte, K.[Kevin], Lin, W.A.[Wei-An], Qu, H.[Hui], Kwon, M.[Mingi], Kalarot, R.[Ratheesh],
Plug-and-Play Diffusion Distillation,
CVPR24(13743-13752)
IEEE DOI 2410
Training, Visualization, Image synthesis, Computational modeling, Text to image, Diffusion processes, distillation, model efficiency, diffusion model BibRef

Zhan, C.[Chenlu], Lin, Y.[Yu], Wang, G.[Gaoang], Wang, H.W.[Hong-Wei], Wu, J.[Jian],
MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant,
CVPR24(11502-11512)
IEEE DOI 2410
Visualization, Adaptation models, Technological innovation, Magnetic resonance imaging, Text to image, Medical services, Diffusion Model BibRef

Kant, Y.[Yash], Siarohin, A.[Aliaksandr], Wu, Z.[Ziyi], Vasilkovsky, M.[Michael], Qian, G.[Guocheng], Ren, J.[Jian], Guler, R.A.[Riza Alp], Ghanem, B.[Bernard], Tulyakov, S.[Sergey], Gilitschenski, I.[Igor],
SPAD: Spatially Aware Multi-View Diffusers,
CVPR24(10026-10038)
IEEE DOI 2410
Geometry, Text to image, Transforms, Cameras, Diffusion models, Encoding, novel view synthesis, diffusion BibRef

Starodubcev, N.[Nikita], Baranchuk, D.[Dmitry], Fedorov, A.[Artem], Babenko, A.[Artem],
Your Student is Better than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models,
CVPR24(9275-9285)
IEEE DOI 2410
Adaptation models, Computational modeling, Pipelines, Text to image, Collaboration, Diffusion models, Image and video synthesis and generation BibRef

Mei, K.[Kangfu], Delbracio, M.[Mauricio], Talebi, H.[Hossein], Tu, Z.Z.[Zheng-Zhong], Patel, V.M.[Vishal M.], Milanfar, P.[Peyman],
CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation,
CVPR24(9048-9058)
IEEE DOI 2410
Image synthesis, Superresolution, Text to image, Computer architecture, Predictive models, Diffusion models BibRef

Ran, L.M.[Ling-Min], Cun, X.D.[Xiao-Dong], Liu, J.W.[Jia-Wei], Zhao, R.[Rui], Zijie, S.[Song], Wang, X.[Xintao], Keppo, J.[Jussi], Shou, M.Z.[Mike Zheng],
X- Adapter: Universal Compatibility of Plugins for Upgraded Diffusion Model,
CVPR24(8775-8784)
IEEE DOI Code:
WWW Link. 2410
Training, Connectors, Adaptation models, Noise reduction, Text to image, Diffusion models, Data models BibRef

Liu, Y.J.[Yu-Jian], Zhang, Y.[Yang], Jaakkola, T.[Tommi], Chang, S.Y.[Shi-Yu],
Correcting Diffusion Generation Through Resampling,
CVPR24(8713-8723)
IEEE DOI Code:
WWW Link. 2410
Image quality, Image synthesis, Filtering, Computational modeling, Text to image, Detectors, image generation, diffusion model, particle filtering BibRef

Luo, G.[Grace], Darrell, T.J.[Trevor J.], Wang, O.[Oliver], Goldman, D.B.[Dan B], Holynski, A.[Aleksander],
Readout Guidance: Learning Control from Diffusion Features,
CVPR24(8217-8227)
IEEE DOI Code:
WWW Link. 2410
Training, Head, Image edge detection, Training data, Text to image, Diffusion models, Image and video synthesis and generation BibRef

Wallace, B.[Bram], Dang, M.[Meihua], Rafailov, R.[Rafael], Zhou, L.Q.[Lin-Qi], Lou, A.[Aaron], Purushwalkam, S.[Senthil], Ermon, S.[Stefano], Xiong, C.M.[Cai-Ming], Joty, S.[Shafiq], Naik, N.[Nikhil],
Diffusion Model Alignment Using Direct Preference Optimization,
CVPR24(8228-8238)
IEEE DOI 2410
Training, Learning systems, Visualization, Pipelines, Text to image, Reinforcement learning, Diffusion models, generative, diffusion, dpo BibRef

Yan, J.N.[Jing Nathan], Gu, J.[Jiatao], Rush, A.M.[Alexander M.],
Diffusion Models Without Attention,
CVPR24(8239-8249)
IEEE DOI 2410
Training, Image resolution, Computational modeling, Noise reduction, Text to image, Computer architecture BibRef

Gokaslan, A.[Aaron], Cooper, A.F.[A. Feder], Collins, J.[Jasmine], Seguin, L.[Landan], Jacobson, A.[Austin], Patel, M.[Mihir], Frankle, J.[Jonathan], Stephenson, C.[Cory], Kuleshov, V.[Volodymyr],
Common Canvas: Open Diffusion Models Trained on Creative-Commons Images,
CVPR24(8250-8260)
IEEE DOI 2410
Training, Computational modeling, Transfer learning, Text to image, Diffusion models, Data models, diffusion, copyright, text2image, dataset BibRef

Habibian, A.[Amirhossein], Ghodrati, A.[Amir], Fathima, N.[Noor], Sautiere, G.[Guillaume], Garrepalli, R.[Risheek], Porikli, F.M.[Fatih M.], Petersen, J.[Jens],
Clockwork Diffusion: Efficient Generation With Model-Step Distillation,
CVPR24(8352-8361)
IEEE DOI Code:
WWW Link. 2410
Training, Adaptation models, Runtime, Noise reduction, Semantics, Layout, Text to image, diffusion, efficient diffusion, distillation BibRef

Wang, J.Y.[Jun-Yan], Sun, Z.H.[Zhen-Hong], Tan, Z.Y.[Zhi-Yu], Chen, X.B.[Xuan-Bai], Chen, W.H.[Wei-Hua], Li, H.[Hao], Zhang, C.[Cheng], Song, Y.[Yang],
Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation,
CVPR24(8446-8455)
IEEE DOI Code:
WWW Link. 2410
Accuracy, Image synthesis, Semantics, Text to image, Diffusion processes, Diffusion models BibRef

Lin, H.[Haonan],
DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation,
CVPR24(8589-8598)
IEEE DOI 2410
Image quality, Face recognition, Semantics, Noise reduction, Noise, Text to image, Stochastic processes, staged diffusion framework BibRef

Li, Z.[Zhen], Cao, M.D.[Ming-Deng], Wang, X.[Xintao], Qi, Z.A.[Zhong-Ang], Cheng, M.M.[Ming-Ming], Shan, Y.[Ying],
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding,
CVPR24(8640-8650)
IEEE DOI 2410
Training, Pipelines, Text to image, Training data, Controllability, diffusion model, personalization, face synthesis BibRef

Feng, Y.T.[Yu-Tong], Gong, B.[Biao], Chen, D.[Di], Shen, Y.J.[Yu-Jun], Liu, Y.[Yu], Zhou, J.[Jingren],
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following,
CVPR24(4744-4753)
IEEE DOI 2410
Visualization, Protocols, Semantics, Pipelines, Text to image, Diffusion models, Generators, diffusion model, text-to-image BibRef

Lu, S.L.[Shi-Lin], Wang, Z.[Zilan], Li, L.[Leyang], Liu, Y.Z.[Yan-Zhu], Kong, A.W.K.[Adams Wai-Kin],
MACE: Mass Concept Erasure in Diffusion Models,
CVPR24(6430-6440)
IEEE DOI Code:
WWW Link. 2410
Codes, Text to image, Interference, Diffusion models, Generative AI, AI security, diffusion model, concept editing BibRef

Nam, J.[Jisu], Kim, H.[Heesu], Lee, D.[DongJae], Jin, S.[Siyoon], Kim, S.[Seungryong], Chang, S.[Seunggyu],
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization,
CVPR24(8100-8110)
IEEE DOI 2410
Visualization, Computational modeling, Semantics, Noise reduction, Text to image, Diffusion models, Diffusion Models, Semantic Correspondence BibRef

Ham, C.[Cusuh], Fisher, M.[Matthew], Hays, J.[James], Kolkin, N.[Nicholas], Liu, Y.C.[Yu-Chen], Zhang, R.[Richard], Hinz, T.[Tobias],
Personalized Residuals for Concept-Driven Text-to-Image Generation,
CVPR24(8186-8195)
IEEE DOI 2410
Training, Measurement, Computational modeling, Text to image, Graphics processing units, Diffusion models, personalization, diffusion models BibRef

Phung, Q.[Quynh], Ge, S.W.[Song-Wei], Huang, J.B.[Jia-Bin],
Grounded Text-to-Image Synthesis with Attention Refocusing,
CVPR24(7932-7942)
IEEE DOI 2410
Visualization, Large language models, Computational modeling, Layout, Text to image, Benchmark testing, Diffusion models, grounded text-to-image BibRef

Nguyen, T.H.[Thuan Hoang], Tran, A.[Anh],
SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation,
CVPR24(7807-7816)
IEEE DOI 2410
Training, Solid modeling, Text to image, Diffusion models, Neural radiance field, Data models BibRef

Cao, C.J.[Chen-Jie], Cai, Y.[Yunuo], Dong, Q.[Qiaole], Wang, Y.K.[Yi-Kai], Fu, Y.W.[Yan-Wei],
LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model,
CVPR24(7705-7715)
IEEE DOI Code:
WWW Link. 2410
Adaptation models, Image synthesis, Text to image, Diffusion models, Filling, Diffusion Model, Image Inpainting BibRef

Mo, S.C.[Si-Cheng], Mu, F.Z.[Fang-Zhou], Lin, K.H.[Kuan Heng], Liu, Y.L.[Yan-Li], Guan, B.[Bochen], Li, Y.[Yin], Zhou, B.[Bolei],
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition,
CVPR24(7465-7475)
IEEE DOI Code:
WWW Link. 2410
Visualization, Text to image, Computer architecture, Aerospace electronics, Diffusion models, Feature extraction, Controllable generation BibRef

Huang, M.Q.[Meng-Qi], Mao, Z.D.[Zhen-Dong], Liu, M.C.[Ming-Cong], He, Q.[Qian], Zhang, Y.D.[Yong-Dong],
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization,
CVPR24(7476-7485)
IEEE DOI 2410
Training, Visualization, Adaptive systems, Limiting, Navigation, Text to image, text-to-image generation, diffusion models BibRef

Mahajan, S.[Shweta], Rahman, T.[Tanzila], Yi, K.M.[Kwang Moo], Sigal, L.[Leonid],
Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models,
CVPR24(6808-6817)
IEEE DOI 2410
Vocabulary, Visualization, Image synthesis, Semantics, Text to image, Diffusion processes, Diffusion models BibRef

Zhou, D.[Dewei], Li, Y.[You], Ma, F.[Fan], Zhang, X.T.[Xiao-Ting], Yang, Y.[Yi],
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis,
CVPR24(6818-6828)
IEEE DOI Code:
WWW Link. 2410
Codes, Attention mechanisms, Aggregates, Pipelines, Layout, Text to image, AIGC, Diffusion Models, Image Generation, Stable Diffusion BibRef

Zeng, Y.[Yu], Patel, V.M.[Vishal M.], Wang, H.C.[Hao-Chen], Huang, X.[Xun], Wang, T.C.[Ting-Chun], Liu, M.Y.[Ming-Yu], Balaji, Y.[Yogesh],
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation,
CVPR24(6786-6795)
IEEE DOI 2410
Adaptation models, Computational modeling, Text to image, Benchmark testing, Diffusion models, image generation BibRef

Gong, B.[Biao], Huang, S.[Siteng], Feng, Y.T.[Yu-Tong], Zhang, S.W.[Shi-Wei], Li, Y.[Yuyuan], Liu, Y.[Yu],
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text- to- Image Generation,
CVPR24(6624-6634)
IEEE DOI Code:
WWW Link. 2410
Image synthesis, Layout, Pipelines, Text to image, Benchmark testing, Diffusion models, Generators, text-to-image generation, training-free BibRef

Hoe, J.T.[Jiun Tian], Jiang, X.D.[Xu-Dong], Chan, C.S.[Chee Seng], Tan, Y.P.[Yap-Peng], Hu, W.P.[Wei-Peng],
InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models,
CVPR24(6180-6189)
IEEE DOI Code:
WWW Link. 2410
Location awareness, Visualization, Computational modeling, Layout, Text to image, Diffusion models, image generation, generative ai BibRef

Menon, S.[Sachit], Misra, I.[Ishan], Girdhar, R.[Rohit],
Generating Illustrated Instructions,
CVPR24(6274-6284)
IEEE DOI 2410
Measurement, Visualization, Large language models, Text to image, Diffusion models, diffusion, multimodal, text-to-image BibRef

Yang, J.Y.[Jing-Yuan], Feng, J.W.[Jia-Wei], Huang, H.[Hui],
EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models,
CVPR24(6358-6368)
IEEE DOI Code:
WWW Link. 2410
Measurement, Visualization, Image color analysis, Image synthesis, Semantics, Text to image BibRef

Dong, Y.[Yuan], Zuo, Q.[Qi], Gu, X.D.[Xiao-Dong], Yuan, W.H.[Wei-Hao], Zhao, Z.Y.[Zheng-Yi], Dong, Z.L.[Zi-Long], Bo, L.F.[Lie-Feng], Huang, Q.X.[Qi-Xing],
GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors,
CVPR24(56-66)
IEEE DOI 2410
Solid modeling, Codes, Shape, Computational modeling, Noise reduction, Shape Generative Model, Latent Diffusion, Quality Checker BibRef

Yang, Y.J.[Yi-Jun], Gao, R.[Ruiyuan], Wang, X.[Xiaosen], Ho, T.Y.[Tsung-Yi], Xu, N.[Nan], xu, Q.[Qiang],
MMA-Diffusion: MultiModal Attack on Diffusion Models,
CVPR24(7737-7746)
IEEE DOI Code:
WWW Link. 2410
Visualization, Filters, Current measurement, Computational modeling, Text to image, Diffusion models, Adversarial attack BibRef

Hedlin, E.[Eric], Sharma, G.[Gopal], Mahajan, S.[Shweta], He, X.Z.[Xing-Zhe], Isack, H.[Hossam], Kar, A.[Abhishek], Rhodin, H.[Helge], Tagliasacchi, A.[Andrea], Yi, K.M.[Kwang Moo],
Unsupervised Keypoints from Pretrained Diffusion Models,
CVPR24(22820-22830)
IEEE DOI 2410
Codes, Noise reduction, Neural networks, Text to image, Computer architecture, Diffusion models, Diffusion models, emergent understandings BibRef

Sato, T.[Takami], Yue, J.[Justin], Chen, N.[Nanze], Wang, N.[Ningfei], Chen, Q.A.[Qi Alfred],
Intriguing Properties of Diffusion Models: An Empirical Study of the Natural Attack Capability in Text-to-Image Generative Models,
CVPR24(24635-24644)
IEEE DOI 2410
Noise reduction, Text to image, Artificial neural networks, Visual systems, Predictive models, Diffusion models, Safety BibRef

Gandikota, K.V.[Kanchana Vaishnavi], Chandramouli, P.[Paramanand],
Text-Guided Explorable Image Super-Resolution,
CVPR24(25900-25911)
IEEE DOI 2410
Training, Degradation, Superresolution, Semantics, Text to image, Diffusion models, diffusion, text-to-image, super-resolution BibRef

Mo, W.[Wenyi], Zhang, T.Y.[Tian-Yu], Bai, Y.[Yalong], Su, B.[Bing], Wen, J.R.[Ji-Rong], Yang, Q.[Qing],
Dynamic Prompt Optimizing for Text-to-Image Generation,
CVPR24(26617-26626)
IEEE DOI 2410
Uniform resource locators, Training, Image synthesis, Semantics, Refining, Text to image, Reinforcement learning, Diffusion Model BibRef

Smith, J.S.[James Seale], Hsu, Y.C.[Yen-Chang], Kira, Z.[Zsolt], Shen, Y.L.[Yi-Lin], Jin, H.X.[Hong-Xia],
Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters,
WhatNext24(1744-1754)
IEEE DOI 2410
Training, Continuing education, Costs, Text to image, Benchmark testing, Diffusion models, text-to-image customization BibRef

Zhang, G.[Gong], Wang, K.[Kai], Xu, X.Q.[Xing-Qian], Wang, Z.Y.[Zhang-Yang], Shi, H.[Humphrey],
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models,
WhatNext24(1755-1764)
IEEE DOI 2410
Adaptation models, Privacy, Accuracy, Computational modeling, Knowledge based systems, Text to image, Safety, text-to-image, concept forgetting BibRef

Tudosiu, P.D.[Petru-Daniel], Yang, Y.X.[Yong-Xin], Zhang, S.F.[Shi-Feng], Chen, F.[Fei], McDonagh, S.[Steven], Lampouras, G.[Gerasimos], Iacobacci, I.[Ignacio], Parisot, S.[Sarah],
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation,
CVPR24(22413-22422)
IEEE DOI Code:
WWW Link. 2410
Training, Image segmentation, Annotations, Pipelines, Text to image, Image decomposition, Software, Dataset, Text-to-Image Generation, Diffusion Models BibRef

Wang, F.F.[Fei-Fei], Tan, Z.T.[Zhen-Tao], Wei, T.Y.[Tian-Yi], Wu, Y.[Yue], Huang, Q.D.[Qi-Dong],
SimAC: A Simple Anti-Customization Method for Protecting Face Privacy Against Text-to-Image Synthesis of Diffusion Models,
CVPR24(12047-12056)
IEEE DOI Code:
WWW Link. 2410
Training, Privacy, Adaptation models, Visualization, Frequency-domain analysis, Noise reduction, Text to image, face privacy BibRef

Pang, L.[Lianyu], Yin, J.[Jian], Xie, H.R.[Hao-Ran], Wang, Q.[Qiping], Li, Q.[Qing], Mao, X.D.[Xu-Dong],
Cross Initialization for Face Personalization of Text-to-Image Models,
CVPR24(8393-8403)
IEEE DOI Code:
WWW Link. 2410
Face recognition, Computational modeling, Text to image, Diffusion models, Surges, Image reconstruction BibRef

Xu, X.Q.[Xing-Qian], Guo, J.Y.[Jia-Yi], Wang, Z.Y.[Zhang-Yang], Huang, G.[Gao], Essa, I.[Irfan], Shi, H.[Humphrey],
Prompt-Free Diffusion: Taking 'Text' Out of Text-to-Image Diffusion Models,
CVPR24(8682-8692)
IEEE DOI 2410
Visualization, Pain, Image synthesis, Computational modeling, Semantics, Noise, Text to image, Generative Model, Image Editing, Text-to-Image BibRef

Qi, T.H.[Tian-Hao], Fang, S.C.[Shan-Cheng], Wu, Y.[Yanze], Xie, H.T.[Hong-Tao], Liu, J.W.[Jia-Wei], Chen, L.[Lang], He, Q.[Qian], Zhang, Y.D.[Yong-Dong],
DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations,
CVPR24(8693-8702)
IEEE DOI Code:
WWW Link. 2410
Learning systems, Visualization, Semantics, Text to image, Feature extraction, Diffusion models BibRef

Brack, M.[Manuel], Friedrich, F.[Felix], Kornmeier, K.[Katharina], Tsaban, L.[Linoy], Schramowski, P.[Patrick], Kersting, K.[Kristian], Passos, A.[Apolinário],
LEDITS++: Limitless Image Editing Using Text-to-Image Models,
CVPR24(8861-8870)
IEEE DOI 2410
Computational modeling, Text to image, Computer architecture, Benchmark testing, Diffusion models BibRef

Li, H.[Hang], Shen, C.Z.[Cheng-Zhi], Torr, P.[Philip], Tresp, V.[Volker], Gu, J.D.[Jin-Dong],
Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation,
CVPR24(12006-12016)
IEEE DOI Code:
WWW Link. 2410
Ethics, Prevention and mitigation, Semantics, Text to image, Diffusion models, Vectors, Text-to-Image Generation, Explainability and Transparency BibRef

Li, H.[Hao], Zou, Y.[Yang], Wang, Y.[Ying], Majumder, O.[Orchid], Xie, Y.S.[Yu-Sheng], Manmatha, R., Swaminathan, A.[Ashwin], Tu, Z.W.[Zhuo-Wen], Ermon, S.[Stefano], Soatto, S.[Stefano],
On the Scalability of Diffusion-based Text-to-Image Generation,
CVPR24(9400-9409)
IEEE DOI 2410
Training, Costs, Systematics, Computational modeling, Scalability, Noise reduction, Text to image, diffusion models, text-to-image, Transformers BibRef

Guo, X.[Xiefan], Liu, J.L.[Jin-Lin], Cui, M.M.[Miao-Miao], Li, J.[Jiankai], Yang, H.Y.[Hong-Yu], Huang, D.[Di],
Initno: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization,
CVPR24(9380-9389)
IEEE DOI Code:
WWW Link. 2410
Navigation, Instruments, Noise, Pipelines, Text to image, Aerospace electronics BibRef

Shen, D.[Dazhong], Song, G.[Guanglu], Xue, Z.[Zeyue], Wang, F.Y.[Fu-Yun], Liu, Y.[Yu],
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance,
CVPR24(9370-9379)
IEEE DOI Code:
WWW Link. 2410
Image quality, Training, Costs, Semantic segmentation, Semantics, Noise reduction, Text-to-Image Diffusion Models, Semantic Segmentation BibRef

Zhou, Y.F.[Yu-Fan], Zhang, R.[Ruiyi], Gu, J.X.[Jiu-Xiang], Sun, T.[Tong],
Customization Assistant for Text-to-image Generation,
CVPR24(9182-9191)
IEEE DOI 2410
Training, Large language models, Text to image, Diffusion models, Testing BibRef

Patel, M.[Maitreya], Kim, C.[Changhoon], Cheng, S.[Sheng], Baral, C.[Chitta], Yang, Y.Z.[Ye-Zhou],
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations,
CVPR24(9069-9078)
IEEE DOI Code:
WWW Link. 2410
Training, Image coding, Image synthesis, Computational modeling, Text to image, Contrastive learning, Diffusion models, ECLIPSE BibRef

Meral, T.H.S.[Tuna Han Salih], Simsar, E.[Enis], Tombari, F.[Federico], Yanardag, P.[Pinar],
CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models,
CVPR24(9005-9014)
IEEE DOI 2410
Source coding, Computational modeling, Semantics, Text to image, Benchmark testing, Diffusion models, Semantic fidelity BibRef

Jiang, Z.[Zeyinzi], Mao, C.J.[Chao-Jie], Pan, Y.L.[Yu-Lin], Han, Z.[Zhen], Zhang, J.[Jingfeng],
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing,
CVPR24(8995-9004)
IEEE DOI Code:
WWW Link. 2410
Training, Adaptation models, Tuners, Image synthesis, Text to image, Diffusion models, Diffusion model, Text-to-image generation, Efficient Tuning BibRef

Kim, C.[Changhoon], Min, K.[Kyle], Patel, M.[Maitreya], Cheng, S.[Sheng], Yang, Y.Z.[Ye-Zhou],
WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models,
CVPR24(8974-8983)
IEEE DOI Code:
WWW Link. 2410
Solid modeling, Computational modeling, Prevention and mitigation, Text to image, Modulation, Generative Model BibRef

Shirakawa, T.[Takahiro], Uchida, S.[Seiichi],
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging,
CVPR24(8921-8930)
IEEE DOI Code:
WWW Link. 2410
Image synthesis, Image edge detection, Noise, Layout, Noise reduction, Merging, Text to image, diffusion model, text-to-image generation BibRef

Kwon, G.[Gihyun], Jenni, S.[Simon], Li, D.Z.[Ding-Zeyu], Lee, J.Y.[Joon-Young], Ye, J.C.[Jong Chul], Heilbron, F.C.[Fabian Caba],
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models,
CVPR24(8880-8889)
IEEE DOI 2410
Fuses, Semantics, Text to image, Diffusion models, Optimization, Text-to-image Model, Multi-concept BibRef

Sueyoshi, K.[Kota], Matsubara, T.[Takashi],
Predicated Diffusion: Predicate Logic-Based Attention Guidance for Text-to-Image Diffusion Models,
CVPR24(8651-8660)
IEEE DOI 2410
Image quality, Image synthesis, Natural languages, Layout, Text to image, Diffusion models, text-to-image generation, attention guidance BibRef

Wang, Z.[Zirui], Sha, Z.Z.[Zhi-Zhou], Ding, Z.[Zheng], Wang, Y.L.[Yi-Lin], Tu, Z.W.[Zhuo-Wen],
TokenCompose: Text-to-Image Diffusion with Token-Level Supervision,
CVPR24(8553-8564)
IEEE DOI 2410
Training, Photorealism, Pipelines, Noise reduction, Text to image, Object segmentation, Benchmark testing, Diffusion Models, Compositional Generation BibRef

Kim, J.[Jimyeong], Park, J.[Jungwon], Rhee, W.[Wonjong],
Selectively Informative Description can Reduce Undesired Embedding Entanglements in Text-to-Image Personalization,
CVPR24(8312-8322)
IEEE DOI 2410
Text to image, Reflection, Text-to-Image Generation, Text-to-Image Diffusion, Text-to-image Personalization BibRef

Koley, S.[Subhadeep], Bhunia, A.K.[Ayan Kumar], Sain, A.[Aneeshan], Chowdhury, P.N.[Pinaki Nath], Xiang, T.[Tao], Song, Y.Z.[Yi-Zhe],
Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers,
CVPR24(16826-16837)
IEEE DOI 2410
Visualization, Adaptation models, Shape, Pipelines, Image retrieval, Text to image, Benchmark testing BibRef

Zhao, L.[Lin], Zhao, T.C.[Tian-Chen], Lin, Z.[Zinan], Ning, X.F.[Xue-Fei], Dai, G.H.[Guo-Hao], Yang, H.Z.[Hua-Zhong], Wang, Y.[Yu],
FlashEval: Towards Fast and Accurate Evaluation of Text-to-Image Diffusion Generative Models,
CVPR24(16122-16131)
IEEE DOI Code:
WWW Link. 2410
Training, Schedules, Quantization (signal), Computational modeling, Text to image, Training data, Diffusion models BibRef

Liu, H.[Hanwen], Sun, Z.C.[Zhi-Cheng], Mu, Y.D.[Ya-Dong],
Countering Personalized Text-to-Image Generation with Influence Watermarks,
CVPR24(12257-12267)
IEEE DOI 2410
Training, Visualization, Computational modeling, Semantics, Noise, Text to image, Watermarking, diffusion models, watermarks BibRef

Azarian, K.[Kambiz], Das, D.[Debasmit], Hou, Q.Q.[Qi-Qi], Porikli, F.M.[Fatih M.],
Segmentation-Free Guidance for Text-to-Image Diffusion Models,
GCV24(7520-7529)
IEEE DOI 2410
Image segmentation, Costs, Image color analysis, Text to image, Focusing, Switches BibRef

Lee, S.[Seoyoung], Lee, J.[Joonseok],
PoseDiff: Pose-conditioned Multimodal Diffusion Model for Unbounded Scene Synthesis from Sparse Inputs,
WACV24(5005-5015)
IEEE DOI 2404
Image color analysis, Computational modeling, Scalability, Cameras, Tuning, Faces, Algorithms, Generative models for image, video, 3D, etc., Vision + language and/or other modalities BibRef

Wang, H.[Hai], Xiang, X.Y.[Xiao-Yu], Fan, Y.C.[Yu-Chen], Xue, J.H.[Jing-Hao],
Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models,
WACV24(4921-4931)
IEEE DOI Code:
WWW Link. 2404
Geometry, Codes, Noise reduction, Games, Task analysis, Algorithms, Generative models for image, video, 3D, etc., Algorithms, image and video synthesis BibRef

Li, C.[Cheng], Qi, Y.[Yali], Zeng, Q.[Qingtao], Lu, L.[Likun],
Comparison of Image Generation methods based on Diffusion Models,
CVIDL23(1-4)
IEEE DOI 2403
Training, Deep learning, Learning systems, Image synthesis, Computational modeling, Diffusion models BibRef

Xu, Y.[Yanwu], Zhao, Y.[Yang], Xiao, Z.S.[Zhi-Sheng], Hou, T.B.[Ting-Bo],
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs,
CVPR24(8196-8206)
IEEE DOI 2410
Image synthesis, Computational modeling, Text to image, Propulsion, Diffusion models, Hybrid power systems, diffusion models, GANs BibRef

Chen, M.H.[Ming-Hao], Laina, I.[Iro], Vedaldi, A.[Andrea],
Training-Free Layout Control with Cross-Attention Guidance,
WACV24(5331-5341)
IEEE DOI 2404
Training, Visualization, Layout, Semantics, Noise, Benchmark testing, Algorithms, Generative models for image, video, 3D, etc BibRef

Huang, R.H.[Run-Hui], Han, J.H.[Jian-Hua], Lu, G.S.[Guan-Song], Liang, X.D.[Xiao-Dan], Zeng, Y.[Yihan], Zhang, W.[Wei], Xu, H.[Hang],
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability,
ICCV23(15667-15677)
IEEE DOI 2401
BibRef

Yang, X.Y.[Xing-Yi], Wang, X.C.[Xin-Chao],
Diffusion Model as Representation Learner,
ICCV23(18892-18903)
IEEE DOI Code:
WWW Link. 2401
BibRef

Nair, N.G.[Nithin Gopalakrishnan], Cherian, A.[Anoop], Lohit, S.[Suhas], Wang, Y.[Ye], Koike-Akino, T.[Toshiaki], Patel, V.M.[Vishal M.], Marks, T.K.[Tim K.],
Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis,
ICCV23(20793-20803)
IEEE DOI 2401
BibRef

Wang, Z.D.[Zhen-Dong], Bao, J.M.[Jian-Min], Zhou, W.G.[Wen-Gang], Wang, W.[Weilun], Hu, H.[Hezhen], Chen, H.[Hong], Li, H.Q.[Hou-Qiang],
DIRE for Diffusion-Generated Image Detection,
ICCV23(22388-22398)
IEEE DOI Code:
WWW Link. 2401
BibRef

Tang, J.[Junshu], Wang, T.F.[Teng-Fei], Zhang, B.[Bo], Zhang, T.[Ting], Yi, R.[Ran], Ma, L.Z.[Li-Zhuang], Chen, D.[Dong],
Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior,
ICCV23(22762-22772)
IEEE DOI 2401
BibRef

Ge, S.W.[Song-Wei], Nah, S.J.[Seung-Jun], Liu, G.L.[Gui-Lin], Poon, T.[Tyler], Tao, A.[Andrew], Catanzaro, B.[Bryan], Jacobs, D.[David], Huang, J.B.[Jia-Bin], Liu, M.Y.[Ming-Yu], Balaji, Y.[Yogesh],
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models,
ICCV23(22873-22884)
IEEE DOI Code:
WWW Link. 2401
BibRef

Hong, S.[Susung], Lee, G.[Gyuseong], Jang, W.[Wooseok], Kim, S.[Seungryong],
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance,
ICCV23(7428-7437)
IEEE DOI 2401
BibRef

Szymanowicz, S.[Stanislaw], Rupprecht, C.[Christian], Vedaldi, A.[Andrea],
Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data,
ICCV23(8829-8839)
IEEE DOI 2401
BibRef

Jiang, Y.[Yutao], Zhou, Y.[Yang], Liang, Y.[Yuan], Liu, W.X.[Wen-Xi], Jiao, J.B.[Jian-Bo], Quan, Y.H.[Yu-Hui], He, S.F.[Sheng-Feng],
Diffuse3D: Wide-Angle 3D Photography via Bilateral Diffusion,
ICCV23(8964-8974)
IEEE DOI Code:
WWW Link. 2401
BibRef

Feng, B.T.[Berthy T.], Smith, J.[Jamie], Rubinstein, M.[Michael], Chang, H.[Huiwen], Bouman, K.L.[Katherine L.], Freeman, W.T.[William T.],
Score-Based Diffusion Models as Principled Priors for Inverse Imaging,
ICCV23(10486-10497)
IEEE DOI 2401
BibRef

Yang, B.B.[Bin-Bin], Luo, Y.[Yi], Chen, Z.L.[Zi-Liang], Wang, G.R.[Guang-Run], Liang, X.D.[Xiao-Dan], Lin, L.[Liang],
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts,
ICCV23(22612-22622)
IEEE DOI 2401
BibRef

Levi, E.[Elad], Brosh, E.[Eli], Mykhailych, M.[Mykola], Perez, M.[Meir],
DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer,
ICCV23(2106-2115)
IEEE DOI Code:
WWW Link. 2401
BibRef

Couairon, G.[Guillaume], Careil, M.[Marlène], Cord, M.[Matthieu], Lathuilière, S.[Stéphane], Verbeek, J.[Jakob],
Zero-shot spatial layout conditioning for text-to-image diffusion models,
ICCV23(2174-2183)
IEEE DOI 2401
BibRef

Zhang, L.[Lvmin], Rao, A.[Anyi], Agrawala, M.[Maneesh],
Adding Conditional Control to Text-to-Image Diffusion Models,
ICCV23(3813-3824)
IEEE DOI 2401
Award, Marr Price, ICCV. BibRef

Zhao, W.L.[Wen-Liang], Rao, Y.M.[Yong-Ming], Liu, Z.[Zuyan], Liu, B.[Benlin], Zhou, J.[Jie], Lu, J.W.[Ji-Wen],
Unleashing Text-to-Image Diffusion Models for Visual Perception,
ICCV23(5706-5716)
IEEE DOI Code:
WWW Link. 2401
BibRef

Xie, J.[Jinheng], Li, Y.X.[Yue-Xiang], Huang, Y.W.[Ya-Wen], Liu, H.Z.[Hao-Zhe], Zhang, W.[Wentian], Zheng, Y.F.[Ye-Feng], Shou, M.Z.[Mike Zheng],
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion,
ICCV23(7418-7427)
IEEE DOI 2401
BibRef

Wu, Q.C.[Qiu-Cheng], Liu, Y.J.[Yu-Jian], Zhao, H.[Handong], Bui, T.[Trung], Lin, Z.[Zhe], Zhang, Y.[Yang], Chang, S.Y.[Shi-Yu],
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis,
ICCV23(7732-7742)
IEEE DOI 2401
BibRef

Khachatryan, L.[Levon], Movsisyan, A.[Andranik], Tadevosyan, V.[Vahram], Henschel, R.[Roberto], Wang, Z.Y.[Zhang-Yang], Navasardyan, S.[Shant], Shi, H.[Humphrey],
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators,
ICCV23(15908-15918)
IEEE DOI Code:
WWW Link. 2401
BibRef

Zhao, J.[Jing], Zheng, H.[Heliang], Wang, C.[Chaoyue], Lan, L.[Long], Yang, W.J.[Wen-Jing],
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models,
ICCV23(22535-22545)
IEEE DOI Code:
WWW Link. 2401
BibRef

Kumari, N.[Nupur], Zhang, B.L.[Bing-Liang], Wang, S.Y.[Sheng-Yu], Shechtman, E.[Eli], Zhang, R.[Richard], Zhu, J.Y.[Jun-Yan],
Ablating Concepts in Text-to-Image Diffusion Models,
ICCV23(22634-22645)
IEEE DOI 2401
BibRef

Schwartz, I.[Idan], Snæbjarnarson, V.[Vésteinn], Chefer, H.[Hila], Belongie, S.[Serge], Wolf, L.[Lior], Benaim, S.[Sagie],
Discriminative Class Tokens for Text-to-Image Diffusion Models,
ICCV23(22668-22678)
IEEE DOI Code:
WWW Link. 2401
BibRef

Patashnik, O.[Or], Garibi, D.[Daniel], Azuri, I.[Idan], Averbuch-Elor, H.[Hadar], Cohen-Or, D.[Daniel],
Localizing Object-level Shape Variations with Text-to-Image Diffusion Models,
ICCV23(22994-23004)
IEEE DOI 2401
BibRef

Ge, S.W.[Song-Wei], Park, T.[Taesung], Zhu, J.Y.[Jun-Yan], Huang, J.B.[Jia-Bin],
Expressive Text-to-Image Generation with Rich Text,
ICCV23(7511-7522)
IEEE DOI 2401
BibRef

Kim, Y.J.[Yun-Ji], Lee, J.Y.[Ji-Young], Kim, J.H.[Jin-Hwa], Ha, J.W.[Jung-Woo], Zhu, J.Y.[Jun-Yan],
Dense Text-to-Image Generation with Attention Modulation,
ICCV23(7667-7677)
IEEE DOI Code:
WWW Link. 2401
BibRef

Xiang, J.F.[Jian-Feng], Yang, J.[Jiaolong], Huang, B.B.[Bin-Bin], Tong, X.[Xin],
3D-aware Image Generation using 2D Diffusion Models,
ICCV23(2383-2393)
IEEE DOI 2401
BibRef

Schramowski, P.[Patrick], Brack, M.[Manuel], Deiseroth, B.[Björn], Kersting, K.[Kristian],
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models,
CVPR23(22522-22531)
IEEE DOI 2309
BibRef

Chen, C.[Chen], Liu, D.[Daochang], Ma, S.Q.[Si-Qi], Nepal, S.[Surya], Xu, C.[Chang],
Private Image Generation with Dual-Purpose Auxiliary Classifier,
CVPR23(20361-20370)
IEEE DOI 2309
BibRef

Chai, L.[Lucy], Tucker, R.[Richard], Li, Z.Q.[Zheng-Qi], Isola, P.[Phillip], Snavely, N.[Noah],
Persistent Nature: A Generative Model of Unbounded 3D Worlds,
CVPR23(20863-20874)
IEEE DOI 2309
BibRef

Ni, H.[Haomiao], Shi, C.[Changhao], Li, K.[Kai], Huang, S.X.[Sharon X.], Min, M.R.[Martin Renqiang],
Conditional Image-to-Video Generation with Latent Flow Diffusion Models,
CVPR23(18444-18455)
IEEE DOI 2309
BibRef

Zhang, Q.S.[Qin-Sheng], Song, J.[JiaMing], Huang, X.[Xun], Chen, Y.X.[Yong-Xin], Liu, M.Y.[Ming-Yu],
DiffCollage: Parallel Generation of Large Content with Diffusion Models,
CVPR23(10188-10198)
IEEE DOI 2309
BibRef

Phung, H.[Hao], Dao, Q.[Quan], Tran, A.[Anh],
Wavelet Diffusion Models are fast and scalable Image Generators,
CVPR23(10199-10208)
IEEE DOI 2309
BibRef

Shim, J.[Jaehyeok], Joo, K.[Kyungdon],
DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction,
CVPR24(5396-5405)
IEEE DOI 2410
Point cloud compression, Shape, Power system stability, Transformers, Topology, ComputerVision BibRef

Shim, J.[Jaehyeok], Kang, C.W.[Chang-Woo], Joo, K.[Kyungdon],
Diffusion-Based Signed Distance Fields for 3D Shape Generation,
CVPR23(20887-20897)
IEEE DOI 2309
BibRef

Po, R.[Ryan], Wetzstein, G.[Gordon],
Compositional 3D Scene Generation using Locally Conditioned Diffusion,
3DV24(651-663)
IEEE DOI 2408
Semantics, Pipelines, Manuals, Task analysis BibRef

Shue, J.R.[J. Ryan], Chan, E.R.[Eric Ryan], Po, R.[Ryan], Ankner, Z.[Zachary], Wu, J.J.[Jia-Jun], Wetzstein, G.[Gordon],
3D Neural Field Generation Using Triplane Diffusion,
CVPR23(20875-20886)
IEEE DOI 2309
BibRef

Kim, S.W.[Seung Wook], Brown, B.[Bradley], Yin, K.X.[Kang-Xue], Kreis, K.[Karsten], Schwarz, K.[Katja], Li, D.[Daiqing], Rombach, R.[Robin], Torralba, A.[Antonio], Fidler, S.[Sanja],
NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models,
CVPR23(8496-8506)
IEEE DOI 2309
BibRef

Luo, Z.X.[Zheng-Xiong], Chen, D.[Dayou], Zhang, Y.Y.[Ying-Ya], Huang, Y.[Yan], Wang, L.[Liang], Shen, Y.J.[Yu-Jun], Zhao, D.L.[De-Li], Zhou, J.[Jingren], Tan, T.N.[Tie-Niu],
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation,
CVPR23(10209-10218)
IEEE DOI 2309
BibRef

Ruan, L.[Ludan], Ma, Y.Y.[Yi-Yang], Yang, H.[Huan], He, H.G.[Hui-Guo], Liu, B.[Bei], Fu, J.L.[Jian-Long], Yuan, N.J.[Nicholas Jing], Jin, Q.[Qin], Guo, B.[Baining],
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation,
CVPR23(10219-10228)
IEEE DOI 2309
BibRef

Zhu, Y.Z.[Yuan-Zhi], Li, Z.H.[Zhao-Hai], Wang, T.W.[Tian-Wei], He, M.C.[Meng-Chao], Yao, C.[Cong],
Conditional Text Image Generation with Diffusion Models,
CVPR23(14235-14244)
IEEE DOI 2309
BibRef

Zhou, Y.F.[Yu-Fan], Liu, B.C.[Bing-Chen], Zhu, Y.Z.[Yi-Zhe], Yang, X.[Xiao], Chen, C.Y.[Chang-You], Xu, J.H.[Jin-Hui],
Shifted Diffusion for Text-to-image Generation,
CVPR23(10157-10166)
IEEE DOI 2309
BibRef

Li, M.[Muheng], Duan, Y.[Yueqi], Zhou, J.[Jie], Lu, J.W.[Ji-Wen],
Diffusion-SDF: Text-to-Shape via Voxelized Diffusion,
CVPR23(12642-12651)
IEEE DOI 2309
BibRef

Xu, J.[Jiale], Wang, X.[Xintao], Cheng, W.H.[Wei-Hao], Cao, Y.P.[Yan-Pei], Shan, Y.[Ying], Qie, X.[Xiaohu], Gao, S.H.[Sheng-Hua],
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models,
CVPR23(20908-20918)
IEEE DOI 2309
BibRef

Chai, S.[Shang], Zhuang, L.S.[Lian-Sheng], Yan, F.Y.[Feng-Ying],
LayoutDM: Transformer-based Diffusion Model for Layout Generation,
CVPR23(18349-18358)
IEEE DOI 2309
BibRef

Wu, Q.C.[Qiu-Cheng], Liu, Y.J.[Yu-Jian], Zhao, H.[Handong], Kale, A.[Ajinkya], Bui, T.[Trung], Yu, T.[Tong], Lin, Z.[Zhe], Zhang, Y.[Yang], Chang, S.Y.[Shi-Yu],
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models,
CVPR23(1900-1910)
IEEE DOI 2309
BibRef

Jain, A.[Ajay], Xie, A.[Amber], Abbeel, P.[Pieter],
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models,
CVPR23(1911-1920)
IEEE DOI 2309
BibRef

Kumari, N.[Nupur], Zhang, B.L.[Bing-Liang], Zhang, R.[Richard], Shechtman, E.[Eli], Zhu, J.Y.[Jun-Yan],
Multi-Concept Customization of Text-to-Image Diffusion,
CVPR23(1931-1941)
IEEE DOI 2309
BibRef

Hui, M.[Mude], Zhang, Z.Z.[Zhi-Zheng], Zhang, X.Y.[Xiao-Yi], Xie, W.X.[Wen-Xuan], Wang, Y.W.[Yu-Wang], Lu, Y.[Yan],
Unifying Layout Generation with a Decoupled Diffusion Model,
CVPR23(1942-1951)
IEEE DOI 2309
BibRef

Ruiz, N.[Nataniel], Li, Y.Z.[Yuan-Zhen], Jampani, V.[Varun], Pritch, Y.[Yael], Rubinstein, M.[Michael], Aberman, K.[Kfir],
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation,
CVPR23(22500-22510)
IEEE DOI 2309
BibRef

Zheng, G.C.[Guang-Cong], Zhou, X.P.[Xian-Pan], Li, X.W.[Xue-Wei], Qi, Z.A.[Zhong-Ang], Shan, Y.[Ying], Li, X.[Xi],
LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation,
CVPR23(22490-22499)
IEEE DOI 2309
BibRef

Liu, X.H.[Xi-Hui], Park, D.H.[Dong Huk], Azadi, S.[Samaneh], Zhang, G.[Gong], Chopikyan, A.[Arman], Hu, Y.X.[Yu-Xiao], Shi, H.[Humphrey], Rohrbach, A.[Anna], Darrell, T.J.[Trevor J.],
More Control for Free! Image Synthesis with Semantic Diffusion Guidance,
WACV23(289-299)
IEEE DOI 2302
Image synthesis, Annotations, Image matching, Semantics, Noise reduction, Probabilistic logic, Vision + language and/or other modalities BibRef

Pan, Z.H.[Zhi-Hong], Zhou, X.[Xin], Tian, H.[Hao],
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation,
WACV23(4450-4460)
IEEE DOI 2302
Graphics, Training, Technological innovation, Adaptation models, Adaptive systems, Art, Navigation, Vision + language and/or other modalities BibRef

Gu, S.Y.[Shu-Yang], Chen, D.[Dong], Bao, J.M.[Jian-Min], Wen, F.[Fang], Zhang, B.[Bo], Chen, D.D.[Dong-Dong], Yuan, L.[Lu], Guo, B.N.[Bai-Ning],
Vector Quantized Diffusion Model for Text-to-Image Synthesis,
CVPR22(10686-10696)
IEEE DOI 2210
Image quality, Image resolution, Image synthesis, Computational modeling, Noise reduction, Vision+language BibRef

Jing, B.[Bowen], Corso, G.[Gabriele], Berlinghieri, R.[Renato], Jaakkola, T.[Tommi],
Subspace Diffusion Generative Models,
ECCV22(XXIII:274-289).
Springer DOI 2211
BibRef

Han, L.G.[Li-Gong], Li, Y.X.[Yin-Xiao], Zhang, H.[Han], Milanfar, P.[Peyman], Metaxas, D.N.[Dimitris N.], Yang, F.[Feng],
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning,
ICCV23(7289-7300)
IEEE DOI 2401
BibRef

Nair, N.G.[Nithin Gopalakrishnan], Bandara, W.G.C.[Wele Gedara Chaminda], Patel, V.M.[Vishal M.],
Unite and Conquer: Plug and Play Multi-Modal Synthesis Using Diffusion Models,
CVPR23(6070-6079)
IEEE DOI 2309
BibRef

Benny, Y.[Yaniv], Wolf, L.B.[Lior B.],
Dynamic Dual-Output Diffusion Models,
CVPR22(11472-11481)
IEEE DOI 2210
Image quality, Image synthesis, Noise reduction, Generative adversarial networks, Image and video synthesis and generation BibRef

Hu, M.H.[Ming-Hui], Wang, Y.J.[Yu-Jie], Cham, T.J.[Tat-Jen], Yang, J.F.[Jian-Fei], Suganthan, P.N.,
Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation,
CVPR22(11492-11501)
IEEE DOI 2210
Training, Visualization, Image resolution, Image synthesis, Pipelines, Noise reduction, Probabilistic logic, Image and video synthesis and generation BibRef

Ma, H.Y.[Heng-Yuan], Zhang, L.[Li], Zhu, X.T.[Xia-Tian], Feng, J.F.[Jian-Feng],
Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling,
ECCV22(XXIII:1-16).
Springer DOI 2211
BibRef

Zheng, G.[Guangcong], Li, S.[Shengming], Wang, H.[Hui], Yao, T.P.[Tai-Ping], Chen, Y.[Yang], Ding, S.H.[Shou-Hong], Li, X.[Xi],
Entropy-Driven Sampling and Training Scheme for Conditional Diffusion Generation,
ECCV22(XXII:754-769).
Springer DOI 2211
BibRef

Liu, N.[Nan], Li, S.[Shuang], Du, Y.L.[Yi-Lun], Torralba, A.[Antonio], Tenenbaum, J.B.[Joshua B.],
Compositional Visual Generation with Composable Diffusion Models,
ECCV22(XVII:423-439).
Springer DOI 2211
BibRef

Sehwag, V.[Vikash], Hazirbas, C.[Caner], Gordo, A.[Albert], Ozgenel, F.[Firat], Ferrer, C.C.[Cristian Canton],
Generating High Fidelity Data from Low-density Regions using Diffusion Models,
CVPR22(11482-11491)
IEEE DOI 2210
Manifolds, Computational modeling, Diffusion processes, Data models, Representation learning BibRef

Chapter on 3-D Object Description and Computation Techniques, Surfaces, Deformable, View Generation, Video Conferencing continues in
Vision Transformers for Image Generation and Image Synthesis .


Last update:Nov 26, 2024 at 16:40:19