Diffusion Models for Fingerprint Generation
Foundations and Core Architectures of Diffusion Models
These works lay the theoretical groundwork for diffusion models, covering the architectural evolution from the early DDPM to latent-space diffusion (LDM) and Transformer-based designs.
- Denoising Diffusion Probabilistic Models(Jonathan Ho, Ajay Jain, P. Abbeel, 2020, Neural Information Processing Systems)
- Improved Denoising Diffusion Probabilistic Models(Alex Nichol, Prafulla Dhariwal, 2021, International Conference on Machine Learning)
- Denoising Diffusion Implicit Models(Jiaming Song, Chenlin Meng, Stefano Ermon, 2020, International Conference on Learning Representations)
- High-Resolution Image Synthesis with Latent Diffusion Models(Robin Rombach, A. Blattmann, Dominik Lorenz, Patrick Esser, B. Ommer, 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Scalable Diffusion Models with Transformers(William S. Peebles, Saining Xie, 2022, 2023 IEEE/CVF International Conference on Computer Vision (ICCV))
- SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers(Nanye Ma, Mark Goldstein, M. S. Albergo, N. Boffi, Eric Vanden-Eijnden, Saining Xie, 2024, European Conference on Computer Vision)
- Simplified and Generalized Masked Diffusion for Discrete Data(Jiaxin Shi, Kehang Han, Zhe Wang, Arnaud Doucet, Michalis K. Titsias, 2024, Neural Information Processing Systems)
- Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think(Sihyun Yu, Sangkyung Kwak, Huiwon Jang, Jongheon Jeong, Jonathan Huang, Jinwoo Shin, Saining Xie, 2024, International Conference on Learning Representations)
Fingerprint and Biometric Image Generation Applications
Works that apply diffusion models specifically to fingerprint and palmprint generation, targeting data scarcity, privacy protection, and improved recognition accuracy in biometrics.
- DiffFinger: Advancing Synthetic Fingerprint Generation through Denoising Diffusion Probabilistic Models(Fred M. Grabovski, Lior Yasur, Yaniv Hacmon, Lior Nisimov, Stav Nimrod, 2024, arXiv.org)
- Denoising Diffusion Probabilistic Model with Wavelet Packet Transform for Fingerprint Generation(Li Chen, Yong Chan, 2024, Jordanian Journal of Computers and Information Technology)
- Diffusion Probabilistic Model Based End-to-End Latent Fingerprint Synthesis(Kejian Li, Xiao Yang, 2023, 2023 IEEE 4th International Conference on Pattern Recognition and Machine Learning (PRML))
- Fingerprint Synthesis from Diffusion Models and Generative Adversarial Networks(Weizhong Tang, Diego Andre Figueroa Llamosas, Donglin Liu, K. Johnsson, A. Sopasakis, 2025, Lecture Notes in Networks and Systems)
- Data augmentation-based enhanced fingerprint recognition using deep convolutional generative adversarial network and diffusion models(Yukai Liu, 2024, Applied and Computational Engineering)
- PalmDiff: When Palmprint Generation Meets Controllable Diffusion Model(Long Tang, Tingting Chai, Zheng Zhang, Miao Zhang, Xiangqian Wu, 2025, IEEE Transactions on Image Processing)
- Diff-Palm: Realistic Palmprint Generation with Polynomial Creases and Intra-Class Variation Controllable Diffusion Models(Jianlong Jin, Chenglong Zhao, Ruixin Zhang, Sheng Shang, Jianqing Xu, Jingyu Zhang, Shaoming Wang, Yang Zhao, Shouhong Ding, Wei Jia, Yunsheng Wu, 2025, 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Enhancing Fingerprint Image Synthesis with GANs, Diffusion Models, and Style Transfer Techniques(W. Tang, D. Figueroa, D. Liu, K. Johnsson, A. Sopasakis, 2024, arXiv.org)
Identity Preservation and Controllable Generation
Focuses on precise conditional control (e.g., layout, identity/ID information, pose), which is essential for generating different samples of the same fingerprint (intra-class variation).
- Adding Conditional Control to Text-to-Image Diffusion Models(Lvmin Zhang, Anyi Rao, Maneesh Agrawala, 2023, 2023 IEEE/CVF International Conference on Computer Vision (ICCV))
- FPGAN-Control: A Controllable Fingerprint Generator for Training with Synthetic Data(Alon Shoshan, Nadav Bhonker, Emanuel Ben Baruch, Ori Nizan, Igor Kviatkovsky, Joshua Engelsma, Manoj Aggarwal, Gérard G. Medioni, 2023, 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV))
- Universal Fingerprint Generation: Controllable Diffusion Model With Multimodal Conditions(Steven A. Grosz, Anil K. Jain, 2024, IEEE Transactions on Pattern Analysis and Machine Intelligence)
- DCFace: Synthetic Face Generation with Dual Condition Diffusion Model(Minchul Kim, Feng Liu, Anil Jain, Xiaoming Liu, 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild(Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Haiquan Wang, Juan Carlos Niebles, Caiming Xiong, S. Savarese, Stefano Ermon, Yun Fu, Ran Xu, 2023, Neural Information Processing Systems)
- IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models(Hu Ye, Jun Zhang, Siyi Liu, Xiao Han, Wei Yang, 2023, arXiv.org)
- DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation(Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Y. Pritch, Michael Rubinstein, Kfir Aberman, 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Laytrol: Preserving Pretrained Knowledge in Layout Control for Multimodal Diffusion Transformers(Sida Huang, Siqi Huang, Ping Luo, Hongyuan Zhang, 2025, arXiv.org)
- Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs(Ling Yang, Zhaochen Yu, Chenlin Meng, Minkai Xu, Stefano Ermon, Bin Cui, 2024, International Conference on Machine Learning)
- Data-Driven Fingerprint Reconstruction from Minutiae Based on Real and Synthetic Training Data(A. Makrushin, V. Mannam, J. Dittmann, 2023, Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications)
Multimodal Alignment, Guidance Mechanisms, and Large-Scale Models
Studies on improving image quality through classifier or classifier-free guidance, and on deep alignment and reasoning mechanisms for multimodal (text/image) information.
- Diffusion Models Beat GANs on Image Synthesis(Prafulla Dhariwal, Alex Nichol, 2021, Neural Information Processing Systems)
- Classifier-Free Diffusion Guidance(Jonathan Ho, 2022, arXiv.org)
- GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models(Alex Nichol, Prafulla Dhariwal, A. Ramesh, Pranav Shyam, Pamela Mishkin, Bob McGrew, I. Sutskever, Mark Chen, 2021, International Conference on Machine Learning)
- Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding(Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily L. Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. S. Mahdavi, Raphael Gontijo Lopes, Tim Salimans, Jonathan Ho, David J. Fleet, Mohammad Norouzi, 2022, Neural Information Processing Systems)
- SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis(Dustin Podell, Zion English, Kyle Lacey, A. Blattmann, Tim Dockhorn, Jonas Muller, Joe Penna, Robin Rombach, 2023, International Conference on Learning Representations)
- I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models(Zhenxing Mi, K. Wang, G. Qian, Hanrong Ye, Runtao Liu, Sergey Tulyakov, Kfir Aberman, Dan Xu, 2025, International Conference on Machine Learning)
- MMaDA: Multimodal Large Diffusion Language Models(Ling Yang, Ye Tian, Bowen Li, Xinchen Zhang, Ke Shen, Yunhai Tong, Mengdi Wang, 2025, arXiv.org)
- Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers(Zhengyao Lv, Tianlin Pan, Chenyang Si, Zhaoxi Chen, Wangmeng Zuo, Ziwei Liu, Kwan-Yee K. Wong, 2025, arXiv.org)
- LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer(Yuzhuo Chen, Zehua Ma, Jianhua Wang, Kai Kang, Shunyu Yao, Weiming Zhang, 2025, arXiv.org)
- Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces(Kevin Rojas, Yuchen Zhu, Sichen Zhu, Felix X.-F. Ye, Molei Tao, 2025, International Conference on Machine Learning)
- Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing(Bingyan Liu, Chengyu Wang, Tingfeng Cao, Kui Jia, Jun Huang, 2024, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
Inference Acceleration and Distillation for Diffusion Models
Efficient sampling and distillation schemes that address the many-step, slow inference of diffusion models and enable fast generation.
- SDXL-Lightning: Progressive Adversarial Diffusion Distillation(Shanchuan Lin, Anran Wang, Xiao Yang, 2024, arXiv.org)
- Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation(Clément Chadebec, O. Tasar, Eyal Benaroche, Benjamin Aubin, 2024, AAAI Conference on Artificial Intelligence)
This collection of papers traces the full development path of diffusion models, from foundational generative theory to specific biometric domains such as fingerprint recognition. The central research question is how improved architectures (e.g., moving from U-Net to Transformer backbones) and control mechanisms (e.g., ControlNet, ID losses) can produce synthetic fingerprints that are both highly realistic and identity-consistent. In addition, to overcome data scarcity and privacy constraints, researchers use these models for large-scale data augmentation, while also tackling the challenges of modality-alignment efficiency and inference acceleration during generation.
39 related references in total
The utilization of synthetic data for fingerprint recognition has garnered increased attention due to its potential to alleviate privacy concerns surrounding sensitive biometric data. However, current methods for generating fingerprints have limitations in creating impressions of the same finger with useful intra-class variations. To tackle this challenge, we present GenPrint, a framework to produce fingerprint images of various types while maintaining identity and offering humanly understandable control over different appearance factors, such as fingerprint class, acquisition type, sensor device, and quality level. Unlike previous fingerprint generation approaches, GenPrint is not confined to replicating style characteristics from the training dataset alone: it enables the generation of novel styles from unseen devices without requiring additional fine-tuning. To accomplish these objectives, we developed GenPrint using latent diffusion models with multimodal conditions (text and image) for consistent generation of style and identity. Our experiments leverage a variety of publicly available datasets for training and evaluation. Results demonstrate the benefits of GenPrint in terms of identity preservation, explainable control, and universality of generated images. Importantly, training with GenPrint-generated images yields comparable or even superior accuracy to training solely on real data and further enhances performance when used to augment the diversity of existing real fingerprint datasets.
The majority of contemporary fingerprint synthesis is based on the Generative Adversarial Network (GAN). Recently, the Denoising Diffusion Probabilistic Model (DDPM) has been demonstrated to be more effective than GAN in numerous scenarios, particularly in terms of diversity and fidelity. This research develops a model based on the enhanced DDPM for fingerprint generation. Specifically, the image is decomposed into sub-images of varying frequency sub-bands through the use of a wavelet packet transform (WPT). This method enables DDPM to operate at a more local and detailed level, thereby accurately obtaining the characteristics of the data. Furthermore, a polynomial noise schedule has been designed to replace the linear noise strategy, which can result in a smoother noise addition process. Experiments based on multiple metrics on the datasets SOCOFing and NIST4 demonstrate that the proposed model is superior to existing models.
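For readers unfamiliar with the polynomial noise schedule mentioned above, the sketch below shows a generic polynomial interpolation between the usual beta endpoints; the endpoint values and the exponent are illustrative assumptions, not the settings used in the paper.

```python
import torch

def polynomial_beta_schedule(timesteps: int, beta_start: float = 1e-4,
                             beta_end: float = 0.02, power: float = 2.0) -> torch.Tensor:
    """Illustrative polynomial noise schedule: interpolate beta_t between the two
    endpoints along a polynomial curve instead of a straight line, giving a
    smoother ramp-up of noise early in the forward process."""
    t = torch.linspace(0.0, 1.0, timesteps)
    return beta_start + (beta_end - beta_start) * t.pow(power)

betas = polynomial_beta_schedule(1000)
alphas_cumprod = torch.cumprod(1.0 - betas, dim=0)  # \bar{alpha}_t used in q(x_t | x_0)
```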
This study explores the generation of synthesized fingerprint images using Denoising Diffusion Probabilistic Models (DDPMs). The significant obstacles in collecting real biometric data, such as privacy concerns and the demand for diverse datasets, underscore the imperative for synthetic biometric alternatives that are both realistic and varied. Despite the strides made with Generative Adversarial Networks (GANs) in producing realistic fingerprint images, their limitations prompt us to propose DDPMs as a promising alternative. DDPMs are capable of generating images with increasing clarity and realism while maintaining diversity. Our results reveal that DiffFinger not only competes with authentic training set data in quality but also provides a richer set of biometric data, reflecting true-to-life variability. These findings mark a promising stride in biometric synthesis, showcasing the potential of DDPMs to advance the landscape of fingerprint identification and authentication systems.
In this paper, we propose an efficient, fast, and versatile distillation method to accelerate the generation of pre-trained diffusion models. The method reaches state-of-the-art performance in terms of FID and CLIP-Score for few-step image generation on the COCO2014 and COCO2017 datasets, while requiring only a few GPU hours of training and fewer trainable parameters than existing methods. In addition to its efficiency, the versatility of the method is also demonstrated across several tasks such as *text-to-image*, *inpainting*, *face-swapping*, *super-resolution* and using different backbones such as UNet-based denoisers (SD1.5, SDXL), DiT (Pixart) and MMDiT (SD3), as well as adapters. In all cases, the method drastically reduces the number of sampling steps while maintaining very high-quality image generation.
Achieving machine autonomy and human control often represent divergent objectives in the design of interactive AI systems. Visual generative foundation models such as Stable Diffusion show promise in navigating these goals, especially when prompted with arbitrary languages. However, they often fall short in generating images with spatial, structural, or geometric controls. The integration of such controls, which can accommodate various visual conditions in a single unified model, remains an unaddressed challenge. In response, we introduce UniControl, a new generative foundation model that consolidates a wide array of controllable condition-to-image (C2I) tasks within a singular framework, while still allowing for arbitrary language prompts. UniControl enables pixel-level-precise image generation, where visual conditions primarily influence the generated structures and language prompts guide the style and context. To equip UniControl with the capacity to handle diverse visual conditions, we augment pretrained text-to-image diffusion models and introduce a task-aware HyperNet to modulate the diffusion models, enabling the adaptation to different C2I tasks simultaneously. Trained on nine unique C2I tasks, UniControl demonstrates impressive zero-shot generation abilities with unseen visual conditions. Experimental results show that UniControl often surpasses the performance of single-task-controlled methods of comparable model sizes. This control versatility positions UniControl as a significant advancement in the realm of controllable visual generation.
Generating synthetic datasets for training face recognition models is challenging because dataset generation entails more than creating high fidelity images. It involves generating multiple images of the same subjects under different factors (e.g., variations in pose, illumination, expression, aging and occlusion) that follow the real image conditional distribution. Previous works have studied the generation of synthetic datasets using GAN or 3D models. In this work, we approach the problem from the aspect of combining subject appearance (ID) and external factor (style) conditions. These two conditions provide a direct way to control the inter-class and intra-class variations. To this end, we propose a Dual Condition Face Generator (DCFace) based on a diffusion model. Our novel patch-wise style extractor and time-step dependent ID loss enable DCFace to consistently produce face images of the same subject under different styles with precise control. Face recognition models trained on synthetic images from the proposed DCFace provide higher verification accuracies compared to previous works by 6.11% on average in 4 out of 5 test datasets: LFW, CFP-FP, CPLFW, AgeDB and CALFW.
Fingerprints have been crucial evidence for law enforcement agencies for a long time. Though rapidly developing deep learning has dramatically improved the performance of latent fingerprint recognition algorithms, a fully automated latent fingerprint identification system is still far from meeting actual needs. One major issue is the lack of publicly available latent fingerprint databases. Recently, diffusion probabilistic models have emerged as state-of-the-art deep generative methods for image synthesis. These models have better distribution coverage and less mode collapse than the popular Generative Adversarial Networks. In this paper, we propose an end-to-end latent fingerprint synthesis approach based on the improved denoising diffusion probabilistic model. The proposed approach can simultaneously generate latent, rolled, and plain fingerprints of high visual realism. Several primary degradation factors, such as various background textures, limited area of ridge patterns, and structural noise, can be directly generated without any postprocessing, unlike existing methods. We conduct NFIQ2 and perceptual analysis in the experiments to evaluate the proposed approach. The results indicate that the quality and visual realism of the proposed synthetic fingerprints are similar to those of natural ones, demonstrating the effectiveness of our approach.
We present SDXL, a latent diffusion model for text-to-image synthesis. Compared to previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone: the increase in model parameters is mainly due to more attention blocks and a larger cross-attention context, as SDXL uses a second text encoder. We design multiple novel conditioning schemes and train SDXL on multiple aspect ratios. We also introduce a refinement model which is used to improve the visual fidelity of samples generated by SDXL using a post-hoc image-to-image technique. We demonstrate that SDXL shows drastically improved performance compared to previous versions of Stable Diffusion and achieves results competitive with those of black-box state-of-the-art image generators. In the spirit of promoting open research and fostering transparency in large model training and evaluation, we provide access to code and model weights at https://github.com/Stability-AI/generative-models
By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a guiding mechanism to control the image generation process without retraining. However, since these models typically operate directly in pixel space, optimization of powerful DMs often consumes hundreds of GPU days and inference is expensive due to sequential evaluations. To enable DM training on limited computational resources while retaining their quality and flexibility, we apply them in the latent space of powerful pretrained autoencoders. In contrast to previous work, training diffusion models on such a representation allows for the first time to reach a near-optimal point between complexity reduction and detail preservation, greatly boosting visual fidelity. By introducing cross-attention layers into the model architecture, we turn diffusion models into powerful and flexible generators for general conditioning inputs such as text or bounding boxes and high-resolution synthesis becomes possible in a convolutional manner. Our latent diffusion models (LDMs) achieve new state of the art scores for image inpainting and class-conditional image synthesis and highly competitive performance on various tasks, including unconditional image generation, text-to-image synthesis, and super-resolution, while significantly reducing computational requirements compared to pixel-based DMs.
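To make the latent-space formulation concrete, here is a minimal, library-agnostic sampling sketch: the reverse diffusion runs entirely in the latent space of a pretrained autoencoder, and the decoder is called once at the end. The `decoder`, `denoiser`, and `betas` names are placeholders for this sketch, not a specific API.

```python
import torch

@torch.no_grad()
def sample_ldm(decoder, denoiser, betas, cond, latent_shape, device="cpu"):
    """Minimal latent-diffusion sampling sketch with DDPM-style ancestral updates."""
    alphas = 1.0 - betas
    alphas_cumprod = torch.cumprod(alphas, dim=0)
    z = torch.randn(latent_shape, device=device)             # start from Gaussian latent noise
    for t in reversed(range(len(betas))):
        eps = denoiser(z, t, cond)                            # e.g. cross-attention on text features
        a_t, a_bar = alphas[t], alphas_cumprod[t]
        z = (z - (1 - a_t) / (1 - a_bar).sqrt() * eps) / a_t.sqrt()  # posterior mean
        if t > 0:
            z = z + betas[t].sqrt() * torch.randn_like(z)     # add noise except at the last step
    return decoder(z)                                         # map the latent back to pixel space
```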
No abstract available
Deep Text-to-Image Synthesis (TIS) models such as Stable Diffusion have recently gained significant popularity for creative text-to-image generation. However, for domain-specific scenarios, tuning-free Text-guided Image Editing (TIE) is of greater importance for application developers. This approach modifies objects or object properties in images by manipulating feature components in attention layers during the generation process. Nevertheless, little is known about the semantic meanings that these attention layers have learned and which parts of the attention maps contribute to the success of image editing. In this paper, we conduct an in-depth probing analysis and demonstrate that cross-attention maps in Stable Diffusion often contain object attribution information, which can result in editing failures. In contrast, self-attention maps play a crucial role in preserving the geometric and shape details of the source image during the transformation to the target image. Our analysis offers valuable insights into understanding cross and self-attention mechanisms in diffusion models. Furthermore, based on our findings, we propose a simplified, yet more stable and efficient, tuning-free procedure that modifies only the self-attention maps of specified attention layers during the denoising process. Experimental results show that our simplified method consistently surpasses the performance of popular approaches on multiple datasets. Source code and datasets are available at https://github.com/alibaba/EasyNLP/tree/master/diffusion/FreePromptEditing.
We present ControlNet, a neural network architecture to add spatial conditioning controls to large, pretrained text-to-image diffusion models. ControlNet locks the production-ready large diffusion models, and reuses their deep and robust encoding layers pretrained with billions of images as a strong backbone to learn a diverse set of conditional controls. The neural architecture is connected with "zero convolutions" (zero-initialized convolution layers) that progressively grow the parameters from zero and ensure that no harmful noise could affect the finetuning. We test various conditioning controls, e.g., edges, depth, segmentation, human pose, etc., with Stable Diffusion, using single or multiple conditions, with or without prompts. We show that the training of ControlNets is robust with small (<50k) and large (>1m) datasets. Extensive results show that ControlNet may facilitate wider applications to control image diffusion models.
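A minimal sketch of the zero-convolution idea described above; the channel sizes and the exact merge point are simplifications assumed for illustration.

```python
import torch
import torch.nn as nn

class ZeroConv2d(nn.Conv2d):
    """1x1 convolution whose weights and bias start at zero, so a newly attached
    control branch initially contributes nothing and the frozen backbone's
    behavior is preserved at the start of fine-tuning."""
    def __init__(self, channels: int):
        super().__init__(channels, channels, kernel_size=1)
        nn.init.zeros_(self.weight)
        nn.init.zeros_(self.bias)

# Illustrative merge of a control branch into a frozen encoder block:
#   feature = frozen_block(x) + zero_conv(control_block(x, control_image))
```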
We present high quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics. Our best results are obtained by training on a weighted variational bound designed according to a novel connection between diffusion probabilistic models and denoising score matching with Langevin dynamics, and our models naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding. On the unconditional CIFAR10 dataset, we obtain an Inception score of 9.46 and a state-of-the-art FID score of 3.17. On 256x256 LSUN, we obtain sample quality similar to ProgressiveGAN. Our implementation is available at this https URL
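For reference, the simplified noise-prediction objective behind this abstract can be sketched as follows, assuming an epsilon-prediction network `model(x_t, t)` and a precomputed cumulative-product schedule `alphas_cumprod`:

```python
import torch
import torch.nn.functional as F

def ddpm_loss(model, x0, alphas_cumprod):
    """Simplified DDPM training step: add noise at a random timestep and
    regress the network output onto that noise."""
    b = x0.shape[0]
    t = torch.randint(0, alphas_cumprod.shape[0], (b,), device=x0.device)
    noise = torch.randn_like(x0)
    a_bar = alphas_cumprod[t].view(b, 1, 1, 1)
    x_t = a_bar.sqrt() * x0 + (1.0 - a_bar).sqrt() * noise   # forward process q(x_t | x_0)
    return F.mse_loss(model(x_t, t), noise)                  # the "simple" weighted bound
```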
Denoising diffusion probabilistic models (DDPMs) have achieved high quality image generation without adversarial training, yet they require simulating a Markov chain for many steps to produce a sample. To accelerate sampling, we present denoising diffusion implicit models (DDIMs), a more efficient class of iterative implicit probabilistic models with the same training procedure as DDPMs. In DDPMs, the generative process is defined as the reverse of a Markovian diffusion process. We construct a class of non-Markovian diffusion processes that lead to the same training objective, but whose reverse process can be much faster to sample from. We empirically demonstrate that DDIMs can produce high quality samples $10 \times$ to $50 \times$ faster in terms of wall-clock time compared to DDPMs, allow us to trade off computation for sample quality, and can perform semantically meaningful image interpolation directly in the latent space.
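A single deterministic DDIM update (the eta = 0 case) can be sketched as below; the `model(x_t, t)` epsilon-prediction interface is an assumption of this sketch.

```python
import torch

@torch.no_grad()
def ddim_step(model, x_t, t, t_prev, alphas_cumprod):
    """One deterministic DDIM update: predict x_0 from the current noise estimate,
    then jump directly to the earlier timestep without injecting fresh noise."""
    a_t = alphas_cumprod[t]
    a_prev = alphas_cumprod[t_prev] if t_prev >= 0 else x_t.new_tensor(1.0)
    eps = model(x_t, t)
    x0_pred = (x_t - (1 - a_t).sqrt() * eps) / a_t.sqrt()
    return a_prev.sqrt() * x0_pred + (1 - a_prev).sqrt() * eps
```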
Classifier guidance is a recently introduced method to trade off mode coverage and sample fidelity in conditional diffusion models post training, in the same spirit as low temperature sampling or truncation in other types of generative models. Classifier guidance combines the score estimate of a diffusion model with the gradient of an image classifier and thereby requires training an image classifier separate from the diffusion model. It also raises the question of whether guidance can be performed without a classifier. We show that guidance can be indeed performed by a pure generative model without such a classifier: in what we call classifier-free guidance, we jointly train a conditional and an unconditional diffusion model, and we combine the resulting conditional and unconditional score estimates to attain a trade-off between sample quality and diversity similar to that obtained using classifier guidance.
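The classifier-free guidance combination itself is a one-liner; the sketch below assumes a conditional model that accepts `cond=None` for the unconditional branch (conditioning dropped during training):

```python
def cfg_epsilon(model, x_t, t, cond, guidance_scale=7.5):
    """Classifier-free guidance: extrapolate from the unconditional toward the
    conditional noise estimate by the guidance scale."""
    eps_uncond = model(x_t, t, None)
    eps_cond = model(x_t, t, cond)
    return eps_uncond + guidance_scale * (eps_cond - eps_uncond)
```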
We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For conditional image synthesis, we further improve sample quality with classifier guidance: a simple, compute-efficient method for trading off diversity for fidelity using gradients from a classifier. We achieve an FID of 2.97 on ImageNet 128$\times$128, 4.59 on ImageNet 256$\times$256, and 7.72 on ImageNet 512$\times$512, and we match BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. Finally, we find that classifier guidance combines well with upsampling diffusion models, further improving FID to 3.94 on ImageNet 256$\times$256 and 3.85 on ImageNet 512$\times$512. We release our code at https://github.com/openai/guided-diffusion
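For comparison with classifier-free guidance, the classifier-guided variant described here shifts the noise estimate using the gradient of a separately trained, noise-aware classifier; a hedged sketch (the `classifier(x_t, t)` interface is an assumption) follows.

```python
import torch

def classifier_guided_epsilon(eps, x_t, t, y, classifier, alphas_cumprod, scale=1.0):
    """Shift an epsilon estimate with the gradient of log p(y | x_t) from a classifier
    trained on noisy images; the scale trades diversity for fidelity."""
    with torch.enable_grad():
        x_in = x_t.detach().requires_grad_(True)
        log_probs = torch.log_softmax(classifier(x_in, t), dim=-1)
        selected = log_probs[range(len(y)), y].sum()
        grad = torch.autograd.grad(selected, x_in)[0]
    return eps - scale * (1 - alphas_cumprod[t]).sqrt() * grad
```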
Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a given reference set and synthesize novel renditions of them in different contexts. In this work, we present a new approach for “personalization” of text-to-image diffusion models. Given as input just a few images of a subject, we fine-tune a pretrained text-to-image model such that it learns to bind a unique identifier with that specific subject. Once the subject is embedded in the output domain of the model, the unique identifier can be used to synthesize novel photorealistic images of the subject contextualized in different scenes. By leveraging the semantic prior embedded in the model with a new autogenous class-specific prior preservation loss, our technique enables synthesizing the subject in diverse scenes, poses, views and lighting conditions that do not appear in the reference images. We apply our technique to several previously-unassailable tasks, including subject recontextualization, text-guided view synthesis, and artistic rendering, all while preserving the subject's key features. We also provide a new dataset and evaluation protocol for this new task of subject-driven generation. Project page: https://dreambooth.github.io/
We explore a new class of diffusion models based on the transformer architecture. We train latent diffusion models of images, replacing the commonly-used U-Net backbone with a transformer that operates on latent patches. We analyze the scalability of our Diffusion Transformers (DiTs) through the lens of forward pass complexity as measured by Gflops. We find that DiTs with higher Gflops—through increased transformer depth/width or increased number of input tokens—consistently have lower FID. In addition to possessing good scalability properties, our largest DiT-XL/2 models outperform all prior diffusion models on the class-conditional ImageNet 512×512 and 256×256 benchmarks, achieving a state-of-the-art FID of 2.27 on the latter.
We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation. Our key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis: increasing the size of the language model in Imagen boosts both sample fidelity and image-text alignment much more than increasing the size of the image diffusion model. Imagen achieves a new state-of-the-art FID score of 7.27 on the COCO dataset, without ever training on COCO, and human raters find Imagen samples to be on par with the COCO data itself in image-text alignment. To assess text-to-image models in greater depth, we introduce DrawBench, a comprehensive and challenging benchmark for text-to-image models. With DrawBench, we compare Imagen with recent methods including VQ-GAN+CLIP, Latent Diffusion Models, and DALL-E 2, and find that human raters prefer Imagen over other models in side-by-side comparisons, both in terms of sample quality and image-text alignment. See https://imagen.research.google/ for an overview of the results.
Denoising diffusion probabilistic models (DDPM) are a class of generative models which have recently been shown to produce excellent samples. We show that with a few simple modifications, DDPMs can also achieve competitive log-likelihoods while maintaining high sample quality. Additionally, we find that learning variances of the reverse diffusion process allows sampling with an order of magnitude fewer forward passes with a negligible difference in sample quality, which is important for the practical deployment of these models. We additionally use precision and recall to compare how well DDPMs and GANs cover the target distribution. Finally, we show that the sample quality and likelihood of these models scale smoothly with model capacity and training compute, making them easily scalable. We release our code at https://github.com/openai/improved-diffusion
Diffusion models have recently been shown to generate high-quality synthetic images, especially when paired with a guidance technique to trade off diversity for fidelity. We explore diffusion models for the problem of text-conditional image synthesis and compare two different guidance strategies: CLIP guidance and classifier-free guidance. We find that the latter is preferred by human evaluators for both photorealism and caption similarity, and often produces photorealistic samples. Samples from a 3.5 billion parameter text-conditional diffusion model using classifier-free guidance are favored by human evaluators to those from DALL-E, even when the latter uses expensive CLIP reranking. Additionally, we find that our models can be fine-tuned to perform image inpainting, enabling powerful text-driven image editing. We train a smaller model on a filtered dataset and release the code and weights at https://github.com/openai/glide-text2im.
Recent years have witnessed the strong power of large text-to-image diffusion models for the impressive generative capability to create high-fidelity images. However, it is very tricky to generate desired images using only text prompt as it often involves complex prompt engineering. An alternative to text prompt is image prompt, as the saying goes:"an image is worth a thousand words". Although existing methods of direct fine-tuning from pretrained models are effective, they require large computing resources and are not compatible with other base models, text prompt, and structural controls. In this paper, we present IP-Adapter, an effective and lightweight adapter to achieve image prompt capability for the pretrained text-to-image diffusion models. The key design of our IP-Adapter is decoupled cross-attention mechanism that separates cross-attention layers for text features and image features. Despite the simplicity of our method, an IP-Adapter with only 22M parameters can achieve comparable or even better performance to a fully fine-tuned image prompt model. As we freeze the pretrained diffusion model, the proposed IP-Adapter can be generalized not only to other custom models fine-tuned from the same base model, but also to controllable generation using existing controllable tools. With the benefit of the decoupled cross-attention strategy, the image prompt can also work well with the text prompt to achieve multimodal image generation. The project page is available at \url{https://ip-adapter.github.io}.
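The decoupled cross-attention described above can be sketched as a module with a second, trainable key/value path for the image features whose output is added to the text attention output; head splitting and exact dimensions are omitted and are assumptions of this sketch.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DecoupledCrossAttention(nn.Module):
    """Single-head sketch: the original text cross-attention path plus a separate,
    trainable key/value projection for image-prompt features."""
    def __init__(self, dim, text_dim, image_dim):
        super().__init__()
        self.to_q = nn.Linear(dim, dim)
        self.to_k_text = nn.Linear(text_dim, dim)
        self.to_v_text = nn.Linear(text_dim, dim)
        self.to_k_img = nn.Linear(image_dim, dim)    # new, trainable
        self.to_v_img = nn.Linear(image_dim, dim)    # new, trainable

    def forward(self, x, text_feats, image_feats, scale=1.0):
        q = self.to_q(x)
        out_text = F.scaled_dot_product_attention(q, self.to_k_text(text_feats),
                                                  self.to_v_text(text_feats))
        out_img = F.scaled_dot_product_attention(q, self.to_k_img(image_feats),
                                                 self.to_v_img(image_feats))
        return out_text + scale * out_img            # sum the two attention outputs
```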
We present Scalable Interpolant Transformers (SiT), a family of generative models built on the backbone of Diffusion Transformers (DiT). The interpolant framework, which allows for connecting two distributions in a more flexible way than standard diffusion models, makes possible a modular study of various design choices impacting generative models built on dynamical transport: learning in discrete or continuous time, the objective function, the interpolant that connects the distributions, and deterministic or stochastic sampling. By carefully introducing the above ingredients, SiT surpasses DiT uniformly across model sizes on the conditional ImageNet 256x256 and 512x512 benchmark using the exact same model structure, number of parameters, and GFLOPs. By exploring various diffusion coefficients, which can be tuned separately from learning, SiT achieves an FID-50K score of 2.06 and 2.62, respectively.
Recent studies have shown that the denoising process in (generative) diffusion models can induce meaningful (discriminative) representations inside the model, though the quality of these representations still lags behind those learned through recent self-supervised learning methods. We argue that one main bottleneck in training large-scale diffusion models for generation lies in effectively learning these representations. Moreover, training can be made easier by incorporating high-quality external visual representations, rather than relying solely on the diffusion models to learn them independently. We study this by introducing a straightforward regularization called REPresentation Alignment (REPA), which aligns the projections of noisy input hidden states in denoising networks with clean image representations obtained from external, pretrained visual encoders. The results are striking: our simple strategy yields significant improvements in both training efficiency and generation quality when applied to popular diffusion and flow-based transformers, such as DiTs and SiTs. For instance, our method can speed up SiT training by over 17.5$\times$, matching the performance (without classifier-free guidance) of a SiT-XL model trained for 7M steps in less than 400K steps. In terms of final generation quality, our approach achieves state-of-the-art results of FID=1.42 using classifier-free guidance with the guidance interval.
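The REPA-style regularizer reduces to a projected-feature similarity term; a hedged sketch follows, where the projector architecture, the choice of layer, and the loss weighting are assumptions of this sketch rather than the paper's exact recipe.

```python
import torch.nn.functional as F

def representation_alignment_loss(hidden_states, target_feats, projector):
    """Align projected denoiser hidden states with features of the clean image
    from a frozen pretrained encoder (negative cosine similarity)."""
    pred = F.normalize(projector(hidden_states), dim=-1)
    target = F.normalize(target_feats, dim=-1)
    return -(pred * target).sum(dim=-1).mean()
```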
Masked (or absorbing) diffusion is actively explored as an alternative to autoregressive models for generative modeling of discrete data. However, existing work in this area has been hindered by unnecessarily complex model formulations and unclear relationships between different perspectives, leading to suboptimal parameterization, training objectives, and ad hoc adjustments to counteract these issues. In this work, we aim to provide a simple and general framework that unlocks the full potential of masked diffusion models. We show that the continuous-time variational objective of masked diffusion models is a simple weighted integral of cross-entropy losses. Our framework also enables training generalized masked diffusion models with state-dependent masking schedules. When evaluated by perplexity, our models trained on OpenWebText surpass prior diffusion language models at GPT-2 scale and demonstrate superior performance on 4 out of 5 zero-shot language modeling tasks. Furthermore, our models vastly outperform previous discrete diffusion models on pixel-level image modeling, achieving 2.75 (CIFAR-10) and 3.40 (ImageNet 64x64) bits per dimension that are better than autoregressive models of similar sizes. Our code is available at https://github.com/google-deepmind/md4.
We propose a diffusion distillation method that achieves new state-of-the-art in one-step/few-step 1024px text-to-image generation based on SDXL. Our method combines progressive and adversarial distillation to achieve a balance between quality and mode coverage. In this paper, we discuss the theoretical analysis, discriminator design, model formulation, and training techniques. We open-source our distilled SDXL-Lightning models both as LoRA and full UNet weights.
We present novel approaches involving generative adversarial networks and diffusion models in order to synthesize high quality, live and spoof fingerprint images while preserving features such as uniqueness and diversity. We generate live fingerprints from noise with a variety of methods, and we use image translation techniques to translate live fingerprint images to spoof. To generate different types of spoof images based on limited training data we incorporate style transfer techniques through a cycle autoencoder equipped with a Wasserstein metric along with Gradient Penalty (CycleWGAN-GP) in order to avoid mode collapse and instability. We find that when the spoof training data includes distinct spoof characteristics, it leads to improved live-to-spoof translation. We assess the diversity and realism of the generated live fingerprint images mainly through the Fréchet Inception Distance (FID) and the False Acceptance Rate (FAR). Our best diffusion model achieved a FID of 15.78. The comparable WGAN-GP model achieved slightly higher FID while performing better in the uniqueness assessment due to a slightly lower FAR when matched against the training data, indicating better creativity. Moreover, we give example images showing that a DDPM model clearly can generate realistic fingerprint images.
The progress of fingerprint recognition applications encounters substantial hurdles due to privacy and security concerns, leading to limited fingerprint data availability and stringent data quality requirements. This article endeavors to tackle the challenges of data scarcity and data quality in fingerprint recognition by implementing data augmentation techniques. Specifically, this research employed two state-of-the-art generative models in the domain of deep learning, namely Deep Convolutional Generative Adversarial Network (DCGAN) and the Diffusion model, for fingerprint data augmentation. Generative Adversarial Network (GAN), as a popular generative model, effectively captures the features of sample images and learns the diversity of the sample images, thereby generating realistic and diverse images. DCGAN, as a variant model of traditional GAN, inherits the advantages of GAN while alleviating issues such as blurry images and mode collapse, resulting in improved performance. On the other hand, Diffusion, as one of the most popular generative models in recent years, exhibits outstanding image generation capabilities and surpasses traditional GAN in some image generation tasks. The experimental results demonstrate that both DCGAN and Diffusion can generate clear, high-quality fingerprint images, fulfilling the requirements of fingerprint data augmentation. Furthermore, through the comparison between DCGAN and Diffusion, it is concluded that the quality of fingerprint images generated by DCGAN is superior to the results of Diffusion, and DCGAN exhibits higher efficiency in both training and generating images compared to Diffusion.
Fingerprint reconstruction from minutiae performed by model-based approaches often leads to fingerprint patterns that lack realism. In contrast, data-driven reconstruction leads to realistic fingerprints, but the reproduction of a fingerprint’s identity remains a challenging problem. In this paper, we examine the pix2pix network as a fit for the reconstruction of realistic high-quality fingerprint images from minutiae maps. For encoding minutiae in minutiae maps we propose directed line and pointing minutiae approaches. We extend the pix2pix architecture to process complete plain fingerprints at their native resolution. Although our focus is on biometric fingerprints, the same concept fits the synthesis of latent fingerprints. We train models based on real and synthetic datasets and compare their performances regarding realistic appearance of generated fingerprints and reconstruction success. Our experiments establish pix2pix to be a valid and scalable solution. Reconstruction from minutiae enables identity-aware generation of synthetic fingerprints, which in turn enables compilation of large-scale privacy-friendly synthetic fingerprint datasets including mated impressions.
We introduce MMaDA, a novel class of multimodal diffusion foundation models designed to achieve superior performance across diverse domains such as textual reasoning, multimodal understanding, and text-to-image generation. The approach is distinguished by three key innovations: (i) MMaDA adopts a unified diffusion architecture with a shared probabilistic formulation and a modality-agnostic design, eliminating the need for modality-specific components. This architecture ensures seamless integration and processing across different data types. (ii) We implement a mixed long chain-of-thought (CoT) fine-tuning strategy that curates a unified CoT format across modalities. By aligning reasoning processes between textual and visual domains, this strategy facilitates cold-start training for the final reinforcement learning (RL) stage, thereby enhancing the model's ability to handle complex tasks from the outset. (iii) We propose UniGRPO, a unified policy-gradient-based RL algorithm specifically tailored for diffusion foundation models. Utilizing diversified reward modeling, UniGRPO unifies post-training across both reasoning and generation tasks, ensuring consistent performance improvements. Experimental results demonstrate that MMaDA-8B exhibits strong generalization capabilities as a unified multimodal foundation model. It surpasses powerful models like LLaMA-3-7B and Qwen2-7B in textual reasoning, outperforms Show-o and SEED-X in multimodal understanding, and excels over SDXL and Janus in text-to-image generation. These achievements highlight MMaDA's effectiveness in bridging the gap between pretraining and post-training within unified diffusion architectures, providing a comprehensive framework for future research and development. We open-source our code and trained models at: https://github.com/Gen-Verse/MMaDA
Diffusion models have demonstrated remarkable performance in generating unimodal data across various tasks, including image, video, and text generation. On the contrary, the joint generation of multimodal data through diffusion models is still in the early stages of exploration. Existing approaches heavily rely on external preprocessing protocols, such as tokenizers and variational autoencoders, to harmonize varied data representations into a unified, unimodal format. This process heavily demands the high accuracy of encoders and decoders, which can be problematic for applications with limited data. To lift this restriction, we propose a novel framework for building multimodal diffusion models on arbitrary state spaces, enabling native generation of coupled data across different modalities. By introducing an innovative decoupled noise schedule for each modality, we enable both unconditional and modality-conditioned generation within a single model simultaneously. We empirically validate our approach for text-image generation and mixed-type tabular data synthesis, demonstrating that it achieves competitive performance.
Multimodal Diffusion Transformers (MM-DiTs) have achieved remarkable progress in text-driven visual generation. However, even state-of-the-art MM-DiT models like FLUX struggle with achieving precise alignment between text prompts and generated content. We identify two key issues in the attention mechanism of MM-DiT, namely 1) the suppression of cross-modal attention due to token imbalance between visual and textual modalities and 2) the lack of timestep-aware attention weighting, which hinder the alignment. To address these issues, we propose \textbf{Temperature-Adjusted Cross-modal Attention (TACA)}, a parameter-efficient method that dynamically rebalances multimodal interactions through temperature scaling and timestep-dependent adjustment. When combined with LoRA fine-tuning, TACA significantly enhances text-image alignment on the T2I-CompBench benchmark with minimal computational overhead. We tested TACA on state-of-the-art models like FLUX and SD3.5, demonstrating its ability to improve image-text alignment in terms of object appearance, attribute binding, and spatial relationships. Our findings highlight the importance of balancing cross-modal attention in improving semantic fidelity in text-to-image diffusion models. Our codes are publicly available at \href{https://github.com/Vchitect/TACA}
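Without access to the paper's exact formulation, a purely illustrative sketch of temperature-scaled, timestep-dependent cross-modal attention might look like the following; the schedule, constants, and single-head layout are hypothetical.

```python
import torch

def temperature_scaled_cross_attention(q, k_text, v_text, t, base_temp=1.2, decay=0.5):
    """Hypothetical sketch: rebalance attention toward text tokens by scaling the
    logits with a temperature; here the boost is strongest at low-noise timesteps."""
    temp = 1.0 + (base_temp - 1.0) * torch.exp(torch.tensor(-decay * float(t)))
    logits = q @ k_text.transpose(-2, -1) / (q.shape[-1] ** 0.5)
    attn = torch.softmax(temp * logits, dim=-1)
    return attn @ v_text
```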
Diffusion models have exhibited exceptional performance in text-to-image generation and editing. However, existing methods often face challenges when handling complex text prompts that involve multiple objects with multiple attributes and relationships. In this paper, we propose a brand new training-free text-to-image generation/editing framework, namely Recaption, Plan and Generate (RPG), harnessing the powerful chain-of-thought reasoning ability of multimodal LLMs to enhance the compositionality of text-to-image diffusion models. Our approach employs the MLLM as a global planner to decompose the process of generating complex images into multiple simpler generation tasks within subregions. We propose complementary regional diffusion to enable region-wise compositional generation. Furthermore, we integrate text-guided image generation and editing within the proposed RPG in a closed-loop fashion, thereby enhancing generalization ability. Extensive experiments demonstrate our RPG outperforms state-of-the-art text-to-image diffusion models, including DALL-E 3 and SDXL, particularly in multi-category object composition and text-image semantic alignment. Notably, our RPG framework exhibits wide compatibility with various MLLM architectures (e.g., MiniGPT-4) and diffusion backbones (e.g., ControlNet). Our code is available at: https://github.com/YangLing0818/RPG-DiffusionMaster
In controllable image synthesis, generating coherent and consistent images from multiple references with spatial layout awareness remains an open challenge. We propose LAMIC, a Layout-Aware Multi-Image Composition framework that, for the first time, extends single-reference diffusion models to multi-reference scenarios in a training-free manner. Built upon the MMDiT model, LAMIC introduces two plug-and-play attention mechanisms: 1) Group Isolation Attention (GIA) to enhance entity disentanglement; and 2) Region-Modulated Attention (RMA) to enable layout-aware generation. To comprehensively evaluate model capabilities, we further introduce three metrics: 1) Inclusion Ratio (IN-R) and Fill Ratio (FI-R) for assessing layout control; and 2) Background Similarity (BG-S) for measuring background consistency. Extensive experiments show that LAMIC achieves state-of-the-art performance across most major metrics: it consistently outperforms existing multi-reference baselines in ID-S, BG-S, IN-R and AVG scores across all settings, and achieves the best DPG in complex composition tasks. These results demonstrate LAMIC's superior abilities in identity keeping, background preservation, layout control, and prompt-following, all achieved without any training or fine-tuning, showcasing strong zero-shot generalization ability. By inheriting the strengths of advanced single-reference models and enabling seamless extension to multi-image scenarios, LAMIC establishes a new training-free paradigm for controllable multi-image composition. As foundation models continue to evolve, LAMIC's performance is expected to scale accordingly.
With the development of diffusion models, enhancing spatial controllability in text-to-image generation has become a vital challenge. As a representative task for addressing this challenge, layout-to-image generation aims to generate images that are spatially consistent with the given layout condition. Existing layout-to-image methods typically introduce the layout condition by integrating adapter modules into the base generative model. However, the generated images often exhibit low visual quality and stylistic inconsistency with the base model, indicating a loss of pretrained knowledge. To alleviate this issue, we construct the Layout Synthesis (LaySyn) dataset, which leverages images synthesized by the base model itself to mitigate the distribution shift from the pretraining data. Moreover, we propose the Layout Control (Laytrol) Network, in which parameters are inherited from MM-DiT to preserve the pretrained knowledge of the base model. To effectively activate the copied parameters and avoid disturbance from unstable control conditions, we adopt a dedicated initialization scheme for Laytrol. In this scheme, the layout encoder is initialized as a pure text encoder to ensure that its output tokens remain within the data domain of MM-DiT. Meanwhile, the outputs of the layout control network are initialized to zero. In addition, we apply Object-level Rotary Position Embedding to the layout tokens to provide coarse positional information. Qualitative and quantitative experiments demonstrate the effectiveness of our method.
This paper presents ThinkDiff, a novel alignment paradigm that empowers text-to-image diffusion models with multimodal in-context understanding and reasoning capabilities by integrating the strengths of vision-language models (VLMs). Existing multimodal diffusion finetuning methods largely focus on pixel-level reconstruction rather than in-context reasoning, and are constrained by the complexity and limited availability of reasoning-based datasets. ThinkDiff addresses these challenges by leveraging vision-language training as a proxy task, aligning VLMs with the decoder of an encoder-decoder large language model (LLM) instead of a diffusion decoder. This proxy task builds on the observation that the $\textbf{LLM decoder}$ shares the same input feature space with $\textbf{diffusion decoders}$ that use the corresponding $\textbf{LLM encoder}$ for prompt embedding. As a result, aligning VLMs with diffusion decoders can be simplified through alignment with the LLM decoder. Without complex training and datasets, ThinkDiff effectively unleashes understanding, reasoning, and composing capabilities in diffusion models. Experiments demonstrate that ThinkDiff significantly improves accuracy from 19.2% to 46.3% on the challenging CoBSAT benchmark for multimodal in-context reasoning generation, with only 5 hours of training on 4 A100 GPUs. Additionally, ThinkDiff demonstrates exceptional performance in composing multiple images and texts into logically coherent images. Project page: https://mizhenxing.github.io/ThinkDiff.
Due to its distinctive texture and intricate details, palmprint has emerged as a critical modality in biometric identity recognition. The absence of large-scale public palmprint datasets has substantially impeded the advancement of palmprint research, resulting in inadequate accuracy in commercial palmprint recognition systems. Moreover, existing generative methods exhibit insufficient generalization, as the images they generate differ in specific ways from the conditional images. This paper proposes a method for generating palmprint images using a controllable diffusion model (PalmDiff), which addresses the issue of insufficient datasets by generating palmprint data, improving the accuracy of palmprint recognition. We introduce a diffusion process that effectively tackles the problems of excessive noise and loss of texture details commonly encountered in diffusion models. A linear attention mechanism is employed to enhance the backbone’s expressive capacity and reduce the computational complexity. In addition, we propose an ID loss function that enables the diffusion model to consistently generate palmprint images within the same identity space. PalmDiff is compared with other generation methods in terms of both image quality and the enhancement of palmprint recognition performance. Experiments show that PalmDiff performs well in image generation, with an FID score of 13.311 on MPD and 18.434 on Tongji. Moreover, PalmDiff significantly improves various backbones for palmprint recognition compared with other generation methods.
Palmprint recognition is significantly limited by the lack of large-scale publicly available datasets. Previous methods have adopted Bézier curves to simulate the palm creases, which then serve as input for conditional GANs to generate realistic palmprints. However, without employing real data fine-tuning, the performance of the recognition model trained on these synthetic datasets would drastically decline, indicating a large gap between generated and real palmprints. This is primarily due to the utilization of an inaccurate palm crease representation and challenges in balancing intra-class variation with identity consistency. To address this, we introduce a polynomial-based palm crease representation that provides a new palm crease generation mechanism more closely aligned with the real distribution. We also propose the palm creases conditioned diffusion model with a novel intra-class variation control method. By applying our proposed K-step noise-sharing sampling, we are able to synthesize palmprint datasets with large intra-class variation and high identity consistency. Experimental results show that, for the first time, recognition models trained solely on our synthetic datasets, without any fine-tuning, outperform those trained on real datasets. Furthermore, our approach achieves superior recognition performance as the number of generated identities increases.
Training fingerprint recognition models using synthetic data has recently gained increased attention in the biometric community as it alleviates the dependency on sensitive personal data. Existing approaches for fingerprint generation are limited in their ability to generate diverse impressions of the same finger, a key property for providing effective data for training recognition models. To address this gap, we present FPGAN-Control, an identity preserving image generation framework which enables control over the fingerprint’s image appearance (e.g., fingerprint type, acquisition device, pressure level) of generated fingerprints. We introduce a novel appearance loss that encourages disentanglement between the fingerprint’s identity and appearance properties. In our experiments, we used the publicly available NIST SD302 (N2N) dataset for training the FPGAN-Control model. We demonstrate the merits of FPGAN-Control, both quantitatively and qualitatively, in terms of identity preservation level, degree of appearance control, and low synthetic-to-real domain gap. Finally, training recognition models using only synthetic datasets generated by FPGAN-Control leads to recognition accuracies that are on par with or even surpass models trained using real data. To the best of our knowledge, this is the first work to demonstrate this.