U-Net Saliency Detection
U-Net saliency detection methods based on architectural improvements
These works focus on directly optimizing the U-Net backbone, integrating attention mechanisms, multi-scale modules, and novel architectural components to strengthen salient-feature extraction and semantic understanding.
- Salient Region Detection in Images Based on U-Net and Deep Learning(K. Kumar, M. Marimuthu, Ayan Das Gupta, Bhasker Pant, Surendra Kumar Shukla, Dhiraj Kapila, 2022, 2022 Second International Conference on Advanced Technologies in Intelligent Control, Environment, Computing & Communication Engineering (ICATIECE))
- Att-U2Net: Using Attention to Enhance Semantic Representation for Salient Object Detection(Chenzhe Jiang, Banglian Xu, Qinghe Zheng, Zhengtao Li, Leihong Zhang, Zimin Shen, Quan Sun, Dawei Zhang, 2024, IET Signal Processing)
- Salient object detection via multi-scale attention CNN(Yuzhu Ji, Haijun Zhang, Q. M. J. Wu, 2018, Neurocomputing)
- SA-UNet: A Saliency-Aware Segmentation Network for Waterlogging Disaster Identification in Urban Rail Transit Systems(Jiaying Fan, Xinbo Jiang, Changyuan Chen, Yang Li, Shilun Ma, Jiakai Tian, Rui Guo, 2025, 2025 IEEE 6th International Conference on Computer, Big Data, Artificial Intelligence (ICCBD+AI))
- CMAD-UNet: UNet-Driven RGB-D Salient Object Detection with Cross-Modal Consistency and Aggregative Decoding(Qi Xu, Zhaozhao Su, Zhaoru Guo, Yongming Li, Liejun Wang, Panpan Zheng, 2025, Proceedings of the 2025 International Conference on Multimedia Retrieval)
- Multiscale Cascaded Attention Network for Saliency Detection Based on ResNet(Muwei Jian, Haodong Jin, Xiangyu Liu, Linsong Zhang, 2022, Sensors)
- Hierarchical U-Shape Attention Network for Salient Object Detection(Sanping Zhou, Jinjun Wang, Jimuyang Zhang, Le Wang, Dong Huang, S. Du, N. Zheng, 2020, IEEE Transactions on Image Processing)
- An Improved UNet Algorithm Based on Multiscale Features and Attention Modules for Underwater Salient Multi-Target Detection(Haibin Han, Xinyi Zhou, Haodi Zhu, Yueyi Qiao, Shaojian Yang, Yan Wei, Fengzhong Qu, 2025, OCEANS 2025 Brest)
Edge-aware and structure-guided refinement
These methods target the boundary-blur problem in saliency detection, introducing edge-prediction constraints, reverse attention mechanisms, and structured feature decomposition to achieve high-quality edge preservation and complete object segmentation.
- BASNet: Boundary-Aware Salient Object Detection(Xuebin Qin, Zichen Zhang, Chenyang Huang, Chao Gao, Masood Dehghan, Martin Jägersand, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Revise-Net: Exploiting Reverse Attention Mechanism for Salient Object Detection(Rukhshanda Hussain, Yash Karbhari, Muhammad Fazal Ijaz, M. Woźniak, P. Singh, R. Sarkar, 2021, Remote Sensing)
- Multi-scale feature aggregation and boundary awareness network for salient object detection(Qin Wu, Jianzhe Wang, Zhilei Chai, Guodong Guo, 2022, Image and Vision Computing)
- Disentangled High Quality Salient Object Detection(Lv Tang, Bo Li, Shouhong Ding, Mofei Song, 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))
- Decomposition and Completion Network for Salient Object Detection(Zhe Wu, Li Su, Qingming Huang, 2021, IEEE Transactions on Image Processing)
- SODU2-NET: a novel deep learning-based approach for salient object detection utilizing U-NET(Hyder Abbas, Sheng Ren, M. Asim, Syeda Iqra Hassan, A. El-latif, 2025, PeerJ Computer Science)
- EGNet: Edge Guidance Network for Salient Object Detection(Jiaxing Zhao, Jiangjiang Liu, Deng-Ping Fan, Yang Cao, Jufeng Yang, Ming-Ming Cheng, 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV))
- Convolutional Edge Constraint-Based U-Net for Salient Object Detection(L. Han, Xuelong Li, Yongsheng Dong, 2019, IEEE Access)
- Stacked Cross Refinement Network for Edge-Aware Salient Object Detection(Zhe Wu, Li Su, Qingming Huang, 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV))
RGB-D and multi-modal saliency fusion
These approaches use depth, thermal, or semantic information to complement RGB images, relying on cross-modal interaction and complementary feature enhancement to improve detection accuracy in complex scenes.
- Pushing the Boundaries of Salient Object Detection: A Denoising-Driven Approach(Mengke Song, Luming Li, Xu Yu, Chenglizhao Chen, 2025, IEEE Transactions on Image Processing)
- TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network(Zhengyi Liu, Yuan Wang, Zhengzheng Tu, Yun Xiao, Bin Tang, 2021, Proceedings of the 29th ACM International Conference on Multimedia)
- Hybrid-Attention Network for RGB-D Salient Object Detection(Yuzhen Chen, Wujie Zhou, 2020, Applied Sciences)
- RGB-D Salient Object Detection via 3D Convolutional Neural Networks(Qian Chen, Ze Liu, Y. Zhang, Keren Fu, Qijun Zhao, H. Du, 2021, Proceedings of the AAAI Conference on Artificial Intelligence)
- Real-Time One-Stream Semantic-Guided Refinement Network for RGB-Thermal Salient Object Detection(Fushuo Huo, Xuegui Zhu, Q. Zhang, Ziming Liu, Wenchao Yu, 2022, IEEE Transactions on Instrumentation and Measurement)
- 3-D Convolutional Neural Networks for RGB-D Salient Object Detection and Beyond(Qian Chen, Zhenxi Zhang, Yanye Lu, Keren Fu, Qijun Zhao, 2022, IEEE Transactions on Neural Networks and Learning Systems)
- Deep RGB-D Saliency Detection Without Depth(Yuan-fang Zhang, Jiangbin Zheng, W. Jia, Wenfeng Huang, Long Li, Nian Liu, Fei Li, Xiangjian He, 2021, IEEE Transactions on Multimedia)
- Select, Supplement and Focus for RGB-D Saliency Detection(Miao Zhang, Weisong Ren, Yongri Piao, Zhengkun Rong, Huchuan Lu, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- UMINet: a unified multi-modality interaction network for RGB-D and RGB-T salient object detection(Lina Gao, P. Fu, Mingzhu Xu, Tiantian Wang, Bing Liu, 2023, The Visual Computer)
- Calibrated RGB-D Salient Object Detection(Wei Ji, Jingjing Li, Shuang Yu, Miao Zhang, Yongri Piao, S. Yao, Qi Bi, Kai Ma, Yefeng Zheng, Huchuan Lu, Li Cheng, 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Specificity-preserving RGB-D saliency detection(Tao Zhou, Huazhu Fu, Geng Chen, Yi Zhou, Deng-Ping Fan, Ling Shao, 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))
- CATNet: A Cascaded and Aggregated Transformer Network for RGB-D Salient Object Detection(Fuming Sun, Pengfei Ren, Bo Yin, Fasheng Wang, Haojie Li, 2024, IEEE Transactions on Multimedia)
- Advancing in RGB-D Salient Object Detection: A Survey(Ai Chen, Xin Li, Tianxiang He, Junlin Zhou, Du Chen, 2024, Applied Sciences)
- Delving into Calibrated Depth for Accurate RGB-D Salient Object Detection(Jingjing Li, Wei Ji, Miao Zhang, Yongri Piao, Huchuan Lu, Li Cheng, 2022, International Journal of Computer Vision)
Lightweight network design and real-time saliency inference
These works target inference efficiency, using parameter reduction, efficient pooling modules, and lightweight backbone designs to achieve real-time saliency prediction on edge devices.
- LARNet: Towards Lightweight, Accurate and Real-Time Salient Object Detection(Zhenyu Wang, Yunzhou Zhang, Yan Liu, Cao Qin, Sonya A. Coleman, D. Kerr, 2024, IEEE Transactions on Multimedia)
- A Simple Pooling-Based Design for Real-Time Salient Object Detection(Jiangjiang Liu, Qibin Hou, Ming-Ming Cheng, Jiashi Feng, Jianmin Jiang, 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- FasterSal: Robust and Real-Time Single-Stream Architecture for RGB-D Salient Object Detection(Jin Zhang, Ruiheng Zhang, Lixin Xu, Xiankai Lu, Yushu Yu, Min Xu, He Zhao, 2025, IEEE Transactions on Multimedia)
- ELWNet: An Extremely Lightweight Approach for Real-Time Salient Object Detection(Zhenyu Wang, Yunzhou Zhang, Yan Liu, Delong Zhu, Sonya A. Coleman, D. Kerr, 2023, IEEE Transactions on Circuits and Systems for Video Technology)
- MEANet: Multi-modal edge-aware network for light field salient object detection(Yao Jiang, Wenbo Zhang, Keren Fu, Qijun Zhao, 2022, Neurocomputing)
- Meanet: An Effective and Lightweight Solution for Salient Object Detection in Optical Remote Sensing Images(Bocheng Liang, Huilan Luo, 2023, Expert Systems with Applications)
Global perception with Transformers and sequence modeling
These methods exploit the long-range dependency modeling of Transformers to replace or augment conventional convolutional layers, improving global perception of salient objects and their separation from the background.
- UNETRSal: Saliency Prediction with Hybrid Transformer-Based Architecture(Azamat Kaibaldiyev, Jérémie Pantin, Alexis Lechervy, Fabrice Maurel, Youssef Chahir, Gael Dias, 2025, Lecture Notes in Computer Science)
- TranSalNet: Towards perceptually relevant visual saliency prediction(Jianxun Lou, Hanhe Lin, David Marshall, D. Saupe, Hantao Liu, 2021, Neurocomputing)
- Visual Saliency Transformer(Nian Liu, Ni Zhang, Kaiyuan Wan, Junwei Han, Ling Shao, 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))
Video saliency and joint spatiotemporal modeling
Targeting the dynamic nature of video sequences, these approaches capture motion saliency through spatiotemporal correlation modeling while improving the computational efficiency of video processing.
- Salient Object Detection by Spatiotemporal and Semantic Features in Real-Time Video Processing Systems(Yuming Fang, Guanqun Ding, Wenying Wen, Feiniu Yuan, Yong Yang, Zhijun Fang, Weisi Lin, 2020, IEEE Transactions on Industrial Electronics)
- Exploring Rich and Efficient Spatial Temporal Interactions for Real-Time Video Salient Object Detection(Chenglizhao Chen, Guotao Wang, Chong Peng, Yuming Fang, Dingwen Zhang, Hong Qin, 2020, IEEE Transactions on Image Processing)
- Transformer-Based Multi-Scale Feature Integration Network for Video Saliency Prediction(Xiaofei Zhou, Songhe Wu, Ran Shi, Bolun Zheng, Shuai Wang, Haibing Yin, Jiyong Zhang, C. Yan, 2023, IEEE Transactions on Circuits and Systems for Video Technology)
- Real-time Surveillance Video Salient Object Detection Using Collaborative Cloud-Edge Deep Reinforcement Learning(Biao Hou, Junxing Zhang, 2021, 2021 International Joint Conference on Neural Networks (IJCNN))
- Improved salient object detection using hybrid Convolution Recurrent Neural Network(N. V. Kousik, Yuvaraj Natarajan, R. Raja, Suresh Kallam, Rizwan Patan, Amir H. Gandomi, 2021, Expert Systems with Applications)
- Multi-Scale Spatiotemporal Feature Fusion Network for Video Saliency Prediction(Yunzuo Zhang, Tian Zhang, Cunyu Wu, Ran Tao, 2024, IEEE Transactions on Multimedia)
General saliency modeling and multi-scale feature aggregation
This category covers multi-scale context extraction, distraction-resistant designs, and foundational studies, providing broad methodological support for saliency detection.
- Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector(Dingwen Zhang, Junwei Han, Yu Zhang, 2017, 2017 IEEE International Conference on Computer Vision (ICCV))
- Deep Salient Object Detection With Dense Connections and Distraction Diagnosis(Huaxin Xiao, Jiashi Feng, Yunchao Wei, Maojun Zhang, Shuicheng Yan, 2018, IEEE Transactions on Multimedia)
- Visual saliency based on multiscale deep features(Guanbin Li, Yizhou Yu, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR))
- Superpixel attention guided network for accurate and real-time salient object detection(Zhiheng Zhou, Yongfan Guo, Junchu Huang, Ming Dai, Ming Deng, Qingjun Yu, 2022, Multimedia Tools and Applications)
- UDNet: Uncertainty-aware deep network for salient object detection(Yuming Fang, Haiyan Zhang, Jiebin Yan, Wenhui Jiang, Yang Liu, 2022, Pattern Recognition)
- The Prediction of Saliency Map for Head and Eye Movements in 360 Degree Images(Yucheng Zhu, Guangtao Zhai, Xiongkuo Min, Jiantao Zhou, 2020, IEEE Transactions on Multimedia)
- Salient Object Detection with Dynamic Convolutions(Rohit Venkata Sai Dulam, Chandra Kambhamettu, 2025, 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW))
- DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection(Nian Liu, Junwei Han, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR))
- Saliency Unified: A Deep Architecture for simultaneous Eye Fixation Prediction and Salient Object Segmentation(S. Kruthiventi, Vennela Gudisa, Jaley H. Dholakiya, R. Venkatesh Babu, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR))
- Multi-Scale Interactive Network for Salient Object Detection(Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))
- Deep Salient Object Detection With Contextual Information Guidance(Yi Liu, Jungong Han, Qiang Zhang, Caifeng Shan, 2020, IEEE Transactions on Image Processing)
- Multi-Scale Cascade Network for Salient Object Detection(X. Li, F. Yang, Hong Cheng, Junyu Chen, Yuxiao Guo, Leiting Chen, 2017, Proceedings of the 25th ACM international conference on Multimedia)
- Salient Object Detection Techniques in Computer Vision—A Survey(A. Gupta, Ayan Seal, M. Prasad, P. Khanna, 2020, Entropy)
- Shallow and Deep Convolutional Networks for Saliency Prediction(Junting Pan, E. Sayrol, Xavier Giro-i-Nieto, Kevin McGuinness, N. O’Connor, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR))
- Visual saliency detection via combining center prior and U-Net(Xiangwei Lu, Muwei Jian, Xing Wang, Hui Yu, Junyu Dong, K. Lam, 2022, Multimedia Systems)
This survey traces the broad application of U-Net and its variants to saliency detection. Research has expanded from basic architectural evolution toward multi-modal fusion, high-fidelity edge preservation, Transformer-based global modeling, real-time lightweight design, and spatiotemporal analysis of video. These directions complement one another and jointly push saliency detection toward industrial-grade applications with stronger robustness, finer detail recovery, and better real-time performance.
A total of 61 related publications are covered.
Salient object detection is receiving increasing attention from researchers, and an accurate saliency map is useful for subsequent tasks. However, in most saliency maps predicted by existing models, object regions are blurred and object edges are irregular. The reason is that traditional methods predict salient objects mainly from hand-crafted features, so pixels belonging to the same object are often assigned different saliency scores. In addition, convolutional neural network (CNN)-based models predict saliency maps at the patch scale, which makes object edges in the output fuzzy. In this paper, we add an edge convolution constraint to a modified U-Net to predict the saliency map of an image. The adopted network structure fuses features from different layers to reduce information loss. Our SalNet predicts the saliency map pixel by pixel rather than at the patch scale as CNN-based models do. Moreover, to better guide the network in mining object edge information, we design a new loss function based on image convolution, which places an L1 constraint on the edge information of the saliency map and the ground truth. Finally, experimental results show that SalNet is effective for salient object detection and is competitive with 11 state-of-the-art models.
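The edge constraint described above can be sketched in a few lines of PyTorch: both the predicted map and the ground truth are convolved with a fixed edge-extraction kernel, and an L1 penalty is placed on the difference. The Laplacian-style kernel and the helper name `edge_l1_loss` are illustrative choices, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def edge_l1_loss(pred, gt):
    """L1 distance between edge maps of the predicted and ground-truth saliency.

    pred, gt: (B, 1, H, W) tensors with values in [0, 1].
    """
    # Fixed Laplacian-style kernel used as the "image convolution" edge extractor.
    kernel = torch.tensor([[-1., -1., -1.],
                           [-1.,  8., -1.],
                           [-1., -1., -1.]],
                          device=pred.device, dtype=pred.dtype).view(1, 1, 3, 3)
    pred_edge = F.conv2d(pred, kernel, padding=1)
    gt_edge = F.conv2d(gt, kernel, padding=1)
    return F.l1_loss(pred_edge, gt_edge)
```

In practice such a term would be added to a standard pixel-wise loss (e.g. binary cross-entropy) with a weighting factor chosen on a validation set.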
Deep Convolutional Neural Networks have been adopted for salient object detection and achieved the state-of-the-art performance. Most of the previous works however focus on region accuracy but not on the boundary quality. In this paper, we propose a predict-refine architecture, BASNet, and a new hybrid loss for Boundary-Aware Salient object detection. Specifically, the architecture is composed of a densely supervised Encoder-Decoder network and a residual refinement module, which are respectively in charge of saliency prediction and saliency map refinement. The hybrid loss guides the network to learn the transformation between the input image and the ground truth in a three-level hierarchy -- pixel-, patch- and map- level -- by fusing Binary Cross Entropy (BCE), Structural SIMilarity (SSIM) and Intersection-over-Union (IoU) losses. Equipped with the hybrid loss, the proposed predict-refine architecture is able to effectively segment the salient object regions and accurately predict the fine structures with clear boundaries. Experimental results on six public datasets show that our method outperforms the state-of-the-art methods both in terms of regional and boundary evaluation measures. Our method runs at over 25 fps on a single GPU. The code is available at: https://github.com/NathanUA/BASNet.
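A minimal sketch of such a three-level hybrid loss is shown below, assuming the prediction has already passed through a sigmoid. The SSIM term uses simplified average-pooling windows rather than the Gaussian-weighted windows of the reference implementation, and the equal weighting of the three terms is an assumption.

```python
import torch
import torch.nn.functional as F

def iou_loss(pred, gt, eps=1e-6):
    # Map-level term: 1 - soft intersection-over-union per sample.
    inter = (pred * gt).sum(dim=(1, 2, 3))
    union = (pred + gt - pred * gt).sum(dim=(1, 2, 3))
    return (1.0 - (inter + eps) / (union + eps)).mean()

def ssim_loss(pred, gt, window=11, eps=1e-6):
    # Patch-level term: structural similarity computed over local windows
    # approximated with average pooling (simplified, not Gaussian-weighted).
    pad = window // 2
    mu_p = F.avg_pool2d(pred, window, 1, pad)
    mu_g = F.avg_pool2d(gt, window, 1, pad)
    var_p = F.avg_pool2d(pred * pred, window, 1, pad) - mu_p ** 2
    var_g = F.avg_pool2d(gt * gt, window, 1, pad) - mu_g ** 2
    cov = F.avg_pool2d(pred * gt, window, 1, pad) - mu_p * mu_g
    c1, c2 = 0.01 ** 2, 0.03 ** 2
    ssim = ((2 * mu_p * mu_g + c1) * (2 * cov + c2)) / (
        (mu_p ** 2 + mu_g ** 2 + c1) * (var_p + var_g + c2) + eps)
    return (1.0 - ssim).mean()

def hybrid_loss(pred, gt):
    # Pixel-level BCE + patch-level SSIM + map-level IoU, equally weighted here.
    return F.binary_cross_entropy(pred, gt) + ssim_loss(pred, gt) + iou_loss(pred, gt)
```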
Detecting and segmenting salient objects from natural scenes, often referred to as salient object detection, has attracted great interest in computer vision, and addressing the challenge posed by complex backgrounds is crucial for advancing the field. This article proposes a novel deep learning architecture called SODU2-NET (Salient Object Detection U2-Net), built on the U-NET base structure. The model addresses a gap in previous work on complex backgrounds by employing background-subtraction techniques and a densely supervised encoder-decoder network that can discern relevant foreground information. First, an enriched encoder block combines full feature fusion (FFF) with atrous spatial pyramid pooling (ASPP) at varying dilation rates to efficiently capture multi-scale contextual information, improving detection against complex backgrounds and reducing information loss during down-sampling. Second, an attention module that refines the decoder is constructed to enhance detection in complex backgrounds by selectively focusing on relevant features, allowing the model to reconstruct detailed and contextually relevant information that is essential for accurately determining salient objects. Finally, a residual block is added at the encoder end, responsible for both saliency prediction and map refinement. The network is designed to learn the transformation between input images and ground truth, enabling accurate segmentation of salient object regions with clear borders and accurate prediction of fine structures. SODU2-NET shows superior performance on five public datasets (DUTS, SOD, DUT-OMRON, HKU-IS, PASCAL-S) and a new real-world dataset, the Changsha dataset. In a comparative assessment against FCN, SqueezeNet, DeepLab, and Mask R-CNN, the proposed SODU2-NET improves precision by 6%, recall by 5%, and accuracy by 3%. Overall, the approach shows promise for improving the accuracy and efficiency of salient object detection in a variety of settings.
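The ASPP component referenced above can be illustrated with a small PyTorch module that runs parallel dilated convolutions and fuses their outputs. The dilation rates `(1, 6, 12, 18)` and the channel handling are assumptions; the FFF and attention components of SODU2-NET are not reproduced here.

```python
import torch
import torch.nn as nn

class ASPP(nn.Module):
    """Atrous spatial pyramid pooling: parallel 3x3 convolutions with different
    dilation rates capture multi-scale context without losing resolution."""
    def __init__(self, in_ch, out_ch, rates=(1, 6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(in_ch, out_ch, 3, padding=r, dilation=r, bias=False),
                nn.BatchNorm2d(out_ch),
                nn.ReLU(inplace=True))
            for r in rates])
        # 1x1 projection back to out_ch after concatenating all branches.
        self.project = nn.Conv2d(out_ch * len(rates), out_ch, 1)

    def forward(self, x):
        return self.project(torch.cat([b(x) for b in self.branches], dim=1))
```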
Salient object detection aims at locating the most conspicuous objects in natural images, which usually acts as a very important pre-processing procedure in many computer vision tasks. In this paper, we propose a simple yet effective Hierarchical U-shape Attention Network (HUAN) to learn a robust mapping function for salient object detection. Firstly, a novel attention mechanism is formulated to improve the well-known U-shape network, in which the memory consumption can be extensively reduced and the mask quality can be significantly improved by the resulting U-shape Attention Network (UAN). Secondly, a novel hierarchical structure is constructed to well bridge the low-level and high-level feature representations between different UANs, in which both the intra-network and inter-network connections are considered to explore the salient patterns from a local to global view. Thirdly, a novel Mask Fusion Network (MFN) is designed to fuse the intermediate prediction results, so as to generate a salient mask which is in higher-quality than any of those inputs. Our HUAN can be trained together with any backbone network in an end-to-end manner, and high-quality masks can be finally learned to represent the salient objects. Extensive experimental results on several benchmark datasets show that our method significantly outperforms most of the state-of-the-art approaches.
DHSNet adopts the whole image as the computational unit and propagates global context information to local contexts hierarchically and progressively.
Recently, fully convolutional networks (FCNs) have made great progress in the task of salient object detection and existing state-of-the-arts methods mainly focus on how to integrate edge information in deep aggregation models. In this paper, we propose a novel Decomposition and Completion Network (DCN), which integrates edge and skeleton as complementary information and models the integrity of salient objects in two stages. In the decomposition network, we propose a cross multi-branch decoder, which iteratively takes advantage of cross-task aggregation and cross-layer aggregation to integrate multi-level multi-task features and predict saliency, edge, and skeleton maps simultaneously. In the completion network, edge and skeleton maps are further utilized to fill flaws and suppress noises in saliency maps via hierarchical structure-aware feature learning and multi-scale feature completion. Through jointly learning with edge and skeleton information for localizing boundaries and interiors of salient objects respectively, the proposed network generates precise saliency maps with uniformly and completely segmented salient objects. Experiments conducted on five benchmark datasets demonstrate that the proposed model outperforms existing networks. Furthermore, we extend the proposed model to the task of RGB-D salient object detection, and it also achieves state-of-the-art performance. The code is available at https://github.com/wuzhe71/DCN.
To overcome these challenges, the authors propose CMAD-UNet, a UNet-based RGB-D salient object detection model presented as an architectural redesign built upon prior work.
We have witnessed a growing interest in video salient object detection (VSOD) techniques in today’s computer vision applications. In contrast with temporal information (which is still considered a rather unstable source thus far), the spatial information is more stable and ubiquitous, thus it could influence our vision system more. As a result, the current main-stream VSOD approaches have inferred and obtained their saliency primarily from the spatial perspective, still treating temporal information as subordinate. Although the aforementioned methodology of focusing on the spatial aspect is effective in achieving a numeric performance gain, it still has two critical limitations. First, to ensure the dominance by the spatial information, its temporal counterpart remains inadequately used, though in some complex video scenes, the temporal information may represent the only reliable data source, which is critical to derive the correct VSOD. Second, both spatial and temporal saliency cues are often computed independently in advance and then integrated later on, while the interactions between them are omitted completely, resulting in saliency cues with limited quality. To combat these challenges, this paper advocates a novel spatiotemporal network, where the key innovation is the design of its temporal unit. Compared with other existing competitors (e.g., convLSTM), the proposed temporal unit exhibits an extremely lightweight design that does not degrade its strong ability to sense temporal information. Furthermore, it fully enables the computation of temporal saliency cues that interact with their spatial counterparts, ultimately boosting the overall VSOD performance and realizing its full potential towards mutual performance improvement for each. The proposed method is easy to implement yet still effective, achieving high-quality VSOD at 50 FPS in real-time applications.
The human visual system can rapidly focus on prominent objects in complex scenes, significantly enhancing information processing efficiency. Salient object detection (SOD) mimics this biological ability, aiming to identify and segment the most prominent regions or objects in images or videos. This reduces the amount of data needed to process while enhancing the accuracy and efficiency of information extraction. In recent years, SOD has made significant progress in many areas such as deep learning, multi-modal fusion, and attention mechanisms. Additionally, it has expanded in real-time detection, weakly supervised learning, and cross-domain applications. Depth images can provide three-dimensional structural information of a scene, aiding in a more accurate understanding of object shapes and distances. In SOD tasks, depth images enhance detection accuracy and robustness by providing additional geometric information. This additional information is particularly crucial in complex scenes and occlusion situations. This survey reviews the substantial advancements in the field of RGB-Depth SOD, with a focus on the critical roles played by attention mechanisms and cross-modal fusion methods. It summarizes the existing literature, provides a brief overview of mainstream datasets and evaluation metrics, and quantitatively compares the discussed models.
Detection and localization of regions of images that attract immediate human visual attention is currently an intensive area of research in computer vision. The capability of automatic identification and segmentation of such salient image regions has immediate consequences for applications in the field of computer vision, computer graphics, and multimedia. A large number of salient object detection (SOD) methods have been devised to effectively mimic the capability of the human visual system to detect the salient regions in images. These methods can be broadly categorized into two categories based on their feature engineering mechanism: conventional or deep learning-based. In this survey, most of the influential advances in image-based SOD from both conventional as well as deep learning-based categories have been reviewed in detail. Relevant saliency modeling trends with key issues, core techniques, and the scope for future research work have been discussed in the context of difficulties often faced in salient object detection. Results are presented for various challenging cases for some large-scale public datasets. Different metrics considered for assessment of the performance of state-of-the-art salient object detection models are also covered. Some future directions for SOD are presented towards end.
Recently, deep learning-based methods, especially utilizing fully convolutional neural networks, have shown extraordinary performance in salient object detection. Despite its success, the clean boundary detection of the saliency objects is still a challenging task. Most of the contemporary methods focus on exclusive edge detection modules in order to avoid noisy boundaries. In this work, we propose leveraging on the extraction of finer semantic features from multiple encoding layers and attentively re-utilize it in the generation of the final segmentation result. The proposed Revise-Net model is divided into three parts: (a) the prediction module, (b) a residual enhancement module, and (c) reverse attention modules. Firstly, we generate the coarse saliency map through the prediction modules, which are fine-tuned in the enhancement module. Finally, multiple reverse attention modules at varying scales are cascaded between the two networks to guide the prediction module by employing the intermediate segmentation maps generated at each downsampling level of the REM. Our method efficiently classifies the boundary pixels using a combination of binary cross-entropy, similarity index, and intersection over union losses at the pixel, patch, and map levels, thereby effectively segmenting the saliency objects in an image. In comparison with several state-of-the-art frameworks, our proposed Revise-Net model outperforms them with a significant margin on three publicly available datasets, DUTS-TE, ECSSD, and HKU-IS, both on regional and boundary estimation measures.
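The reverse-attention step can be sketched as weighting encoder features by the complement of an upsampled coarse prediction, so the refinement branch concentrates on regions the coarse map missed. The module layout below is a generic sketch under those assumptions, not Revise-Net's exact reverse attention module.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ReverseAttention(nn.Module):
    """Refines a coarse saliency logit using features weighted by its complement."""
    def __init__(self, in_ch, mid_ch):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_ch, mid_ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(mid_ch, 1, 3, padding=1))

    def forward(self, feat, coarse_pred):
        # coarse_pred: (B, 1, h, w) logits from the prediction module.
        coarse = F.interpolate(coarse_pred, size=feat.shape[2:], mode='bilinear',
                               align_corners=False)
        reverse = 1.0 - torch.sigmoid(coarse)   # emphasize regions the coarse map missed
        residual = self.conv(feat * reverse)    # learn a correction from those regions
        return residual + coarse                # refined logits
```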
Fully convolutional neural networks (FCNs) have shown their advantages in the salient object detection task. However, most existing FCN-based methods still suffer from coarse object boundaries. In this paper, to solve this problem, we focus on the complementarity between salient edge information and salient object information. Accordingly, we present an edge guidance network (EGNet) for salient object detection with three steps to simultaneously model these two kinds of complementary information in a single network. In the first step, we extract the salient object features in a progressive fusion manner. In the second step, we integrate the local edge information and global location information to obtain the salient edge features. Finally, to sufficiently leverage these complementary features, we couple the same salient edge features with salient object features at various resolutions. Benefiting from the rich edge information and location information in salient edge features, the fused features can help locate salient objects, especially their boundaries, more accurately. Experimental results demonstrate that the proposed method performs favorably against the state-of-the-art methods on six widely used datasets without any pre-processing and post-processing. The source code is available at http://mmcheng.net/egnet/.
Salient object detection is a fundamental computer vision task. The majority of existing algorithms focus on aggregating multi-level features of pre-trained convolutional neural networks. Moreover, some researchers attempt to utilize edge information for auxiliary training. However, existing edge-aware models design unidirectional frameworks which only use edge features to improve the segmentation features. Motivated by the logical interrelations between binary segmentation and edge maps, we propose a novel Stacked Cross Refinement Network (SCRN) for salient object detection in this paper. Our framework aims to simultaneously refine multi-level features of salient object detection and edge detection by stacking Cross Refinement Unit (CRU). According to the logical interrelations, the CRU designs two direction-specific integration operations, and bidirectionally passes messages between the two tasks. Incorporating the refined edge-preserving features with the typical U-Net, our model detects salient objects accurately. Extensive experiments conducted on six benchmark datasets demonstrate that our method outperforms existing state-of-the-art algorithms in both accuracy and efficiency. Besides, the attribute-based performance on the SOC dataset show that the proposed model ranks first in the majority of challenging scenes. Code can be found at https://github.com/wuzhe71/SCAN.
Human eye fixations often correlate with locations of salient objects in the scene. However, only a handful of approaches have attempted to simultaneously address the related aspects of eye fixations and object saliency. In this work, we propose a deep convolutional neural network (CNN) capable of predicting eye fixations and segmenting salient objects in a unified framework. We design the initial network layers, shared between both the tasks, such that they capture the object level semantics and the global contextual aspects of saliency, while the deeper layers of the network address task specific aspects. In addition, our network captures saliency at multiple scales via inception-style convolution blocks. Our network shows a significant improvement over the current state-of-the-art for both eye fixation prediction and salient object segmentation across a number of challenging datasets.
Convolutional neural networks (CNNs) have significantly advanced computational modelling for saliency prediction. However, accurately simulating the mechanisms of visual attention in the human cortex remains an academic challenge. It is critical to integrate properties of human vision into the design of CNN architectures, leading to perceptually more relevant saliency prediction. Due to the inherent inductive biases of CNN architectures, there is a lack of sufficient long-range contextual encoding capacity. This hinders CNN-based saliency models from capturing properties that emulate the viewing behaviour of humans. Transformers have shown great potential in encoding long-range information by leveraging the self-attention mechanism. In this paper, we propose a novel saliency model that integrates transformer components into CNNs to capture long-range contextual visual information. Experimental results show that the transformers provide added value to saliency prediction, enhancing its perceptual relevance. Our proposed saliency model using transformers has achieved superior results on public benchmarks and competitions for saliency prediction models. The source code of our proposed saliency model TranSalNet is available at: https://github.com/LJOVO/TranSalNet
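A common way to realize this CNN-plus-transformer hybrid is to flatten a CNN feature map into tokens and run a few self-attention layers over them. The sketch below follows that generic pattern; the module name `TransformerContext`, the head count, and the layer count are assumptions, not TranSalNet's exact block design. The channel dimension must be divisible by the number of heads.

```python
import torch
import torch.nn as nn

class TransformerContext(nn.Module):
    """Adds long-range context to a CNN feature map via self-attention over its pixels."""
    def __init__(self, ch, heads=8, layers=2):
        super().__init__()
        encoder_layer = nn.TransformerEncoderLayer(d_model=ch, nhead=heads,
                                                   batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=layers)

    def forward(self, feat):
        b, c, h, w = feat.shape
        tokens = feat.flatten(2).transpose(1, 2)     # (B, H*W, C): one token per location
        tokens = self.encoder(tokens)                # global self-attention
        return tokens.transpose(1, 2).reshape(b, c, h, w)
```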
The model builds on the UNETR architecture, an extension of the UNET, and is trained on the difference between the predicted saliency map P and the ground-truth saliency map G.
The prediction of salient areas in images has traditionally been addressed with hand-crafted features based on neuroscience principles. This paper, however, addresses the problem with a completely data-driven approach by training a convolutional neural network (convnet). The learning process is formulated as a minimization of a loss function that measures the Euclidean distance of the predicted saliency map with the provided ground truth. The recent publication of large datasets of saliency prediction has provided enough data to train end-to-end architectures that are both fast and accurate. Two designs are proposed: a shallow convnet trained from scratch, and another, deeper solution whose first three layers are adapted from another network trained for classification. To the authors' knowledge, these are the first end-to-end CNNs trained and tested for the purpose of saliency prediction.
Subway systems serve as the lifelines of modern cities, making the safe operation of urban rail transit systems critically important. Among various threats, waterlogging disasters—caused by extreme weather, equipment failures, or external pipeline ruptures — are among the most severe risks to subway systems. Flooding can lead to short circuits and paralysis of high-voltage electrical and signal systems, pose significant challenges to large-scale passenger evacuation, and potentially paralyze the entire urban transportation network, resulting in immeasurable losses. Therefore, accurate identification and monitoring of waterlogging disasters are crucial for early warning and emergency response in subway systems. However, existing deep learning models often struggle when applied to real-world subway environments. These settings are characterized by unique challenges, such as frequent train movements, passenger flows, dynamic advertising displays, and complex metallic reflections. These factors can interfere with the model's perception, making it difficult to accurately focus on actual water bodies and leading to missed or false detections. To address these challenges, this paper proposes a novel saliency-aware segmentation network, SA-UNet. The core idea of SA-UNet is to endow the model with the ability to actively perceive "salient" regions within an image. By introducing a dedicated Dynamic Context Attention (DCA) module, the network generates an implicit saliency map at each stage of feature extraction. This mechanism allows the network to concentrate computational resources on the features most relevant to water bodies while suppressing background interference.
Existing state-of-the-art saliency detection methods heavily rely on CNN-based architectures. Alternatively, we rethink this task from a convolution-free sequence-to-sequence perspective and predict saliency by modeling long-range dependencies, which can not be achieved by convolution. Specifically, we develop a novel unified model based on a pure transformer, namely, Visual Saliency Transformer (VST), for both RGB and RGB-D salient object detection (SOD). It takes image patches as inputs and leverages the transformer to propagate global contexts among image patches. Unlike conventional architectures used in Vision Transformer (ViT), we leverage multi-level token fusion and propose a new token upsampling method under the transformer framework to get high-resolution detection results. We also develop a token-based multi-task decoder to simultaneously perform saliency and boundary detection by introducing task-related tokens and a novel patch-task-attention mechanism. Experimental results show that our model outperforms existing methods on both RGB and RGB-D SOD benchmark datasets. Most importantly, our whole framework not only provides a new perspective for the SOD field but also shows a new paradigm for transformer-based dense prediction models. Code is available at https://github.com/nnizhang/VST.
By recording the whole scene around the capturer, virtual reality (VR) techniques can provide viewers the sense of presence. To provide a satisfactory quality of experience, there should be at least 60 pixels per degree, so the resolution of panoramas should reach 21600 × 10800. The huge amount of data will put great demands on data processing and transmission. However, when exploring in the virtual environment, viewers only perceive the content in the current field of view (FOV). Therefore if we can predict the head and eye movements which are important behaviors of viewer, more processing resources can be allocated to the active FOV. But conventional saliency prediction methods are not fully adequate for panoramic images. In this paper, a new panorama-oriented model, to predict head and eye movements, is proposed. Due to the superiority of computation in the spherical domain, the spherical harmonics are employed to extract features at different frequency bands and orientations. Related low- and high-level features including the rare components in the frequency domain and color domain, the difference between center vision and peripheral vision, visual equilibrium, person and car detection, and equator bias are extracted to estimate the saliency. To predict head movements, visual mechanisms including visual uncertainty and equilibrium are incorporated, and the graphical model and functional representation for the switch of head orientation are established. Extensive experimental results on the publicly available database demonstrate the effectiveness of our methods.
In this paper, a center prior-based saliency detection method is proposed, which combines the intrinsic cue of salient objects with the deep learning framework U-Net.
Salient Object Detection (SOD) aims to identify the most attention-grabbing regions in an image and focuses on distinguishing salient objects from their backgrounds. Current SOD methods primarily use a discriminative approach, which works well for clear images but struggles in complex scenes with similar colors and textures between objects and backgrounds. To address these limitations, we introduce the diffusion-based salient object detection model (DiffSOD), which leverages a noise-to-image denoising process within a diffusion framework, enhancing saliency detection in both RGB and RGB-D images. Unlike conventional fusion-based SOD methods that directly merge RGB and depth information, we treat RGB and depth as distinct conditions, i.e., the appearance condition and the structure condition, respectively. These conditions serve as controls within the diffusion UNet architecture, guiding the denoising process. To facilitate this guidance, we employ two specialized control adapters: the appearance control adapter and the structure control adapter. Moreover, conventional denoising UNet models may struggle when handling low-quality depth maps, potentially introducing detrimental cues into the denoising process. To mitigate the impact of low-quality depth maps, we introduce a quality-aware filter. This filter selectively processes only high-quality depth data, ensuring that the denoising process is based on reliable information. Comparative evaluations on benchmark datasets have shown that DiffSOD substantially surpasses existing RGB and RGB-D saliency detection methods, improving average performance by 1.5% and 1.2% respectively, thus setting a new benchmark for diffusion-based dense prediction models in visual saliency detection.
Salient object detection has been widely used in computer vision tasks such as image understanding, semantic segmentation, and target tracking by mimicking the human visual system to find the most visually conspicuous object. The U2Net model performs well in salient object detection (SOD) thanks to its unique U-shaped residual structure and its U-shaped backbone, which incorporate feature information at different scales. However, in the U-shaped structure, the global semantic information computed at the topmost layer may be gradually diluted by the large amount of local information along the top-down path, and the residual U-block (RSU) pays insufficient attention to features in the salient target region while passing redundant features to the next stage. To address these two shortcomings, this paper improves U2Net in two ways: an attentional gating mechanism is added to filter redundant features in the U-structure backbone, and a channel attention (CA) mechanism is introduced to capture important features in the RSU module. Experimental results show that the proposed method achieves higher accuracy than the U2Net model.
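The two ingredients mentioned above, channel attention inside the RSU block and an attention gate on the U-structure skip connections, can be sketched as follows. Both modules are generic SE-style and additive-gate designs and may differ from Att-U2Net's exact formulation.

```python
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    """SE-style channel attention: re-weights channels by globally pooled statistics."""
    def __init__(self, ch, reduction=16):
        super().__init__()
        self.fc = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(ch, ch // reduction, 1), nn.ReLU(inplace=True),
            nn.Conv2d(ch // reduction, ch, 1), nn.Sigmoid())

    def forward(self, x):
        return x * self.fc(x)

class AttentionGate(nn.Module):
    """Additive attention gate: filters skip features with a deeper gating signal."""
    def __init__(self, skip_ch, gate_ch, inter_ch):
        super().__init__()
        self.w_x = nn.Conv2d(skip_ch, inter_ch, 1)
        self.w_g = nn.Conv2d(gate_ch, inter_ch, 1)
        self.psi = nn.Sequential(nn.ReLU(inplace=True),
                                 nn.Conv2d(inter_ch, 1, 1), nn.Sigmoid())

    def forward(self, skip, gate):
        # gate is assumed to be already resized to skip's spatial size.
        return skip * self.psi(self.w_x(skip) + self.w_g(gate))
```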
Image analysis tasks use salient object detection because it not only identifies the important elements of a visual scene but also reduces computational complexity by discarding unimportant ones. In this research, we propose a salient object detection method based on a deep learning network that preserves mid- and low-level image information. Using a deep learning model, our technique first generates a coarse saliency map for the entire target image; the map is then refined using low- to mid-level information specific to the image. We adopt a U-Net as our architecture, so the saliency map is predicted pixel by pixel, reducing the loss of low-level visual information. Our results show that the system consistently outperforms other salient object detection approaches, yielding superior precision and recall.
Integration of multi-level contextual information, such as feature maps and side outputs, is crucial for Convolutional Neural Networks (CNNs)-based salient object detection. However, most existing methods either simply concatenate multi-level feature maps or calculate element-wise addition of multi-level side outputs, thus failing to take full advantage of them. In this paper, we propose a new strategy for guiding multi-level contextual information integration, where feature maps and side outputs across layers are fully engaged. Specifically, shallower-level feature maps are guided by the deeper-level side outputs to learn more accurate properties of the salient object. In turn, the deeper-level side outputs can be propagated to high-resolution versions with spatial details complemented by means of shallower-level feature maps. Moreover, a group convolution module is proposed with the aim of achieving highly discriminative feature maps, in which the backbone feature maps are divided into a number of groups and then the convolution is applied to the channels of backbone feature maps within each group. Eventually, the group convolution module is incorporated in the guidance module to further promote the guidance role. Experiments on three public benchmark datasets verify the effectiveness and superiority of the proposed method over the state-of-the-art methods.
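A rough sketch of the guidance idea, a deeper side output re-weighting a shallower feature map before a grouped convolution, is given below. The module name, the residual form of the guidance, and the group count are assumptions rather than the paper's exact design.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GuidedGroupConv(nn.Module):
    """Modulates a shallow feature map with a deeper side output, then applies
    a grouped convolution over the guided feature."""
    def __init__(self, ch, groups=4):
        super().__init__()
        # ch must be divisible by groups.
        self.group_conv = nn.Conv2d(ch, ch, 3, padding=1, groups=groups)

    def forward(self, shallow_feat, deep_side_out):
        # deep_side_out: (B, 1, h, w) saliency logits from a deeper layer.
        guide = torch.sigmoid(F.interpolate(deep_side_out, size=shallow_feat.shape[2:],
                                            mode='bilinear', align_corners=False))
        guided = shallow_feat * guide + shallow_feat   # emphasize likely-salient regions
        return self.group_conv(guided)
```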
Aiming at discovering and locating most distinctive objects from visual scenes, salient object detection (SOD) plays an essential role in various computer vision systems. Coming to the era of high resolution, SOD methods are facing new challenges. The major limitation of previous methods is that they try to identify the salient regions and estimate the accurate objects boundaries simultaneously with a single regression task at low-resolution. This practice ignores the inherent difference between the two difficult problems, resulting in poor detection quality. In this paper, we propose a novel deep learning framework for high-resolution SOD task, which disentangles the task into a low-resolution saliency classification network (LRSCN) and a high-resolution refinement network (HRRN). As a pixel-wise classification task, LRSCN is designed to capture sufficient semantics at low-resolution to identify the definite salient, background and uncertain image regions. HRRN is a regression task, which aims at accurately refining the saliency value of pixels in the uncertain region to preserve a clear object boundary at high-resolution with limited GPU memory. It is worth noting that by introducing uncertainty into the training process, our HRRN can well address the high-resolution refinement task without using any high-resolution training data. Extensive experiments on high-resolution saliency datasets as well as some widely used saliency benchmarks show that the proposed method achieves superior performance compared to the state-of-the-art methods.
In this paper, we propose two novel components for improving deep salient object detection models. The first component, called saliency detection network (S-Net), introduces dense short- and long-range connections that effectively integrate multiscale features to better exploit contexts at multiple levels. Benefiting from the direct access to low- and high-level features, the S-Net can not only exploit the object context but also preserve the object boundary sharply, leading to enhanced saliency detection performance. Second, a distraction detection network (D-Net) is developed to learn to diagnose which regions of an input image are distracting and harmful for saliency prediction of the S-Net. With such distraction diagnosis, the regions that are distracting to S-Net are removed in hindsight from the input image and the resulted distraction-free image is fed to S-Net for saliency prediction. To train the D-Net, a distraction mining approach is proposed to localize the model-specific distracting regions through examining the sensitiveness of the S-Net to image regions in a principled manner. Besides, the distraction mining approach also provides a way to interpret decisions made by deep neural network (DNN) saliency detection models, which relieves the black-box issues of DNNs to some extent. Extensive experiments on seven popular benchmark datasets demonstrate the effectiveness of the combined S-Net and D-Net, which provides new state of the arts.
Underwater salient object detection (USOD) has attracted increasing research attention. This work proposes an improved UNet model incorporating multi-scale features and attention modules.
Visual saliency is a fundamental problem in both cognitive and computational sciences, including computer vision. In this paper, we discover that a high-quality visual saliency model can be learned from multiscale features extracted using deep convolutional neural networks (CNNs), which have had many successes in visual recognition tasks. For learning such saliency models, we introduce a neural network architecture, which has fully connected layers on top of CNNs responsible for feature extraction at three different scales. We then propose a refinement method to enhance the spatial coherence of our saliency results. Finally, aggregating multiple saliency maps computed for different levels of image segmentation can further boost the performance, yielding saliency maps better than those generated from a single segmentation. To promote further research and evaluation of visual saliency models, we also construct a new large database of 4447 challenging images and their pixelwise saliency annotations. Experimental results demonstrate that our proposed method is capable of achieving state-of-the-art performance on all public benchmarks, improving the F-Measure by 5.0% and 13.2% respectively on the MSRA-B dataset and our new dataset (HKU-IS), and lowering the mean absolute error by 5.7% and 35.1% respectively on these two datasets.
Recently, video saliency prediction has attracted increasing attention, yet the improvement of its accuracy is still subject to the insufficient use of multi-scale spatiotemporal features. To address this issue, we propose a 3D convolutional Multi-scale Spatiotemporal Feature Fusion Network (MSFF-Net) to achieve the full utilization of spatiotemporal features. Specifically, we propose a Bi-directional Temporal-Spatial Feature Pyramid (BiTSFP), the first application of bi-directional fusion architectures in this field, which adds the flow of shallow location information on the basis of the previous flow of deep semantic information. Then, different from simple addition and concatenation, we design an Attention-Guided Fusion (AGF) mechanism that can adaptively learn the fusion weights of adjacent features to integrate them appropriately. Moreover, a Frame-wise Attention (FA) module is introduced to selectively emphasize the useful frames, augmenting the multi-scale temporal features to be fused. Our model is simple but effective, and it can run in real-time. Experimental results on the DHF1K, Hollywood-2, and UCF-sports datasets demonstrate that the proposed MSFF-Net outperforms existing state-of-the-art methods in accuracy.
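The attention-guided fusion of adjacent features can be sketched as learning per-pixel softmax weights over the two inputs. The paper operates on 3D spatiotemporal features, so the 2D module below only illustrates the adaptive-weighting idea; the module name and layer sizes are assumptions.

```python
import torch
import torch.nn as nn

class AttentionGuidedFusion(nn.Module):
    """Learns per-location weights to blend two adjacent pyramid features."""
    def __init__(self, ch):
        super().__init__()
        self.weight = nn.Sequential(
            nn.Conv2d(2 * ch, 2, 3, padding=1),
            nn.Softmax(dim=1))    # two weights that sum to 1 at every location

    def forward(self, feat_a, feat_b):
        # feat_a, feat_b are assumed to share shape (B, C, H, W) after resampling.
        w = self.weight(torch.cat([feat_a, feat_b], dim=1))
        return w[:, 0:1] * feat_a + w[:, 1:2] * feat_b
```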
Fully convolutional network (FCN)-based semantic segmentation models have largely inspired recent work in salient object detection. However, the lack of context-information summarization can degrade the accuracy of the final saliency map, and the information loss from the down-sampling operations of FCN-based models leads to the loss of details, such as the edges of the salient object. In this paper, we propose a novel deep convolutional neural network (CNN) that introduces a spatial and channel-wise attention layer into a multi-scale encoder-decoder framework. The attention layer aligns the context information between the feature maps at different scales and the final prediction of the saliency map. In addition, a structure with multiple side-way outputs at different scales is designed to produce more accurate edge-preserving saliency maps by integrating saliency maps across scales. Experimental results demonstrate the effectiveness of the proposed model on several benchmark datasets, and additional experiments validate the potential of applying the trained saliency model to other object-driven vision tasks as an efficient preprocessing step.
Most cutting-edge video saliency prediction models rely on spatiotemporal features extracted by 3D convolutions due to its local contextual cues acquirement ability. However, the shortage of 3D convolutions is that it cannot effectively capture long-term spatiotemporal dependencies in videos. To address this limitation, we propose a novel Transformer-based Multi-scale Feature Integration Network (TMFI-Net) for video saliency prediction, where the proposed TMFI-Net consists of a semantic-guided encoder and a hierarchical decoder. Firstly, embarking on the Transformer-based multi-level spatiotemporal features, the semantic-guided encoder enhances the features by inserting the high-level feature into each level feature via a top-down pathway and a longitudinal connection, which endows the multi-level spatiotemporal features with rich contextual information. In this way, the features are steered to give more concerns to saliency regions. Secondly, the hierarchical decoder employs a multi-dimensional attention (MA) module to elevate features along channel, temporal, and spatial dimensions jointly. Successively, the hierarchical decoder deploys a progressive decoding block to conduct an initial saliency prediction, which provides a coarse localization of saliency regions. Lastly, considering the complementarity of different saliency predictions, we integrate all initial saliency prediction results into the final saliency map. Comprehensive experimental results on four video saliency datasets firmly demonstrate that our model achieves superior performance when compared with the state-of-the-art video saliency models. The code is available at https://github.com/wusonghe/TMFI-Net.
Saliency detection is a key research topic in computer vision. Through the visual perception areas of the brain, humans can quickly and accurately lock onto regions of interest in complex and changing scenes. Although existing saliency-detection methods achieve competent performance, they suffer from deficiencies such as unclear margins of salient objects and interference from background information in the saliency map. In this study, to address these defects, a multiscale cascaded attention network was designed based on ResNet34. Unlike the typical U-shaped encoding-decoding architecture, we devised a contextual feature extraction module to enhance high-level semantic feature extraction. Specifically, a multiscale cascade block (MCB) and a lightweight channel attention (CA) module were added between the encoding and decoding networks for optimization. To address the blurred-edge issue, which many previous approaches neglect, we adopted an edge-thinning module to carry out a deeper edge-thinning process on the output-layer image. The experimental results show that this method achieves competitive saliency-detection performance, with improved accuracy and recall compared with other representative methods.
Deep-learning based salient object detection methods achieve great progress. However, the variable scale and unknown category of salient objects are great challenges all the time. These are closely related to the utilization of multi-level and multi-scale features. In this paper, we propose the aggregate interaction modules to integrate the features from adjacent levels, in which less noise is introduced because of only using small up-/down-sampling rates. To obtain more efficient multi-scale features from the integrated features, the self-interaction modules are embedded in each decoder unit. Besides, the class imbalance issue caused by the scale variation weakens the effect of the binary cross entropy loss and results in the spatial inconsistency of the predictions. Therefore, we exploit the consistency-enhanced loss to highlight the fore-/back-ground difference and preserve the intra-class consistency. Experimental results on five benchmark datasets demonstrate that the proposed method without any post-processing performs favorably against 23 state-of-the-art approaches. The source code will be publicly available at https://github.com/lartpang/MINet.
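Reading the consistency-enhanced loss as a region-level term that compares whole foreground masks rather than individual pixels, a Dice-style sketch looks like the following. This is our interpretation of the idea, not necessarily the paper's exact definition.

```python
import torch

def consistency_enhanced_loss(pred, gt, eps=1e-6):
    """Region-level loss: penalizes the mismatch between predicted and ground-truth
    foreground as a whole, so spatially inconsistent predictions are punished even
    when the per-pixel BCE is small.

    pred: probabilities in [0, 1]; gt: binary ground truth; both (B, 1, H, W).
    """
    inter = (pred * gt).sum(dim=(1, 2, 3))
    total = pred.sum(dim=(1, 2, 3)) + gt.sum(dim=(1, 2, 3))
    # Numerator equals FP + FN, since FP + FN = |pred| + |gt| - 2 * |pred ∩ gt|.
    return ((total - 2.0 * inter) / (total + eps)).mean()
```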
Existing lightweight salient object detection (SOD) methods aim to solve the problem of high computational costs that is prevalent with heavyweight methods. However, compared with heavyweight methods, the detection accuracy of lightweight methods is greatly reduced while real-time performance is not significantly improved. Therefore, we aim to establish a trade off between computational cost and detection performance by improving the network efficiency. We propose a fast and extremely lightweight end-to-end wavelet neural network (ELWNet) for real-time salient object detection. ELWNet can achieve salient object detection and segmentation at approximately 70FPS (GPU), 19FPS (CPU) with 76K parameters and 0.38G FLOPs. We introduce wavelet transform theory into a neural network, proposing a wavelet transform module (WTM), a wavelet transform fusion module (WTFM), a novel feature residual mechanism, and construct an efficient architecture. The wavelet transform theory is integrated into the neural network to realize the interaction between the features in the frequency and the time domain. Meanwhile, ELWNet does not rely on a pre-trained model, which significantly reduces redundant features. We validate the performance of ELWNet using five well-known datasets, and demonstrate state-of-the-art performance compared with 24 other SOD models in terms of being lightweight, detection accuracy and real-time capabilities. Our method maintains high detection performance while reducing the number of model parameters by approximately 99% compared with heavyweight methods.
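The wavelet idea can be illustrated with a one-level 2D Haar decomposition, which splits a feature map into a half-resolution low-frequency band and three high-frequency bands. ELWNet's actual WTM/WTFM designs are more involved, so treat this purely as a sketch; it assumes even height and width.

```python
import torch
import torch.nn as nn

class HaarDWT(nn.Module):
    """One-level 2D Haar decomposition into a low-frequency band (LL) and three
    high-frequency bands (LH, HL, HH). Assumes even input height and width."""
    def forward(self, x):
        a = x[:, :, 0::2, 0::2]   # top-left pixel of each 2x2 block
        b = x[:, :, 0::2, 1::2]   # top-right
        c = x[:, :, 1::2, 0::2]   # bottom-left
        d = x[:, :, 1::2, 1::2]   # bottom-right
        ll = (a + b + c + d) / 2.0
        lh = (-a - b + c + d) / 2.0
        hl = (-a + b - c + d) / 2.0
        hh = (a - b - c + d) / 2.0
        return ll, torch.cat([lh, hl, hh], dim=1)
```

The low-frequency band provides a cheap, parameter-free downsampling path, while the high-frequency bands retain edge detail that is useful for sharp saliency boundaries.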
Salient object detection (SOD) has rapidly developed in recent years, and detection performance has greatly improved. However, the price of these improvements is increasingly complex networks that require more computing resources and sacrifice real-time performance. This makes it difficult to deploy these approaches on devices with limited computing resources (such as mobile phones, embedded platforms, etc.). Considering recently developed lightweight SOD models, their detection and real-time performance are always compromised in demanding practical application scenarios. To solve these problems, we propose a novel lightweight SOD method called LARNet and its corresponding extremely lightweight method LARNet$^{*}$ according to application requirements. These methods balance the relationship between lightweight requirements, detection accuracy and real-time performance. First, we propose a saliency backbone network tailored for SOD, which removes the need for pre-training with ImageNet and effectively reduces feature redundancy. Subsequently, we propose a novel context gating module (CGM), which simulates the physiological mechanism of human brain neurons and visual information processing, and realizes the deep fusion of multi-level features at the global level. Finally, the saliency map is output after fusion of multi-level features. Extensive experiments on popular benchmark datasets demonstrate that the proposed LARNet (LARNet$^{*}$) achieves 98 (113) FPS on a GPU and 3 (6) FPS on a CPU. With approximately 680 K (90 K) parameters, the model has significant performance advantages over (extremely) lightweight methods, even surpassing some heavyweight models.
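One plausible reading of the context gating idea above is a global gate that modulates fused multi-level features; the sketch below follows that reading and is not LARNet's published CGM.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ContextGate(nn.Module):
    """Illustrative context-gating style fusion of multi-level features."""
    def __init__(self, channels: int):
        super().__init__()
        self.gate = nn.Sequential(nn.AdaptiveAvgPool2d(1),    # global context
                                  nn.Conv2d(channels, channels, 1),
                                  nn.Sigmoid())

    def forward(self, feats):                       # feats: list of (B, C, Hi, Wi)
        size = feats[0].shape[-2:]                  # fuse at the finest resolution
        aligned = [F.interpolate(f, size=size, mode="bilinear", align_corners=False)
                   for f in feats]
        fused = torch.stack(aligned, dim=0).sum(dim=0)
        return fused * self.gate(fused)             # global gate suppresses irrelevant context
```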
We solve the problem of salient object detection by investigating how to expand the role of pooling in convolutional neural networks. Based on the U-shape architecture, we first build a global guidance module (GGM) upon the bottom-up pathway, aiming at providing layers at different feature levels the location information of potential salient objects. We further design a feature aggregation module (FAM) to make the coarse-level semantic information well fused with the fine-level features from the top-down pathway. By adding FAMs after the fusion operations in the top-down pathway, coarse-level features from the GGM can be seamlessly merged with features at various scales. These two pooling-based modules allow the high-level semantic features to be progressively refined, yielding detail-enriched saliency maps. Experimental results show that our proposed approach can more accurately locate salient objects with sharpened details and hence substantially improve the performance compared with the previous state of the art. Our approach is also fast and can run at a speed of more than 30 FPS when processing a 300×400 image. Code can be found at http://mmcheng.net/poolnet/.
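A hedged sketch of a pooling-based aggregation step in the spirit of the FAM described above: a coarse guidance feature is merged with a decoder feature and smoothed at several pooled scales. The pool sizes and channel widths are assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FeatureAggregation(nn.Module):
    """Merges a guidance feature into a decoder feature via multi-scale average pooling."""
    def __init__(self, channels: int, pool_sizes=(2, 4, 8)):
        super().__init__()
        self.pool_sizes = pool_sizes
        self.convs = nn.ModuleList(nn.Conv2d(channels, channels, 3, padding=1)
                                   for _ in pool_sizes)
        self.out_conv = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, decoder_feat, guidance_feat):
        size = decoder_feat.shape[-2:]
        x = decoder_feat + F.interpolate(guidance_feat, size=size,
                                         mode="bilinear", align_corners=False)
        fused = x
        for k, conv in zip(self.pool_sizes, self.convs):
            y = F.avg_pool2d(x, kernel_size=k, stride=k)          # coarse view at scale 1/k
            fused = fused + F.interpolate(conv(y), size=size,
                                          mode="bilinear", align_corners=False)
        return self.out_conv(fused)

if __name__ == "__main__":
    dec, guide = torch.randn(1, 128, 64, 64), torch.randn(1, 128, 8, 8)
    print(FeatureAggregation(128)(dec, guide).shape)              # torch.Size([1, 128, 64, 64])
```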
Salient object detection (SOD) has been widely used in practical applications, such as multisensor image fusion, remote sensing, and defect detection. Recently, SOD from RGB and thermal (T) images has developed rapidly due to its robustness in extreme situations, such as low illumination and occlusion. However, existing methods all utilize a dual-stream encoder, which significantly increases the computational burden and hinders real-world deployment. To this end, we propose a real-time One-stream Semantic-guided Refinement Network (OSRNet) for RGB-T SOD. Specifically, we first fuse the RGB and T inputs via concatenation, addition, and multiplication operations to mine the complementary information between the two modalities. This efficient early fusion not only facilitates information exchange between the modalities but also avoids the cumbersome dual-stream encoder structure. Then, a lightweight decoder is proposed, making the high-level semantic information filter the low-level noisy features and gradually refine the final prediction. Also, we apply deep supervision to make the training procedure more stable and fast. Due to the early fusion strategy, OSRNet can run at a real-time speed (53–60 fps) on a single GPU. Extensive quantitative and qualitative experiments show that our network outperforms 11 state-of-the-art methods in terms of seven evaluation metrics. Our codes have been released at https://github.com/huofushuo/OSRNet.
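The early fusion named above (concatenation, addition, and multiplication of the RGB and thermal inputs) is simple enough to sketch directly; the projection layers, the single-channel thermal input, and the channel width below are assumptions about how such a fusion could be wired, not OSRNet's exact layers.

```python
import torch
import torch.nn as nn

class EarlyFusion(nn.Module):
    """Fuses RGB and thermal inputs before the encoder (illustrative wiring)."""
    def __init__(self, out_channels: int = 64):
        super().__init__()
        self.rgb_proj = nn.Conv2d(3, out_channels, 3, padding=1)
        self.t_proj = nn.Conv2d(1, out_channels, 3, padding=1)   # assume single-channel thermal
        self.merge = nn.Conv2d(out_channels * 4, out_channels, 1)

    def forward(self, rgb: torch.Tensor, thermal: torch.Tensor) -> torch.Tensor:
        r, t = self.rgb_proj(rgb), self.t_proj(thermal)
        # concatenation of both modalities plus additive and multiplicative cues
        fused = torch.cat([r, t, r + t, r * t], dim=1)
        return self.merge(fused)

if __name__ == "__main__":
    out = EarlyFusion()(torch.randn(1, 3, 256, 256), torch.randn(1, 1, 256, 256))
    print(out.shape)                                              # torch.Size([1, 64, 256, 256])
```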
Object detection is significant for event analysis in various intelligent multimedia processing systems. Although there have been many studies conducting research in this area, effective and efficient object detection methods for video sequences are still much desired. In this article, we investigate salient object detection in real-time multimedia processing systems. Considering the intrinsic relationship between top-down and bottom-up saliency features, we present a new effective method for video salient object detection based on deep semantic and spatiotemporal cues. After extracting top-down semantic features for object perception by a 2-D convolutional network, we concatenate them with bottom-up spatiotemporal cues for motion perception extracted by a 3-D convolutional network. In order to combine these features effectively, we feed them into a 3-D deconvolutional network for feature-sharing learning between semantic features and spatiotemporal cues for the final saliency prediction. Additionally, we propose a novel Gaussian-like loss function with an $L_{2}$-norm regularization term for parameter learning. Experimental results show that the proposed salient object detection approach performs better in terms of both effectiveness and efficiency for video sequences compared with the state-of-the-art models.
… Then, the saliency residual blocks (SalRBs) composed of one residual unit with stride= 2 (ResUnitS2) and two ResUnitS1s are utilized to capture low-resolution features with higher …
In recent years, with the advancement of cloud computing technology and the availability of cheaper hardware, surveillance systems have become more and more common. Unfortunately, most existing systems still face many limitations, such as latency and real-time analysis issues. Edge computing effectively expands the boundaries of cloud computing by migrating some computing and analysis tasks to edge devices for execution; performing video analysis on edge devices may therefore be a good solution. In this paper, we adopt a collaborative Cloud-Edge architecture to analyze surveillance video and extract video keyframes for compressing video data at the edge. Then, we employ a residual U-Net neural network to perform salient object detection on the extracted keyframes. Finally, we utilize the deep reinforcement learning Asynchronous Advantage Actor-Critic (A3C) algorithm to schedule the residual U-Net tasks and adaptively offload them to the cloud or the edge, reducing system latency and improving real-time performance. We verified the system performance using real road surveillance videos and other public datasets. The experimental results are encouraging: they show that real-time processing of surveillance video based on the collaborative cloud-edge mechanism can obtain the optimal result within a tolerable latency range.
RGB-D Salient Object Detection (SOD) aims to segment the most prominent areas and objects in a given pair of RGB and depth images. Most current models adopt a dual-stream structure to extract information from both RGB and depth images. However, this leads to an exponential increase in the number of parameters and computations in the model. Moreover, the discrepancy between the RGB-pretrained encoder and the 3D geometric relationships in depth maps presents a challenge for the encoder in capturing spatial structural details. These issues impact the model's accuracy in locating salient objects and distinguishing edge details. To address these, we propose a novel early feature fusion network, named FasterSal, which enables more efficient RGB-D SOD. FasterSal uses a single-stream structure to receive RGB images and depth maps, extracting features based on the 3D geometric relationships in the depth map while fully leveraging the pretrained RGB encoder. This approach effectively avoids the inconsistencies between the depth modality and the RGB-pretrained encoder. It also significantly reduces the number of network parameters while maintaining efficient feature encoding capabilities. To achieve finer edge learning, a detail-aware loss and a texture enhancement module are introduced. These modules are designed to extract latent details in high-frequency component features and to enhance the edge-learning capability of the model using distance information. Experimental results on several benchmark datasets confirm the effectiveness and superiority of our method over state-of-the-art approaches, achieving a good balance between performance and speed with only 3.4 million parameters and a CPU speed of 63 FPS.
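A generic, hedged illustration of what "a single stream that leverages a pretrained RGB encoder" can mean in practice is to widen the stem convolution of a pretrained backbone to four input channels while keeping the RGB filters. This is not FasterSal's actual mechanism, only a common baseline for feeding an RGB-D pair into one encoder (using torchvision's resnet18 for concreteness).

```python
import torch
import torch.nn as nn
from torchvision.models import resnet18, ResNet18_Weights

def make_rgbd_backbone() -> nn.Module:
    net = resnet18(weights=ResNet18_Weights.DEFAULT)
    old = net.conv1                                   # pretrained 64 x 3 x 7 x 7 stem
    new = nn.Conv2d(4, old.out_channels, kernel_size=7, stride=2, padding=3, bias=False)
    with torch.no_grad():
        new.weight[:, :3] = old.weight                               # reuse RGB filters
        new.weight[:, 3:] = old.weight.mean(dim=1, keepdim=True)     # init depth channel
    net.conv1 = new
    return net

if __name__ == "__main__":
    rgbd = torch.cat([torch.randn(1, 3, 224, 224), torch.randn(1, 1, 224, 224)], dim=1)
    print(make_rgbd_backbone()(rgbd).shape)           # ImageNet logits; replace the head for SOD
```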
… Salient object detection in optical remote sensing images (RSI-SOD) aims to segment objects that attract human attention in optical RSIs. With the tremendous success of full …
Convolutional Neural Networks (CNNs) rely on content-independent convolution operations that extract features shared across the entire dataset, limiting their adaptability to individual inputs. In contrast, input-dependent architectures like Vision Transformers (ViTs) can adapt to the specific characteristics of each input. To enhance input adaptability in CNNs, we propose SODDCNet, an encoder-decoder architecture for Salient Object Detection (SOD) that employs large convolutions with dynamically generated weights via the self-attention mechanism. Additionally, unlike other CNN architectures, we utilize multiple large kernels in parallel to segment salient objects of various sizes. To pre-train the proposed model, we combine the COCO and OpenImages semantic segmentation datasets to create a 3.18M image dataset for SOD. Comprehensive quantitative experiments conducted on benchmark datasets demonstrate that SODDCNet performs competitively compared to state-of-the-art methods in SOD and Video SOD. The code and pre-computed saliency maps are provided here.
Salient object detection in video is a critical and active field that has drawn increasing attention among researchers. With the growth of dynamic video data, the performance of conventional object-detection methods on salient object detection has been degrading. The challenges include blurry moving targets, rapid object movement, and background occlusion or dynamic background changes behind foreground regions in video frames, all of which result in poor saliency detection. In this paper, we design a deep learning model to address these issues, using a novel framework that combines a Convolutional Neural Network (CNN) with a Recurrent Neural Network (RNN) for video saliency detection. The proposed method aims at developing a spatiotemporal model that exploits temporal, spatial, and local constraint cues to achieve global optimization. The task of finding the salient objects in benchmark dynamic video datasets is then carried out by capturing the temporal, spatial, and local constraint features with the Convolutional Recurrent Neural Network (CRNN). The CRNN is evaluated on benchmark datasets against conventional video salient object detection methods in terms of precision, F-measure, mean absolute error (MAE), and computational load. The experiments reveal that the CRNN model achieves better performance than other state-of-the-art saliency models in terms of increased speed and reduced computational load.
RGB-D salient object detection (SOD) recently has attracted increasing research interest and many deep learning methods based on encoder-decoder architectures have emerged. However, most existing RGB-D SOD models conduct feature fusion either in the single encoder or the decoder stage, which hardly guarantees sufficient cross-modal fusion ability. In this paper, we make the first attempt in addressing RGB-D SOD through 3D convolutional neural networks. The proposed model, named RD3D, aims at pre-fusion in the encoder stage and in-depth fusion in the decoder stage to effectively promote the full integration of RGB and depth streams. Specifically, RD3D first conducts pre-fusion across RGB and depth modalities through an inflated 3D encoder, and later provides in-depth feature fusion by designing a 3D decoder equipped with rich back-projection paths (RBPP) for leveraging the extensive aggregation ability of 3D convolutions. With such a progressive fusion strategy involving both the encoder and decoder, effective and thorough interaction between the two modalities can be exploited and boost the detection accuracy. Extensive experiments on six widely used benchmark datasets demonstrate that RD3D performs favorably against 14 state-of-the-art RGB-D SOD approaches in terms of four key evaluation metrics. Our code will be made publicly available: https://github.com/PPOLYpubki/RD3D.
Depth information has been widely used to improve RGB-D salient object detection by extracting attention maps to determine the position information of objects in an image. However, non-salient objects may be close to the depth sensor and present high pixel intensities in the depth maps. This situation inevitably leads to erroneous emphasis on non-salient areas and may have a negative impact on the saliency results. To mitigate this problem, we propose a hybrid attention neural network that fuses middle- and high-level RGB features with depth features to generate a hybrid attention map that removes background information. The proposed network extracts multilevel features from RGB images using the Res2Net architecture and then integrates high-level features from depth maps using the Inception-v4-ResNet2 architecture. The mixed high-level RGB and depth features generate the hybrid attention map, which is then multiplied with the low-level RGB features. After decoding by several convolutions and upsampling, we obtain the final saliency prediction, achieving state-of-the-art performance on the NJUD and NLPR datasets. Moreover, the proposed network has good generalization ability compared with other methods. An ablation study demonstrates that the proposed network effectively performs saliency prediction even when non-salient objects interfere with detection. In fact, after removing the branch with high-level RGB features, the RGB attention map that guides the network for saliency prediction is lost, and all the performance measures decline. The resulting prediction map from the ablation study shows the effect of non-salient objects close to the depth sensor; this effect is not present when using the complete hybrid attention network. Therefore, RGB information can correct and supplement depth information, and the corresponding hybrid attention map is more robust than a conventional attention map constructed only from depth information.
Depth data, which carry strong discriminative power about object location, have been proven beneficial for accurate saliency prediction. However, RGB-D saliency detection methods are also negatively influenced by randomly distributed erroneous or missing regions on the depth map or along the object boundaries. This offers the possibility of achieving more effective inference through well-designed models. In this paper, we propose a new framework for accurate RGB-D saliency detection that takes account of local and global complementarities from the two modalities. This is achieved by designing a complementary interaction model discriminative enough to simultaneously select useful representations from RGB and depth data while refining the object boundaries. Moreover, we propose a compensation-aware loss to further process the information not considered by the complementary interaction model, improving the generalization ability for challenging scenes. Experiments on six public datasets show that our method outperforms 18 state-of-the-art methods.
Complex backgrounds and similar appearances between objects and their surroundings are generally recognized as challenging scenarios in Salient Object Detection (SOD). This naturally leads to the incorporation of depth information in addition to the conventional RGB image as input, known as RGB-D SOD or depth-aware SOD. Meanwhile, this emerging line of research has been considerably hindered by the noise and ambiguity that prevail in raw depth images. To address the aforementioned issues, we propose a Depth Calibration and Fusion (DCF) framework that contains two novel components: 1) a learning strategy to calibrate the latent bias in the original depth maps towards boosting the SOD performance; 2) a simple yet effective cross reference module to fuse features from both RGB and depth modalities. Extensive empirical experiments demonstrate that the proposed approach achieves superior performance against 27 state-of-the-art methods. Moreover, our depth calibration strategy alone can work as a preprocessing step; empirically it results in noticeable improvements when being applied to existing cutting-edge RGB-D SOD models. Source code is available at https://github.com/jiwei0921/DCF.
… Besides, we utilize a U-Net [53] structure to construct the modality-specific decoder, where the skip connections between the encoder and decoder layers are used to combine …
… of salient object detection (SOD) methods in challenging scenes. However, existing methods are specially designed for RGB-D … interaction fusion framework for RGB-D and RGB-T SOD, …
RGB-depth (RGB-D) salient object detection (SOD) recently has attracted increasing research interest, and many deep learning methods based on encoder–decoder architectures have emerged. However, most existing RGB-D SOD models conduct explicit and controllable cross-modal feature fusion either in the single encoder or decoder stage, which hardly guarantees sufficient cross-modal fusion ability. To this end, we make the first attempt in addressing RGB-D SOD through 3-D convolutional neural networks. The proposed model, named RD3D, aims at prefusion in the encoder stage and in-depth fusion in the decoder stage to effectively promote the full integration of RGB and depth streams. Specifically, RD3D first conducts prefusion across RGB and depth modalities through a 3-D encoder obtained by inflating 2-D ResNet and later provides in-depth feature fusion by designing a 3-D decoder equipped with rich back-projection paths (RBPPs) for leveraging the extensive aggregation ability of 3-D convolutions. Toward an improved model RD3D+, we propose to disentangle the conventional 3-D convolution into successive spatial and temporal convolutions and, meanwhile, discard unnecessary zero padding. This eventually results in a 2-D convolutional equivalence that facilitates optimization and reduces parameters and computation costs. Thanks to such a progressive-fusion strategy involving both the encoder and the decoder, effective and thorough interactions between the two modalities can be exploited and boost detection accuracy. As an additional boost, we also introduce channel-modality attention and its variant after each path of RBPP to attend to important features. Extensive experiments on seven widely used benchmark datasets demonstrate that RD3D and RD3D+ perform favorably against 14 state-of-the-art RGB-D SOD approaches in terms of five key evaluation metrics. Our code will be made publicly available at https://github.com/PPOLYpubki/RD3D.
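The disentanglement of a 3-D convolution into successive spatial and temporal convolutions mentioned above can be sketched as a (2+1)-D style factorization; the kernel sizes, the dropped zero padding along the modality axis, and the missing normalization layers are simplifying assumptions rather than RD3D+'s exact design.

```python
import torch
import torch.nn as nn

class Factorised3dConv(nn.Module):
    """Replaces a full 3-D convolution with a spatial conv followed by a temporal conv."""
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        # 1 x 3 x 3: purely spatial mixing within each modality "slice"
        self.spatial = nn.Conv3d(in_ch, out_ch, kernel_size=(1, 3, 3), padding=(0, 1, 1))
        # 3 x 1 x 1: mixing across the modality/temporal axis, with no zero padding
        self.temporal = nn.Conv3d(out_ch, out_ch, kernel_size=(3, 1, 1), padding=0)

    def forward(self, x: torch.Tensor) -> torch.Tensor:      # x: (B, C, D, H, W)
        return self.temporal(torch.relu(self.spatial(x)))

if __name__ == "__main__":
    x = torch.randn(1, 64, 3, 56, 56)                        # D = 3 stacked modality slices
    print(Factorised3dConv(64, 64)(x).shape)                  # torch.Size([1, 64, 1, 56, 56])
```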
Salient object detection is a pixel-level dense prediction task that highlights the prominent object in a scene. Recently, the U-Net framework has been widely used, in which continuous convolution and pooling operations generate multi-level features that are complementary to each other. In view of the greater contribution of high-level features to performance, we propose a triplet transformer embedding module to enhance them by learning long-range dependencies across layers. It is the first to use three transformer encoders with shared weights to enhance multi-level features. By further designing a scale adjustment module to process the input, devising a three-stream decoder to process the output, and attaching depth features to color features for multi-modal fusion, the proposed triplet transformer embedding network (TriTransNet) achieves state-of-the-art performance in RGB-D salient object detection and pushes the performance to a new level. Experimental results demonstrate the effectiveness of the proposed modules and the competitiveness of TriTransNet.
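The weight-sharing idea above, one transformer enhancing several feature levels, can be sketched by applying a single shared transformer encoder to each flattened level; the token dimension, number of heads, and the residual connection are assumptions, not TriTransNet's exact configuration.

```python
import torch
import torch.nn as nn

class SharedTransformerEnhancer(nn.Module):
    """Applies one shared transformer encoder to several flattened feature levels."""
    def __init__(self, channels: int = 256, heads: int = 8, layers: int = 2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=channels, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=layers)   # shared weights

    def forward(self, feats):                                 # feats: list of (B, C, H, W)
        out = []
        for f in feats:
            b, c, h, w = f.shape
            tokens = f.flatten(2).transpose(1, 2)             # (B, H*W, C)
            enhanced = self.encoder(tokens)                   # same weights for every level
            out.append(enhanced.transpose(1, 2).reshape(b, c, h, w) + f)
        return out

if __name__ == "__main__":
    levels = [torch.randn(1, 256, s, s) for s in (28, 14, 7)]
    print([o.shape for o in SharedTransformerEnhancer()(levels)])
```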
Existing saliency detection models based on RGB colors only leverage appearance cues to detect salient objects. Depth information also plays a very important role in visual saliency detection and can supply complementary cues. Although many RGB-D saliency models have been proposed, they require depth data, which is expensive and not easy to acquire. In this paper, we propose to estimate depth information from monocular RGB images and leverage the intermediate depth features to enhance saliency detection performance in a deep neural network framework. Specifically, we first use an encoder network to extract common features from each RGB image and then build two decoder networks for depth estimation and saliency detection, respectively. The depth decoder features can be fused with the RGB saliency features to enhance their capability. Furthermore, we also propose a novel dense multiscale fusion model to densely fuse multiscale depth and RGB features based on the dense ASPP model. A new global context branch is also added to boost the multiscale features. Experimental results demonstrate that the added depth cues and the proposed fusion model both improve saliency detection performance. Finally, our model not only outperforms state-of-the-art RGB saliency models but also achieves comparable results to state-of-the-art RGB-D saliency models.
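The dense multiscale fusion above builds on atrous spatial pyramid pooling. A plain ASPP block is sketched below for reference; the dilation rates are typical defaults, and the dense cross-branch connections of the actual model are not reproduced.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ASPP(nn.Module):
    """Atrous spatial pyramid pooling: parallel dilated convs plus a global-context branch."""
    def __init__(self, in_ch: int, out_ch: int, rates=(1, 6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList(
            nn.Conv2d(in_ch, out_ch, 3, padding=r, dilation=r) for r in rates)
        self.global_branch = nn.Sequential(nn.AdaptiveAvgPool2d(1),
                                           nn.Conv2d(in_ch, out_ch, 1))   # image-level context
        self.project = nn.Conv2d(out_ch * (len(rates) + 1), out_ch, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        size = x.shape[-2:]
        feats = [branch(x) for branch in self.branches]
        g = F.interpolate(self.global_branch(x), size=size,
                          mode="bilinear", align_corners=False)
        return self.project(torch.cat(feats + [g], dim=1))

if __name__ == "__main__":
    print(ASPP(512, 256)(torch.randn(1, 512, 32, 32)).shape)   # torch.Size([1, 256, 32, 32])
```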
Salient object detection (SOD) is an important preprocessing operation for various computer vision tasks. Most existing RGB-D SOD models employ additive or concatenation-based strategies to directly aggregate and decode multi-scale features to predict saliency maps. However, due to the large differences between features at different scales, these aggregation strategies may lead to information loss or redundancy, and few methods explicitly consider how to establish connections between features at different scales in the decoding process, which consequently deteriorates detection performance. To this end, we propose a cascaded and aggregated Transformer Network (CATNet) which consists of three key modules, i.e., an attention feature enhancement module (AFEM), a cross-modal fusion module (CMFM), and a cascaded correction decoder (CCD). Specifically, the AFEM is designed on the basis of atrous spatial pyramid pooling to obtain multi-scale semantic information and global context information in high-level features through dilated convolution and a multi-head self-attention mechanism, enhancing high-level features. The role of the CMFM is to enhance and then fuse the RGB and depth features, alleviating the problem of poor-quality depth maps. The CCD is composed of two subdecoders in a cascading fashion. It is designed to suppress noise in low-level features and mitigate the differences between features at different scales. Moreover, the CCD uses a feedback mechanism to correct and repair the output of the subdecoder by exploiting supervised features, so that the information loss caused by the upsampling operation during multi-scale feature aggregation can be mitigated. Extensive experimental results demonstrate that the proposed CATNet achieves superior performance over 14 state-of-the-art RGB-D methods on 7 challenging benchmarks.
… resolution RGB-D saliency dataset, HiBo-UA, containing 1,515 RGB-D image pairs captured in real-life scenarios. To our best knowledge, this is the first high-resolution RGB-D saliency …
This survey reviews the broad application of U-Net and its variants in saliency detection. The research trends extend from basic architectural evolution to multi-modal fusion, high-precision edge preservation, Transformer-based global modeling, real-time lightweight design, and spatiotemporal analysis of video. These research directions complement one another and jointly push saliency detection toward industrial-grade applications with stronger robustness, finer detail recovery, and better real-time performance.