真实世界超分辨率—语义分割联合框架研究

刘晓; 王正勇; 何小海; 任超

您当前的位置：

首页 >

文章列表页 >

真实世界超分辨率—语义分割联合框架研究

更新时间：2024-05-10

- 真实世界超分辨率—语义分割联合框架研究
- A study of the joint framework for real-world super-resolution -semantic segmentation
- 新一代信息技术 2024年页码：1-6
- 作者机构：
  
  四川大学电子信息学院，四川成都610065
- 作者简介：
  
  刘晓（1998—），男，硕士研究生，主要研究方向为图像处理；
  王正勇（1969—），女，博士，副教授，主要研究方向为图像处理、智能系统设计；
  何小海（1964—），男，博士，教授，主要研究方向为图像处理与网络通信；
  任　超（1988—），男，博士，副教授，主要研究方向为图像处理、计算机视觉、人工智能、多媒体通信与信息系统等。
- 基金信息：
  
  国家自然科学基金项目(62171304);四川大学达州市校地合作项目(2022CDDZ-09)
- DOI：
  中图分类号： TN911.73
- 网络出版日期：2024-05-10，
扫描看全文
刘晓,王正勇,何小海等.真实世界超分辨率—语义分割联合框架研究[J].新一代信息技术,

LIU Xiao,Wang Zheng-yong,HE Xiao-hai,et al.A study of the joint framework for real-world super-resolution -semantic segmentation[J].New Generation of Information Technology,
刘晓,王正勇,何小海等.真实世界超分辨率—语义分割联合框架研究[J].新一代信息技术, DOI：10.3969/j.issn.2096-6091.XXXX.XX.001.

LIU Xiao,Wang Zheng-yong,HE Xiao-hai,et al.A study of the joint framework for real-world super-resolution -semantic segmentation[J].New Generation of Information Technology, DOI：10.3969/j.issn.2096-6091.XXXX.XX.001.

摘要

现有的语义分割方法在干净的图像上可以产生较好的结果，但是在干净图像上训练的分割模型应用到真实世界的图像会出现性能下降，因为训练域和测试域之间存在域间隙，从而降低分割的准确性。针对真实世界语义分割的问题，本文提出了一种超分辨率—语义分割联合框架，用于提升语义分割准确性。具体来说，所提出的框架嵌入了一个两分支网络，其中包括超分辨率分支、语义分割分支和一个特征共享模块。超分辨率任务鼓励网络找到对不同分辨率特征鲁棒的表示，从而分割头部可以使用恢复的“干净”特征进行更好的预测。其中超分辨率分支仅配置在训练过程中，在推理阶段可以丢弃。基于构建的伪真实配对数据集CityDeg进行监督训练，提出的框架联合现有先进的语义分割方法能够在不引入额外计算成本的情况下有效提高低分辨率场景语义分割性能。

Abstract

Existing semantic segmentation methods produce better results on clean images

but segmentation models trained on clean images applied to real-world images experience performance degradation because of the domain gap between the training and testing domains

which reduces the segmentation accuracy. To address the problem of real-world semantic segmentation

this paper proposes a joint super-resolution-semantic segmentation framework for improving semantic segmentation accuracy. Specifically

the proposed framework embeds a two-branch network that includes a super-resolution branch

a semantic segmentation branch

and a feature sharing module. The super-resolution task encourages the network to find a robust representation of features with different resolutions

so that the segmentation head can use the recovered "clean" features for better prediction. The super-resolution branch is configured only during training and can be discarded during the inference phase. Based on the constructed pseudo-real pairwise dataset CityDeg for supervised training

the proposed framework

together with the existing state-of-the-art semantic segmentation methods

is able to effectively improve the performance of semantic segmentation for low-resolution scenes without introducing additional computational cost.

关键词

超分辨率语义分割联合框架深度学习

Keywords

super-resolutionsemantic segmentationjoint frameworkdeep learning

references

Yu C, Gao C, Wang J, et al. Bisenet v2: Bilateral network with guided aggregation for real-time semantic segmentation[J]. International Journal of Computer Vision, 2021, 129: 3051-3068.

Fan M, Lai S, Huang J, et al. Rethinking bisenet for real-time semantic segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 9716-9725.

Xu J, Xiong Z, Bhattacharyya S P. PIDNet: A real-time semantic segmentation network inspired by PID controllers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023: 19529-19539.

Liu X, Shi X, Chen L, et al. Efficient Parallel Multi-Scale Detail and Semantic Encoding Network for Lightweight Semantic Segmentation[C]//Proceedings of the 31st ACM International Conference on Multimedia, 2023: 2544-2552.

Hu J, Chang M, Xu B, et al. ConvFormer: Vision Backbone Network Based on Transformer[J]. Acta Electronica Sinica, 2024, 52(1): 46-57.

Wei Y, Zhang Z, Zheng H, et al. Sginet: Toward sufficient interaction between single image deraining and semantic segmentation[C]//Proceedings of the 30th ACM International Conference on Multimedia, 2022: 6202-6210.

Chen W T, Chen I H, Yeh C Y, et al. Sjdl-vehicle: Semi-supervised joint defogging learning for foggy vehicle re-identification[C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2022, 36(1): 347-355.

Li Y, Chang Y, Yu C, et al. Close the loop: A unified bottom-up and top-down paradigm for joint image deraining and segmentation[C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2022, 36(2): 1438-1446.

Hashmi K A, Kallempudi G, Stricker D, et al. Featenhancer: Enhancing hierarchical features for object detection and beyond under low-light vision[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023: 6725-6735.

Hong Y, Wei K, Chen L, et al. Crafting Object Detection in Very Low Light[C]//Proceedings of the British Machine Vision Conference, 2021, 1(2): 3.

Wang X, Xie L, Dong C, et al. Real-esrgan: Training real-world blind super-resolution with pure synthetic data[C]//Proceedings of the IEEE/CVF international conference on computer vision, 2021: 1905-1914.

Liu X, Liao X, Shi X, et al. Efficient Information Modulation Network for Image Super Resolution[C]//Proceedings of the European Conference on Artificial Intelligence, 2023: 1544-1551.

浏览量

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

基于深度学习的微观驱替图像分类

基于改进CNN的药用植物叶片分类研究

基于轻量化VGG16和注意力机制的骨龄预测研究

文本辅助图像信息的行人重识别方法