hig.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
TransFusion: Transfer learning-driven adaptive fusion network for infrared and visible image
School of Automation Engineering, Shanghai University of Electric Power, Shanghai 200090, China.
School of Automation Engineering, Shanghai University of Electric Power, Shanghai 200090, China.
School of Automation Engineering, Shanghai University of Electric Power, Shanghai 200090, China.ORCID iD: 0000-0003-4143-828X
College of Computer Science, Laboratory of Aerial Information Probe and Intelligent Perception, Sichuan University, Chengdu 610065, China.
Show others and affiliations
2025 (English)In: Infrared physics & technology, ISSN 1350-4495, E-ISSN 1879-0275, Vol. 150, article id 105906Article in journal (Refereed) Published
Abstract [en]

The image fusion algorithm based on deep learning possesses strong feature extraction capabilities and generalization. However, due to the uninterpretability of features in deep learning, the design of fusion strategies becomes quite challenging. To address this issue, we propose a two-stage training feature adaptive fusion network based on the VGG-19 network. We introduce a parallel cross-modal channel perception module to achieve more targeted feature fusion by capturing channel differences between different modal domains. At the same time, in order to enhance the preservation of salient features, we designed a dynamic multi-level spatial attention guidance module that utilizes the saliency information of deep features from the source image to guide the adaptive fusion of shallow features. Additionally, we propose a double inner-loop feature mutual information loss that enforces the correlation of modal information, promoting efficient convergence of the perception module and guidance module. This method not only preserves the unique features of each modal domain but also effectively integrates information across modal domains, improving the quality of image fusion. Finally, we also perform objective and subjective experiments on MSRS and TNO datasets, and analyze the method. Experiments show that the proposed method achieves superior performance in image fusion tasks, and its potential value in practical applications is verified. The source code will be publicly available at https://github.com/YQ-097/TransFusion

Place, publisher, year, edition, pages
Elsevier , 2025. Vol. 150, article id 105906
Keywords [en]
Image fusion, Transfer learning, VGG-19, Feature fusion, Feature perception
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hig:diva-47088DOI: 10.1016/j.infrared.2025.105906ISI: 001502163000004Scopus ID: 2-s2.0-105006829801OAI: oai:DiVA.org:hig-47088DiVA, id: diva2:1965471
Available from: 2025-06-09 Created: 2025-06-09 Last updated: 2025-10-02Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopusLink to TNO Image Fusion Dataset

Authority records

Bavirisetti, Durga Prasad

Search in DiVA

By author/editor
Liu, GangBavirisetti, Durga Prasad
By organisation
Computer Science
In the same journal
Infrared physics & technology
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 51 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf