hig.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
TransFusion: Transfer learning-driven adaptive fusion network for infrared and visible image
School of Automation Engineering, Shanghai University of Electric Power, Shanghai 200090, China.
School of Automation Engineering, Shanghai University of Electric Power, Shanghai 200090, China.
School of Automation Engineering, Shanghai University of Electric Power, Shanghai 200090, China.ORCID-id: 0000-0003-4143-828X
College of Computer Science, Laboratory of Aerial Information Probe and Intelligent Perception, Sichuan University, Chengdu 610065, China.
Vise andre og tillknytning
2025 (engelsk)Inngår i: Infrared physics & technology, ISSN 1350-4495, E-ISSN 1879-0275, Vol. 150, artikkel-id 105906Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

The image fusion algorithm based on deep learning possesses strong feature extraction capabilities and generalization. However, due to the uninterpretability of features in deep learning, the design of fusion strategies becomes quite challenging. To address this issue, we propose a two-stage training feature adaptive fusion network based on the VGG-19 network. We introduce a parallel cross-modal channel perception module to achieve more targeted feature fusion by capturing channel differences between different modal domains. At the same time, in order to enhance the preservation of salient features, we designed a dynamic multi-level spatial attention guidance module that utilizes the saliency information of deep features from the source image to guide the adaptive fusion of shallow features. Additionally, we propose a double inner-loop feature mutual information loss that enforces the correlation of modal information, promoting efficient convergence of the perception module and guidance module. This method not only preserves the unique features of each modal domain but also effectively integrates information across modal domains, improving the quality of image fusion. Finally, we also perform objective and subjective experiments on MSRS and TNO datasets, and analyze the method. Experiments show that the proposed method achieves superior performance in image fusion tasks, and its potential value in practical applications is verified. The source code will be publicly available at https://github.com/YQ-097/TransFusion

sted, utgiver, år, opplag, sider
Elsevier , 2025. Vol. 150, artikkel-id 105906
Emneord [en]
Image fusion, Transfer learning, VGG-19, Feature fusion, Feature perception
HSV kategori
Identifikatorer
URN: urn:nbn:se:hig:diva-47088DOI: 10.1016/j.infrared.2025.105906ISI: 001502163000004Scopus ID: 2-s2.0-105006829801OAI: oai:DiVA.org:hig-47088DiVA, id: diva2:1965471
Tilgjengelig fra: 2025-06-09 Laget: 2025-06-09 Sist oppdatert: 2025-10-02bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopusLink to TNO Image Fusion Dataset

Person

Bavirisetti, Durga Prasad

Søk i DiVA

Av forfatter/redaktør
Liu, GangBavirisetti, Durga Prasad
Av organisasjonen
I samme tidsskrift
Infrared physics & technology

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 113 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • harvard-cite-them-right
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • sv-SE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • de-DE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf