A Multilevel Multimodal Fusion Transformer for Remote Sensing Semantic Segmentation (2024), Ma Xianping | AcademicGPT, tlooto

A multilevel multimodal fusion scheme called FTransUNet is proposed to provide a robust and effective multimodal fusion backbone for semantic segmentation by integrating both CNN and Vit into one unif (2024), IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, Ma Xianping | AcademicGPT, tlooto for Academic and Research