Modern social networks facilitate the rapid global dissemination of news. However, much of this content is often unverified or shared based on personal opinions and beliefs, leading to widespread misinformation, erosion of public trust, and potential social and political instability. In this evolving landscape, the early detection of fake news has become a pressing challenge. To address this issue, multi-modal approaches have gained increasing attention due to their ability to integrate diverse sources of information, including textual content, images, and network structures, to capture richer contextual cues and deeper semantic relationships, improving detection performance. Despite the growing interest in this field, there remains no clear consensus on the most effective fusion strategy for integrating these heterogeneous data sources. In this work, we introduce M3DUSA , a modular multi-modal framework designed to effectively detect malicious content by leveraging the complementary strengths of multiple modalities. By systematically exploring the interplay between different fusion techniques, our framework adapts to various fake news detection scenarios, offering flexibility and robustness in identifying deceptive content across diverse contexts. To evaluate the effectiveness of M3DUSA, we conducted extensive experiments on two real-world datasets. The results demonstrate that our framework outperforms existing methods, highlighting its superior ability to exploit multi-modal information for improved fake news detection. Our findings confirm that while both early and late fusion strategies can effectively capture complementary signals across modalities, their advantages vary depending on the detection objectives and dataset characteristics.

M3DUSA: A Modular Multi-Modal Deep fUSion Architecture for fake news detection on social media

Martirano Liliana
Primo
;
Comito Carmela;Guarascio Massimo;Pisani Francesco Sergio
;
Zicari Paolo
2025

Abstract

Modern social networks facilitate the rapid global dissemination of news. However, much of this content is often unverified or shared based on personal opinions and beliefs, leading to widespread misinformation, erosion of public trust, and potential social and political instability. In this evolving landscape, the early detection of fake news has become a pressing challenge. To address this issue, multi-modal approaches have gained increasing attention due to their ability to integrate diverse sources of information, including textual content, images, and network structures, to capture richer contextual cues and deeper semantic relationships, improving detection performance. Despite the growing interest in this field, there remains no clear consensus on the most effective fusion strategy for integrating these heterogeneous data sources. In this work, we introduce M3DUSA , a modular multi-modal framework designed to effectively detect malicious content by leveraging the complementary strengths of multiple modalities. By systematically exploring the interplay between different fusion techniques, our framework adapts to various fake news detection scenarios, offering flexibility and robustness in identifying deceptive content across diverse contexts. To evaluate the effectiveness of M3DUSA, we conducted extensive experiments on two real-world datasets. The results demonstrate that our framework outperforms existing methods, highlighting its superior ability to exploit multi-modal information for improved fake news detection. Our findings confirm that while both early and late fusion strategies can effectively capture complementary signals across modalities, their advantages vary depending on the detection objectives and dataset characteristics.
2025
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Multi-Modal Fake News Detection, Deep fusion methods, Social Networks, Heterogeneous Information Networks
File in questo prodotto:
File Dimensione Formato  
M3DUSA.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 3.39 MB
Formato Adobe PDF
3.39 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/559903
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
social impact