Studying migration using traditional data has some limitations. To date, there have been several studies proposing innovative methodologies to measure migration stocks and flows from social big data. Nevertheless, a uniform definition of a migrant is difficult to find as it varies from one work to another depending on the purpose of the study and nature of the dataset used. In this work, a generic methodology is developed to identify migrants within the Twitter population. This describes a migrant as a person who has the current residence different from the nationality. The residence is defined as the location where a user spends most of his/her time in a certain year. The nationality is inferred from linguistic and social connections to a migrant's country of origin. This methodology is validated first with an internal gold standard dataset and second with two official statistics, and shows strong performance scores and correlation coefficients. Our method has the advantage that it can identify both immigrants and emigrants, regardless of the origin/destination countries. The new methodology can be used to study various aspects of migration, including opinions, integration, attachment, stocks and flows, motivations for migration, etc. Here, we exemplify how trending topics across and throughout different migrant communities can be observed.

Digital footprints of international migration on twitter

Giannotti F;Gabrielli L
2020

Abstract

Studying migration using traditional data has some limitations. To date, there have been several studies proposing innovative methodologies to measure migration stocks and flows from social big data. Nevertheless, a uniform definition of a migrant is difficult to find as it varies from one work to another depending on the purpose of the study and nature of the dataset used. In this work, a generic methodology is developed to identify migrants within the Twitter population. This describes a migrant as a person who has the current residence different from the nationality. The residence is defined as the location where a user spends most of his/her time in a certain year. The nationality is inferred from linguistic and social connections to a migrant's country of origin. This methodology is validated first with an internal gold standard dataset and second with two official statistics, and shows strong performance scores and correlation coefficients. Our method has the advantage that it can identify both immigrants and emigrants, regardless of the origin/destination countries. The new methodology can be used to study various aspects of migration, including opinions, integration, attachment, stocks and flows, motivations for migration, etc. Here, we exemplify how trending topics across and throughout different migrant communities can be observed.
2020
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Berthold M., Feelders A., Krempl G.
Advances in Intelligent Data Analysis XVIII. IDA 2020
IDA 2020 - 18th International Conference on Intelligent Data Analysis
274
286
978-3-030-44583-6
https://link.springer.com/chapter/10.1007/978-3-030-44584-3_22
Sì, ma tipo non specificato
27-29 April, 2020
Konstanz, Germany
International migration
Emigration
Big data
Twitter
progetto europeo HumMingBird (grant agreement no 870661) da inserire
2
open
Kim J.; Sirbu A.; Giannotti F.; Gabrielli L.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   SoBigData Research Infrastructure
   SoBigData
   H2020
   654024

   Enhanced migration measures from a multidimensional perspective
   HumMingBird
   H2020
   870661
File in questo prodotto:
File Dimensione Formato  
prod_424530-doc_151405.pdf

accesso aperto

Descrizione: Digital footprints of international migration on twitter
Tipologia: Versione Editoriale (PDF)
Dimensione 796.42 kB
Formato Adobe PDF
796.42 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/405181
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 18
  • ???jsp.display-item.citation.isi??? 12
social impact