Organised crime uses information technology systems to communicate, work or expand its influence. The EU FP7 Security Research Project CAPER (Collaborative information, Acquisition, Processing, Exploitation and Reporting for the prevention of organised crime), created in cooperation with European Law Enforcement Agencies (LEAs), aims to build a common collaborative and information sharing platform for the detection and prevention of organised crime, which exploits Open Source Intelligence (OSINT). LEAs are becoming more inclined to using OSINT tools, and particularly tools able to manage Online Social Networks (OSNs) data. This paper presents the CAPER Facebook crawling and analysis subsystem. Heuristic algorithms have been implemented in order to extract specific properties of Facebook's social graph, in particular user interactions. To support analysis tasks specifically, extensive effort has been spent on the analysis of textual user generated content and on the recognition of named-entities, in particular person names, locations and organisations. Relationships between users and entities mentioned in posts and in related comments are created and merged into the users networks extracted from the social graph. All entity relationships are finally visualised in user-friendly network graphs.
CAPER: Crawling and analysing Facebook for intelligence purposes
La Polla MN;Marchetti A;Tesconi M
2014
Abstract
Organised crime uses information technology systems to communicate, work or expand its influence. The EU FP7 Security Research Project CAPER (Collaborative information, Acquisition, Processing, Exploitation and Reporting for the prevention of organised crime), created in cooperation with European Law Enforcement Agencies (LEAs), aims to build a common collaborative and information sharing platform for the detection and prevention of organised crime, which exploits Open Source Intelligence (OSINT). LEAs are becoming more inclined to using OSINT tools, and particularly tools able to manage Online Social Networks (OSNs) data. This paper presents the CAPER Facebook crawling and analysis subsystem. Heuristic algorithms have been implemented in order to extract specific properties of Facebook's social graph, in particular user interactions. To support analysis tasks specifically, extensive effort has been spent on the analysis of textual user generated content and on the recognition of named-entities, in particular person names, locations and organisations. Relationships between users and entities mentioned in posts and in related comments are created and merged into the users networks extracted from the social graph. All entity relationships are finally visualised in user-friendly network graphs.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.