TY - GEN
T1 - Automated Statistics Extraction of Public Security Events Reported Through Microtexts on Social Networks
AU - Ferreira, Flávio
AU - Duarte, Julio
AU - Ugulino, Wallace
N1 - Funding Information:
This work was partially supported by national funds through FINEP, Financiadora de Estudos e Projetos and FAPEB, Fundacao de Apoio a Pesquisa, Desenvolvimento e Inovacao do Exercito Brasileiro, under project Sistema de Sistemas de Comando e Controle with reference no 2904/20 under contract no 01.20.0272.00
Funding Information:
This work was partially supported by national funds through FINEP, Financiadora de Estudos e Projetos and FAPEB, Fundação de Apoio à Pesquisa, Desenvolvimento e Inovação do Exército Brasileiro, under project “Sistema de Sistemas de Comando e Controle” with reference no 2904/20 under contract no 01.20.0272.00.
Publisher Copyright:
© 2022 ACM.
PY - 2022/5/16
Y1 - 2022/5/16
N2 - Lately, Rio de Janeiro State has been characterized by the occurrence of successive public security events (shootings, assaults, robberies, etc.), causing great insecurity, affecting the daily lives of the population, and worrying public security agencies in the fight against crime. Although the indicators of public security events recently decreased, there is still a feeling of insecurity, while the population uses social networks to notify illegal acts that occurred in their vicinity. Although this collaboration is limited to the crimes that occurred, many published messages are difficult to interpret. Knowledge Discovery is a process of extracting data in an implicit, previously unknown, and useful way that can be applied for different purposes. In this context, Natural Language Processing is a powerful tool that allows the extraction of information from these unstructured data. This work proposes a methodology for automatic knowledge extraction, in the form of statistics related to public security events posted on social networks, particularly the ones occurred in Rio de Janeiro. The main contribution of this work is the proposal of a methodology for the construction of an Information System that allows the collection of statistics of notified public security events. In addition to this methodology, which can also be used in the construction of other Information Systems, this work contributes with a public security event recognition model that has a performance of 95%, and an available dataset that can be used to support other researches, such as: the identification of new behavior patterns, the discovery of hidden knowledge, among other fronts.
AB - Lately, Rio de Janeiro State has been characterized by the occurrence of successive public security events (shootings, assaults, robberies, etc.), causing great insecurity, affecting the daily lives of the population, and worrying public security agencies in the fight against crime. Although the indicators of public security events recently decreased, there is still a feeling of insecurity, while the population uses social networks to notify illegal acts that occurred in their vicinity. Although this collaboration is limited to the crimes that occurred, many published messages are difficult to interpret. Knowledge Discovery is a process of extracting data in an implicit, previously unknown, and useful way that can be applied for different purposes. In this context, Natural Language Processing is a powerful tool that allows the extraction of information from these unstructured data. This work proposes a methodology for automatic knowledge extraction, in the form of statistics related to public security events posted on social networks, particularly the ones occurred in Rio de Janeiro. The main contribution of this work is the proposal of a methodology for the construction of an Information System that allows the collection of statistics of notified public security events. In addition to this methodology, which can also be used in the construction of other Information Systems, this work contributes with a public security event recognition model that has a performance of 95%, and an available dataset that can be used to support other researches, such as: the identification of new behavior patterns, the discovery of hidden knowledge, among other fronts.
KW - Artificial Intelligence (AI)
KW - Data mining
KW - Machine Learning (ML)
KW - Natural Language Processing (NLP)
KW - Public security
KW - Text classification
KW - Text mining
KW - Twitter
UR - http://www.scopus.com/inward/record.url?scp=85133794737&partnerID=8YFLogxK
U2 - 10.1145/3535511.3535513
DO - 10.1145/3535511.3535513
M3 - Conference contribution
AN - SCOPUS:85133794737
SN - 978-1-4503-9698-1
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 18th Brazilian Symposium on Information Systems
A2 - Silva, Williamson
A2 - Graciano Neto, Valdemar Vicente
A2 - de Lima Fontao, Awdren
A2 - Berardi, Rita Cristina G.
A2 - Graeml, Alexandre R.
PB - Association for Computing Machinery
CY - New York, NY
T2 - 18th Brazilian Symposium on Information Systems, SBSI 2022
Y2 - 16 May 2022 through 19 May 2022
ER -