Inteligência artificial para classificaçãode fake news

MARTINS, Matheus Ferreira; SILVA, Pedro Henrique Mendes

Please use this identifier to cite or link to this item: https://ric.cps.sp.gov.br/handle/123456789/37068

Title:	Inteligência artificial para classificaçãode fake news
Other Titles:	Artificial intelligence for fake news classification
Authors:	MARTINS, Matheus Ferreira SILVA, Pedro Henrique Mendes
Advisor:	BORGES JÚNIOR, Sérgio Ricardo
Other contributor:	VOLPE, Valéria Maria ALVES, Isabela Alquaz
type of document:	Artigo Científico
Keywords:	Algoritmos;Desenvolvimento de software (avaliação);Machine learning;Fake news;Linguagem natural
Issue Date:	11-Dec-2020
Publisher:	121
Citation:	MARTINS, Matheus Ferreira; SILVA, Pedro Henrique Mendes. Inteligência artificial para classificaçãode fake news. 2020. Trabalho de Conclusão de Curso (Curso Superior de Tecnologia em Análise e Desenvolvimento de Sistemas) – Faculdade de Tecnologia de São José do Rio Preto, São José do Rio Preto, 2020.
Abstract:	As informações difundidas pela internet todos os dias nem sempre tem um conteúdo confiável. As Fake News são notícias que não representam a realidade, mas são compartilhadas como se fossem verídicas. A motivação de quem as fabrica e as divulgam são diversas. Nesta perspectiva, este trabalho possuiu como objetivo desenvolver uma aplicação para classificar notícias entre reais e fake news com a combinação de processamento de linguagem natural e aprendizagem de máquina. Para isso, utilizou-se os seguintes algoritmos de aprendizagem de máquina: Random Forest (RF), Support Vector Machine (SVM) e Naive Bayes (NB)a partir da biblioteca Scikit-learn da linguagem Python. O conjunto de dados utilizado foi o “fake and real” news obtido no Kaggle. Os resultados obtidos demonstram que RFobteve o melhor desempenho em todas as métricas utilizadas (Acurácia, Precisão, Recall e F1-Score), sendo: 99,7%, 100%, 99,7%, 99,3%, respectivamente. O SVM teve um bom desempenho, mas inferior ao RF, obtendo: 86,5%, 97,4%, 85,2% e 75,7%, respectivamente. Por fim, NB obteve o pior desempenho em todas as métricas: 60,6%, 61,6%, 61,6% e 61,7%. The information daily spread over the Internet doesn't always have a reliable content. Fake News is information that does not represent reality, but it is shared as if it was true. The motivation of those who manufacture and disseminate it is diverse. In this perspective, this work aimed to develop an application to classify news between true news and fake ones with the combination of natural language processing and machine learning. Therefore, the following machine learning algorithms were used: Random Forest (RF), Support Vector Machine (SVM) and Naive Bayes (NB) from the Scikit-learn library of the Python language. The dataset used was fake and real news obtained from Kaggle. The results obtained demonstrate that RF obtained the best performance in all the metrics used (Accuracy, Precision, Recall and F1-Score), being: 99.7%, 100%, 99.7%, 99.3%, respectively. SVM performed well, but below RF, obtaining: 86.5%, 97.4%, 85.2% and 75.7%, respectively. Finally, NB had the worst performance in all metrics: 60.6%, 61.6%, 61.6% and 61.7%.
URI:	https://ric.cps.sp.gov.br/handle/123456789/37068
Appears in Collections:	Trabalhos de Conclusão de Curso

Files in This Item:

File	Description	Size	Format
analiseedesenvolvimentodesistemas_2020_2_pedrohenriquemendessilva_inteligenciaartificialparacassificacaodefakenews.pdf Restricted Access		661.21 kB	Adobe PDF	View/Open Request a copy

Show full item record Recommend this item