UIJRT » United International Journal for Research & Technology

Afaan Oromo Fake News Detection Using Natural Language Processing and Passive-Aggressive

Daraje Kaba Gurmessa
Keywords: Oromo, Fake News Detection, Passive-Aggressive, NLP, TF-IDF.

Cite ➜

Gurmessa, D.K., 2020. Afaan Oromo Fake News Detection Using Natural Language Processing and Passive-Aggressive. United International Journal for Research & Technology (UIJRT), 2(2), pp.33-40.

Abstract

The main objective of this study is to develop a fake news detection system for Afaan Oromo. The system preprocesses news text through tokenization, normalization, stop-word removal, and abbreviation resolution. It then extracts features using term frequency-inverse document frequency (TF-IDF), term frequency, and hashing, which weigh how important a word is within a news item and across the corpus. N-grams, a powerful natural language processing technique, were also used to capture semantic and syntactic sequences. All possible combinations of these feature extraction and natural language processing techniques were evaluated with a passive-aggressive classification algorithm. The passive-aggressive classifier achieved 97.2% accuracy, a classification error of 2.8%, which was better than ensemble algorithms such as gradient boosting and random forest and linear classifiers such as multinomial Naïve Bayes. Finally, the Python Django framework was used for web-based deployment of the model, using TF-IDF feature extraction with unigrams and the passive-aggressive classification algorithm.
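The core of the pipeline described in the abstract, unigram TF-IDF features fed to a passive-aggressive classifier, can be illustrated with a minimal pure-Python sketch. This is not the author's implementation: the function names, the smoothed IDF formula, the PA-I update variant, the aggressiveness parameter C, and the English toy headlines are all assumptions made for illustration, and the whitespace tokenizer stands in for the fuller preprocessing (stop-word removal, normalization, abbreviation resolution) the study describes.

```python
import math
from collections import Counter


def tokenize(text):
    # Placeholder preprocessing: lowercase and split on whitespace (unigrams).
    return text.lower().split()


def fit_idf(docs):
    # Smoothed inverse document frequency for every term seen in training.
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}


def tfidf(doc, idf):
    # Sparse, L2-normalized TF-IDF vector; unseen terms are ignored.
    tf = Counter(t for t in tokenize(doc) if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else {}


def score(w, x):
    # Dot product between the sparse weight vector and a sparse sample.
    return sum(w.get(t, 0.0) * v for t, v in x.items())


def train_pa(samples, labels, C=1.0, epochs=5):
    # PA-I update: stay passive when the hinge loss is zero, otherwise
    # move the weights just enough to fix the margin, capped by C.
    w = {}
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            loss = max(0.0, 1.0 - y * score(w, x))
            sq_norm = sum(v * v for v in x.values())
            if loss > 0.0 and sq_norm > 0.0:
                tau = min(C, loss / sq_norm)
                for t, v in x.items():
                    w[t] = w.get(t, 0.0) + tau * y * v
    return w


def predict(w, x):
    # +1 = fake, -1 = real (label convention assumed for this sketch).
    return 1 if score(w, x) >= 0.0 else -1


# Toy English stand-ins for a labeled news corpus (hypothetical data).
docs = [
    "shocking miracle cure revealed",
    "officials confirm budget report",
    "miracle trick shocking secret",
    "council publishes budget minutes",
]
labels = [1, -1, 1, -1]

idf = fit_idf(docs)
weights = train_pa([tfidf(d, idf) for d in docs], labels)
train_predictions = [predict(weights, tfidf(d, idf)) for d in docs]
print(train_predictions)
```

The passive-aggressive update only changes the weights when a sample violates the margin, which is why it suits streams of news text; a production system would more likely rely on an established machine learning library than on hand-written updates like these.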

