Leveraging Large Language Models for multilingual disinformation detection: the 2022 Kenyan General Elections case study

Orwaru, R. K.

Leveraging Large Language Models for multilingual disinformation detection: the 2022 Kenyan General Elections case study

dc.contributor.author	Orwaru, R. K.
dc.date.accessioned	2026-04-02T10:57:40Z
dc.date.issued	2025
dc.description	Full - text thesis
dc.description.abstract	The rise of disinformation on social media, particularly during elections, poses challenges to democratic processes. This study examined the effectiveness of Large Language Models (LLMs) in detecting disinformation during the 2022 Kenyan General Elections. It analyzed the structure of disinformation networks, key actors’ influence mechanisms, and the synergies between these methodologies. Using a dataset from X (formerly known as Twitter), the study applied Hugging Face’s mBERT mBERT model for its multilingual capabilities. The data used to train the model had 20,000 rows and 37 features. Textual data was tokenised and classified as 'Hate Speech' or 'Not Hate Speech,' with key performance metrics, including accuracy, precision, recall, and F1-score indicating strong performance but potential overfitting due to dataset similarities. Further analysis revealed that disinformation networks relied on repetitive messaging and high-retweet amplification, with key influencers shaping discourse. SNA identified coordinated clusters, while SA detected strategic use of neutral and coded language strategically used to reframe discussions. LLMs enhanced detection by recognising subtle framing techniques, including word substitutions and sentiment shifts. Findings showed that disinformation was amplified through coordinated behaviour involving high-profile figures and media accounts. The integration of SA, SNA, and LLMs demonstrated the potential of AI-driven tools in misinformation mitigation. The study highlights the need for a multi-pronged detection approach, combining network analysis, sentiment evaluation, and language modeling, with implications for policymakers, journalists, and researchers. Keywords: Large Language Models (LLMs), Disinformation, Hate Speech
dc.identifier.citation	Orwaru, R. K. (2025). Leveraging Large Language Models for multilingual disinformation detection: The 2022 Kenyan General Elections case study [Strathmore University]. https://hdl.handle.net/11071/16314
dc.identifier.uri	https://hdl.handle.net/11071/16314
dc.language.iso	en
dc.publisher	Strathmore University
dc.title	Leveraging Large Language Models for multilingual disinformation detection: the 2022 Kenyan General Elections case study
dc.type	Thesis

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Leveraging Large Language Models for multilingual disinformation detection - the 2022 Kenyan General Elections case study.pdf
Size:: 13.89 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

MSc. DSA Theses and Dissertations (2024)