Cross-browser extension for detecting fake news written in Filipino, powered by machine learning.
This repository hosts a fake news classifier trained partly on a corpus of Filipino articles, Fake News Filipino, adapted from Cruz et al. We have also built a similar corpus named Fake News Filipino 2024. As part of our undergraduate study at the University of the Philippines Visayas, we have built a cross-browser extension that employs our fake news classifier. We utilize Tampermonkey as a wrapper around our extension and Render to host our machine learning model on the cloud as an API. The fake news classifier only works on articles written primarily in Filipino with smatterings of English vernacular. Currently, we have deployed a trained Logistic Regression model that has achieved a top accuracy of ~98% accuracy on our test set. All resources that have been used in this study are available on this repository.
- Download and install Tampermonkey for your browser of choice.
- Download the latest release of the extension from releases.
- Extract the zip file and open
script.js
with Notepad or your plaintext editor of choice. - Open the Tampermonkey dashboard.
- Click the
+
button and it will open a new userscript. - Copy and paste the contents of
script.js
over the userscript, replacing the old contents. - Click
file
and thensave
.
- Pin Tampermonkey to your browser's address bar.
- Once pinned, click the Tampermonkey icon.
- Under
Fake News Detector
, clickDetect Fake News
. - Wait for the system to process the article you're reading.
- Profit!
Rene Andre Jocsing
Lead Programmer, Project Manager, WriterChancy Ponce de Leon
Programmer, Data AnalystCobe Austin Lupac
Presenter, ProgrammerRon Gerlan Naragdao
Programmer
If you wish to contribute to the project, please open a pull request.
@inproceedings{cruz2020localization,
title={Localization of Fake News Detection via Multitask Transfer Learning},
author={Cruz, Jan Christian Blaise and Tan, Julianne Agatha and Cheng, Charibeth},
booktitle={Proceedings of The 12th Language Resources and Evaluation Conference},
pages={2596--2604},
year={2020}
}
@article{evaluating2019cruz,
title={{Evaluating Language Model Finetuning Techniques for Low-resource Languages}},
author={Cruz, Jan Christian Blaise and Cheng, Charibeth},
journal={arXiv preprint arXiv:1907.00409},
year={2019}
}
@article{imperial2021application,
author={Imperial, J. M. and Ong, E.},
title={Application of Lexical Features Towards Improvement of Filipino Readability Identification of Children's Literature},
journal={arXiv preprint},
eprint={2101.10537},
year={2021}
}
@article{imperial2021diverse,
author={Imperial, J. M. and Ong, E.},
title={Diverse Linguistic Features for Assessing Reading Difficulty of Educational Filipino Texts},
journal={arXiv preprint},
eprint={2108.00241},
year={2021}
}
@inproceedings{imperial2020exploring,
author={Imperial, J. M. and Ong, E.},
title={Exploring Hybrid Linguistic Feature Sets To Measure Filipino Text Readability},
booktitle={2020 International Conference on Asian Language Processing (IALP)},
pages={175-180},
year={2020},
organization={IEEE}
}