University of Rwanda Digital Repository

Machine learning based prediction of malaria outbreak using environment data in Rwanda

Show simple item record

dc.contributor.author Dukuzumuremyi, Albert
dc.date.accessioned 2022-05-13T14:30:21Z
dc.date.available 2022-05-13T14:30:21Z
dc.date.issued 2020
dc.identifier.uri http://hdl.handle.net/123456789/1576
dc.description Masters' Dissertation en_US
dc.description.abstract Malaria is still one of the common diseases that cause a threat to human population globally, 2019 report of the World Health Organization (WHO), indicated that an estimated 228 million malaria cases were found in 2018 and in 2017 the estimated malaria cases were 231 million. Many studies have highlighted the environment factors as the contributing factors to the variability of malaria prevalence in different regions. Different study revealed that climate factors including temperature, humidity and rainfall should play an important/leading role in the model prediction of malaria outbreak. Furthermore, there is a need for intelligent predictive systems using machine learning techniques which can predict the malaria outbreak, based on historical data on malaria and environment factors, this study was based on building a predictive model of malaria outbreak which should be used as a warning system to health care providers, hospitals, and other health institutions to give a warning on an occurrence of malaria outbreak based on meteorological data. Historical data from 2016 to 2019 of malaria cases from Rwanda Biomedical Center (RBC) and meteorological data including maximum and minimum temperature, rainfall, elevation, latitude ,and longitude from Rwanda Meteorological Agency of ten districts including Nyarugenge, Kicukiro, Nyamagabe, Huye, Gicumbi, Musanze, Karongi, Rusizi, Kayonza, and Nyagatare that constitute the total of 2080 observations have been used to fit six machine learning algorithms including Decision tree, Random Forest, Naïve Bayes, Support vector machine, K-nearest neighbour, and Logistic regression thus the best algorithm model will be selected based on performance metrics of Accuracy, Precision, Recall, F-score and ROC have been applied on each machine learning algorithms to evaluate their performance. Among those classifiers, Random Forest comes with high performance compared to others with more than 90% in all evaluation metrics; it has shown the accuracy of 90.75%, F-score of 90.73%, Precision of 90.69% and Recall at 90.88%. However, the classifiers also have shown the high performance except for Support vector machine which shown only around 60% in all evaluation metrics, but other classifiers scored above 70% on the evaluation metrics. According to this study for malaria outbreak, it is recommended to use Random Forest for malaria prediction. The use of machine learning algorithms models for prediction of malaria outbreak proved to be used as alarming system for health-based, health care providers, and health practitioners thereby they can be well prepared and set up the new prevention measures. en_US
dc.language.iso en en_US
dc.publisher University of Rwanda en_US
dc.subject Machine learning; Malaria outbreak ; Classifiers ;Support vector machine; Logistic Regression; Naïve Bayes ; Decision tree; Random Forest ; K-Nearest Neighbor j. Training set k. Test set en_US
dc.title Machine learning based prediction of malaria outbreak using environment data in Rwanda en_US
dc.type Dissertation en_US
dc.type Learning Object en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Browse

My Account