Species distribution modeling to predict tsetse fly (Glossina spp.) habitat suitability in Kenya

Publication Type

Journal Article

Journal Name

Parasites Vectors

Publication Date

9-24-2025

Abstract

BACKGROUND: African animal trypanosomosis (AAT) and human African trypanosomosis (HAT) are transmitted and spread primarily by tsetse flies (Glossina spp.) in sub-Saharan Africa. The animal disease poses significant challenges to agropastoral systems, including in Kenya, where 38 out of 47 counties are infested with eight species of Glossina. Climate change and human activities can also aggravate these infestations, putting rural-scale farmers who rely on agropastoral systems at a greater risk. Geographical gaps in existing entomological datasets limit a comprehensive understanding of tsetse fly distribution across the country, especially amid rapid landscape dynamics. METHODS: This study aimed to predict the spatial distribution of tsetse flies habitat in Kenya using recent entomological data (i.e., tsetse fly occurrence records), satellite-derived environmental variables, landscape structure, demographic indicators, and species-distribution modeling techniques. We applied four machine learning (ML) algorithms-random forest (RF), support vector machines (SVM), maximum entropy (MaxEnt), and generalized linear models (GLM)-to predict tsetse flies habitat suitability. Additionally, we developed ensemble models that combine the predictive power of the four algorithms. Predictions were made at the genus level (Glossina spp.) and the species level for one priority species (Glossina pallidipes). RESULTS: The models performed well with true skill statistic (TSS) and area under the curve (AUC) metric measures of 0.67 and 0.88 for Glossina spp. and 0.85 and 0.96 for G. pallidipes, respectively. The predictions indicated an estimated potential suitable area of about 26% of Kenya for Glossina spp. and 9% for G. pallidipes. Tsetse fly habitat suitability was positively correlated with increased sheep density, normalized difference vegetation index, and soil moisture. However, suitability declined when the maximum land surface temperature (LST) exceeded 40 °C and elevation increased above 400 m. CONCLUSIONS: These findings can help improve the targeting and, hence, the cost-effectiveness of surveillance and ultimately support an evidence-based progressive control of tsetse flies infestation in Kenya.

Keywords

Animal, Data science, Health, Machine learning, Remote sensing technology, Vector-borne diseases

Share

COinS