Skip to main content

Research Repository

Advanced Search

Identification of Risk Factors Associated with Tuberculosis in Southwest Iran: A Machine Learning Method

Amoori, Neda; Cheraghian, Bahman; Amini, Payam; Alavi, Seyed Mohammad

Authors

Neda Amoori

Bahman Cheraghian

Seyed Mohammad Alavi



Abstract

BackgroundTuberculosis is a principal public health issue. Reducing and controlling tuberculosis did not result in the expected success despite implementing effective preventive and therapeutic programs, one of the reasons for which is the delay in definitive diagnosis. Therefore, creating a diagnostic aid system for tuberculosis screening can help in the early diagnosis of this disease. This research aims to use machine learning techniques to identify economic, social, and environmental factors affecting tuberculosis.MethodsThis case-control study included 80 individuals with TB and 172 participants as controls. During January-October 2021, information was collected from thirty-six health centers in Ahvaz, southwest Iran. Five different machine learning approaches were used to identify factors associated with TB, including BMI, sex, age , marital status, education, employment status, size of the family, monthly income, cigarette smoking, hookah smoking, history of chronic illness, history of imprisonment, history of hospital admission, first-class family, second-class family, third-class family, friend, co-worker, neighbor, market, store, hospital, health center, workplace, restaurant, park, mosque, Basij base, Hairdressers and school. The data was analyzed using the statistical programming R software version 4.1.1.ResultsAccording to the calculated evaluation criteria, the accuracy level of 5 SVM, RF, LSSVM, KNN, and NB models is 0.99, 0.72, 0.97,0.99, and 0.95, respectively, and except for RF, the other models had the highest accuracy. Among the 39 investigated variables, 16 factors including First-class family (20.83%), friend (17.01%), health center (41.67%), hospital (24.74%), store (18.49%), market (14.32%), workplace (9.46%), history of hospital admission (51.82%), BMI (43.75%), sex (40.36%), age (22.83%), educational status (60.59%), employment status (43.58%), monthly income (63.80%), addiction (44.10%), history of imprisonment (38.19%) were of the highest importance on tuberculosis.ConclusionThe obtained results demonstrated that machine-learning techniques are effective in identifying economic, social, and environmental factors associated with tuberculosis. Identifying these different factors plays a significant role in preventing and performing appropriate and timely interventions to control this disease.

Journal Article Type Article
Acceptance Date Jan 17, 2024
Online Publication Date Jan 17, 2024
Publication Date 2024-01
Deposit Date Mar 19, 2024
Publicly Available Date Mar 19, 2024
Journal Medical journal of the Islamic Republic of Iran
Print ISSN 1016-1430
Electronic ISSN 2251-6840
Publisher Tehran University of Medical Sciences
Peer Reviewed Peer Reviewed
Volume 38
Issue 1
Pages 21-27
DOI https://doi.org/10.47176/mjiri.38.5
Keywords Tuberculosis, Classification, Risk factor, Machine Learning
Publisher URL https://mjiri.iums.ac.ir/article-1-8912-en.html&sw=Identification+of+Risk+Factors+Associated+With+Tuberculosis+in+Southwest+Iran%3A+A+Machine+Learning+Method
PMID 38434222

Files




You might also like



Downloadable Citations