Covenant University Repository

Single-label machine learning classification revealed some hidden but inter-related causes of five psychotic disorder diseases

Okagbue, H. I. and Ijezie, Ogochukwu A. and Ugwoke, P. O. and Adeyemi-Kayode, Temitope M. and Jonathan, Oluranti (2023) Single-label machine learning classification revealed some hidden but inter-related causes of five psychotic disorder diseases. Heliyon, 9. ISSN 2405-8440

Abstract

Psychotic disorder diseases (PDD), or mental illnesses, are a group of illnesses that affect the mind: they impair cognitive ability, retard emotional ability and obstruct communication and relationships with others, and they are characterized by delusions, hallucinations and disoriented or disordered patterns of thinking. Prognosis of PDD is not sufficient because of the nature of the diseases, and as such an adequate form of diagnosis is required to detect, manage and treat the illness. This paper applied the single-label classification (SLC) machine learning approach to mine the electronic health records of people with PDD in Nigeria, using eleven independent (demographic) variables and five PDDs as target variables. The five PDDs are Insomnia, Schizophrenia, Minimal Brain Dysfunction (MBD), which is also known as Attention-Deficit/Hyperactivity Disorder (ADHD), Vascular Dementia (VD) and Bipolar Disorder (BD). SLC was used because it makes it easier to detect PDDs that are related to each other without loss of information, which is an advantage over multi-label classification (MLC). The ReliefF algorithm was used in each experiment to rank the independent variables by importance, and redundant variables were excluded from the analysis. The order of the variables in feature selection was matched with the feature importance obtained after classification, and the agreement was quantified using the Spearman rank correlation coefficient. The data was divided into 70% for training and 30% for testing. Four new performance metrics adapted from the root mean square error (RMSE) were proposed and used to measure the differences between the performance results of the 10 machine learning models, first between training and testing and second with and without feature selection. The new metrics are close to zero, which is an indication that the use of feature selection and cross validation may not greatly affect the accuracy of the SLC.
When the PDDs are included as predictors for classifying the others, there was a tremendous improvement, as revealed by the four new metrics for classification accuracy (CA), precision and recall. Analysis of variance showed that the four metrics differ significantly for CA and precision. However, there was no significant difference between CA and precision when the two are compared across the four evaluation metrics at a p-value less than 0.05.
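
The abstract does not give the formulas for the four RMSE-adapted metrics, so the following is only an illustrative Python sketch of the two kinds of comparison it describes. It assumes that a metric of this type is the root mean square of paired differences between two sets of model scores (for example training versus testing accuracy), and it uses the standard no-ties Spearman formula to compare a feature-selection ranking with a feature-importance ranking. The function names `rmse_gap` and `spearman_rho` and all numeric values are hypothetical and are not taken from the paper.

```python
import math


def rmse_gap(scores_a, scores_b):
    """Root mean square of the paired differences between two score lists.

    Assumption: this is one plausible reading of an "RMSE-adapted" metric;
    the paper's exact definitions are not stated in the abstract.
    """
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("score lists must be non-empty and of equal length")
    squared = [(a - b) ** 2 for a, b in zip(scores_a, scores_b)]
    return math.sqrt(sum(squared) / len(squared))


def spearman_rho(rank_x, rank_y):
    """Spearman rank correlation for two rankings that contain no ties."""
    n = len(rank_x)
    if n != len(rank_y) or n < 2:
        raise ValueError("rankings must have equal length of at least 2")
    d_squared = sum((x - y) ** 2 for x, y in zip(rank_x, rank_y))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


# Hypothetical accuracies of three models on the training and testing splits.
train_acc = [0.90, 0.85, 0.80]
test_acc = [0.88, 0.85, 0.83]
gap = rmse_gap(train_acc, test_acc)

# Hypothetical ranks of five variables: feature-selection order versus
# feature-importance order after classification.
selection_rank = [1, 2, 3, 4, 5]
importance_rank = [2, 1, 3, 5, 4]
rho = spearman_rho(selection_rank, importance_rank)

print(f"RMSE-style gap: {gap:.4f}")
print(f"Spearman rho: {rho:.2f}")
```

Under this reading, a gap close to zero means the two sets of scores barely differ, which matches the abstract's interpretation of its near-zero metrics, and a rho close to 1 means the two variable orderings largely agree.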

Item Type: Article
Uncontrolled Keywords: Classification; Diagnosis; Feature importance; Machine learning; Psychotic disorder; Single label approach
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science
Depositing User: nwokealisi
Date Deposited: 12 Jul 2024 15:47
Last Modified: 12 Jul 2024 15:47
URI: http://eprints.covenantuniversity.edu.ng/id/eprint/18195
