University Links: Home Page | Site Map
Covenant University Repository

Performance evaluation of features for gene essentiality prediction

Aromolaran, Olufemi and Oyelade, O. J. and Adebiyi, E. F. (2020) Performance evaluation of features for gene essentiality prediction. In: International Conference on Science and Sustainable Development (ICSSD 2020).

[img] PDF
Download (796kB)


Essential genes are subset of genes required by an organism for growth and sustenance of life and as well responsible for phenotypic changes when their activities are altered. They have been utilized as drug targets, disease control agent, etc. Essential genes have been widely identified especially in microorganisms, due to the extensive experimental studies on some of them such as Escherichia coli and Saccharomyces cerevisiae. Experimental approach has been a reliable method to identify essential genes. However, it is complex, costly, labour and time intensive. Therefore, computational approach has been developed to complement the experimental approach in order to minimize resources required for essentiality identification experiments. Machine learning approaches have been widely used to predict essential genes in model organisms using different categories of features with varying degrees of accuracy and performance. However, previous studies have not established the most important categories of features that provide the distinguishing power in machine learning essentiality predictions. Therefore, this study evaluates the discriminating strength of major categories of features used in essential gene prediction task as well as the factors responsible for effective computational prediction. Four categories of features were considered and kfold cross-validation machine learning technique was used to build the classification model. Our results show that ontology features with an AUROC score of 0.936 has the most discriminating power to classify essential and non-essential genes. This studyconcludes that more ontology related features will further improve the performance of machine learning approach and also sensitivity, precision and AUPRC are realistic measures of performance in essentiality prediction.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Essential genes, Essential proteins, Classification features, Machine-learning
Subjects: Q Science > Q Science (General)
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science
Depositing User: Mrs Hannah Akinwumi
Date Deposited: 22 Jun 2021 11:09
Last Modified: 22 Jun 2021 11:09

Actions (login required)

View Item View Item