University Links: Home Page | Site Map
Covenant University Repository

THE EFFECTS OF DECOMPOSITION OF THE GOALS SCORED IN CLASSIFYING THE OUTCOMES OF FIVE ENGLISH PREMIER LEAGUE SEASONS USING MACHINE LEARNING MODELS

Iyiola, Tomilayo Promise and Okagbue, H. I. and Adedotun, Adedayo F. and Akingbade, Toluwalase J. (2023) THE EFFECTS OF DECOMPOSITION OF THE GOALS SCORED IN CLASSIFYING THE OUTCOMES OF FIVE ENGLISH PREMIER LEAGUE SEASONS USING MACHINE LEARNING MODELS. Advances and Applications in Statistics, 87 (1). pp. 13-27. ISSN 0972-3617

[img] PDF
Download (117kB)
Official URL: http://www.pphmj.com

Abstract

The English Premier League (EPL) is one of the best football championships in the world and thus, data generated from it is highly sought after by users of football data. One of the uses of the data is in the prediction of outcome of the league matches. This paper applies four machine learning (ML) models in classifying the outcome (home win, draw, and away win) of five consecutive seasons of EPL using only six independent variables. Information Gain Ratio (IGR) and ReliefF were the feature selection algorithms that reduced the independent variables from 16 to 6. Spearman rank correlation gave a high significant positive correlation between the ranks of the 2 feature selection algorithms. The Kruskal-Wallis H test indicated that there is a significant difference in the dependent variable between the different Seasons (Chi-square = 15.36, Degrees of freedom = 4, P = 0.004). Adaptive boosting (AB), gradient boosting (GB), logistic regression (LR) and random forests (RF) were used in the classification of the outcome using the six independent variables and the performance metrics showed a perfect classification in almost all the models. This paper concluded that the knowledge of the number of goals scored by the home and away teams, and the number of Goals scored by home and away teams in the first half and second half are all that is needed to correctly classify the outcomes of the English Premier League (EPL). Secondly, the knowledge of the own goals and goals scored by penalty, and yellow and red cards conceded by the home or away teams is not necessarily needed in the determination or prediction of the outcomes of the EPL.

Item Type: Article
Uncontrolled Keywords: algorithm, classification, cross-validation, English Premier League, feature selection, machine learning, statistics.
Subjects: Q Science > QA Mathematics
Divisions: Faculty of Engineering, Science and Mathematics > School of Mathematics
Depositing User: nwokealisi
Date Deposited: 13 Nov 2023 10:37
Last Modified: 13 Nov 2023 10:37
URI: http://eprints.covenantuniversity.edu.ng/id/eprint/17593

Actions (login required)

View Item View Item