(Publisher of Peer Reviewed Open Access Journals)

International Journal of Advanced Technology and Engineering Exploration (IJATEE)

ISSN (Print):2394-5443    ISSN (Online):2394-7454
Volume-8 Issue-84 November-2021
Full-Text PDF
Paper Title : A systematic literature review on student performance predictions
Author Name : Hasnah Nawang, Mokhairi Makhtar and Wan Mohd Amir Fazamin Wan Hamza
Abstract :

Prediction of student performance in educational institutions is a major topic of debate among researchers in efforts to improve teaching and learning. Effective prediction techniques and features would help educators and teachers design appropriate teaching content to help learners study according to predicted outcomes. The purpose of this paper is to present a systematic literature review on predictions of students’ performance in higher education institutions and secondary schools using Machine Learning, Educational Data Mining, and Learning Analytics methodologies. The review used in this study was designed to: i) provide an overview of techniques and algorithms used to predict students' performance; and ii) identify the features that have the greatest impact on students' performance. This paper also outlined several future insights in terms of applying hybrid techniques to educational datasets in order to improve accuracy in predicting students’ performance.

Keywords : Educational data mining, Machine learning, Learning analytics, Students, Performance prediction.
Cite this article : Nawang H, Makhtar M, Hamza WM. A systematic literature review on student performance predictions. International Journal of Advanced Technology and Engineering Exploration. 2021; 8(84):1441-1453. DOI:10.19101/IJATEE.2021.874521.
References :
[1]Gaftandzhieva S, Docheva M, Doneva R. A comprehensive approach to learning analytics in Bulgarian school education. Education and Information Technologies. 2021; 26(1):145-63.
[Crossref] [Google Scholar]
[2]Alyahyan E, Düştegör D. Predicting academic success in higher education: literature review and best practices. International Journal of Educational Technology in Higher Education. 2020; 17(1):1-21.
[Crossref] [Google Scholar]
[3]Ferreira SA, Andrade A. Academic analytics: anatomy of an exploratory essay. Education and Information Technologies. 2016; 21(1):229-43.
[Crossref] [Google Scholar]
[4]Najafabadi MM, Villanustre F, Khoshgoftaar TM, Seliya N, Wald R, Muharemagic E. Deep learning applications and challenges in big data analytics. Journal of Big Data. 2015; 2(1):1-21.
[Crossref] [Google Scholar]
[5]Khatib KC, Kamble TD, Chendake BR, Sonavane GN. Social media data mining for sentiment analysis. International Research Journal of Engineering and Technology. 2016; 3(4):373-6.
[Google Scholar]
[6]Ozdemir D, Opseth HM, Taylor H. Leveraging learning analytics for student reflection and course evaluation. Journal of Applied Research in Higher Education. 2019; 12(1):27-37.
[Crossref] [Google Scholar]
[7]Nuutila K, Tuominen H, Tapola A, Vainikainen MP, Niemivirta M. Consistency, longitudinal stability, and predictions of elementary school students task interest, success expectancy, and performance in mathematics. Learning and Instruction. 2018; 56:73-83.
[Crossref] [Google Scholar]
[8]Lang C, Siemens G, Wise A, Gasevic D. Handbook of learning analytics. New York, NY, USA: SOLAR, Society for Learning Analytics and Research; 2017.
[Google Scholar]
[9]Nawang H, Makhtar M, Shamsudin SN. Classification model and analysis on students performance. Journal of Fundamental and Applied Sciences. 2017; 9(6S):869-85.
[Google Scholar]
[10]Tsai YS, Gasevic D. Learning analytics in higher education-challenges and policies: a review of eight learning analytics policies. In proceedings of the seventh international learning analytics & knowledge conference 2017 (pp. 233-42).
[Crossref] [Google Scholar]
[11]Shahiri AM, Husain W. A review on predicting students performance using data mining techniques. Procedia Computer Science. 2015; 72:414-22.
[Crossref] [Google Scholar]
[12]Kitchenham B, Brereton OP, Budgen D, Turner M, Bailey J, Linkman S. Systematic literature reviews in software engineering–a systematic literature review. Information and Software Technology. 2009; 51(1):7-15.
[Crossref] [Google Scholar]
[13]Gil PD, Da CMS, Moro S, Costa JM. A data-driven approach to predict first-year students’ academic success in higher education institutions. Education and Information Technologies. 2021; 26(2):2165-90.
[Crossref] [Google Scholar]
[14]Qazdar A, Er-raha B, Cherkaoui C, Mammass D. A machine learning algorithm framework for predicting students performance: a case study of baccalaureate students in Morocco. Education and Information Technologies. 2019; 24(6):3577-89.
[Crossref] [Google Scholar]
[15]Costa-mendes R, Oliveira T, Castelli M, Cruz-jesus F. A machine learning approximation of the 2015 Portuguese high school student grades:a hybrid approach. Education and Information Technologies. 2021; 26(2):1527-47.
[Crossref] [Google Scholar]
[16]Baars GJ, Stijnen T, Splinter TA. A model to predict student failure in the first year of the undergraduate medical curriculum. Health Professions Education. 2017; 3(1):5-14.
[Crossref] [Google Scholar]
[17]Youssef M, Mohammed S, Hamada EK, Wafaa BF. A predictive approach based on efficient feature selection and learning algorithms’ competition: case of learners’ dropout in MOOCs. Education and Information Technologies. 2019; 24(6):3591-618.
[Crossref] [Google Scholar]
[18]Kostopoulos G, Kotsiantis S, Verykios VS. A prognosis of junior high school students’ performance based on active learning methods. In international conference on brain function assessment in learning 2017 (pp. 67-76). Springer, Cham.
[Crossref] [Google Scholar]
[19]Moreno-marcos PM, Pong TC, Munoz-merino PJ, Kloos CD. Analysis of the factors influencing learners’ performance prediction with learning analytics. IEEE Access. 2020; 8:5264-82.
[Crossref] [Google Scholar]
[20]Al-obeidat F, Tubaishat A, Dillon A, Shah B. Analyzing students’ performance using multi-criteria classification. Cluster Computing. 2018; 21(1):623-32.
[Crossref] [Google Scholar]
[21]Asif R, Merceron A, Ali SA, Haider NG. Analyzing undergraduate students performance using educational data mining. Computers & Education. 2017; 113:177-94.
[Crossref] [Google Scholar]
[22]Yousafzai BK, Hayat M, Afzal S. Application of machine learning and data mining in predicting the performance of intermediate and secondary education level student. Education and Information Technologies. 2020; 25(6):4677-97.
[Crossref] [Google Scholar]
[23]Adekitan AI, Noma-osaghae E. Data mining approach to predicting the performance of first year student in a university using the admission requirements. Education and Information Technologies. 2019; 24(2):1527-43.
[Crossref] [Google Scholar]
[24]Azcona D, Hsiao IH, Smeaton AF. Detecting students-at-risk in computer programming classes with learning analytics from students’ digital footprints. User Modeling and User-Adapted Interaction. 2019; 29(4):759-88.
[Crossref] [Google Scholar]
[25]Akçapınar G, Hasnine MN, Majumdar R, Flanagan B, Ogata H. Developing an early-warning system for spotting at-risk students by using eBook interaction logs. Smart Learning Environments. 2019; 6(1):1-15.
[Google Scholar]
[26]Hussain S, Dahan NA, Ba-alwib FM, Ribata N. Educational data mining and analysis of students’ academic performance using WEKA. Indonesian Journal of Electrical Engineering and Computer Science. 2018; 9(2):447-59.
[Crossref] [Google Scholar]
[27]Adekitan AI, Salau O. The impact of engineering students performance in the first three years on their graduation result using educational data mining. Heliyon. 2019; 5(2):1-21.
[Crossref] [Google Scholar]
[28]Marbouti F, Diefes-dux HA, Madhavan K. Models for early prediction of at-risk students in a course using standards-based grading. Computers & Education. 2016; 103:1-15.
[Crossref] [Google Scholar]
[29]Altujjar Y, Altamimi W, Al-turaiki I, Al-razgan M. Predicting critical courses affecting students performance: a case study. Procedia Computer Science. 2016; 82:65-71.
[Crossref] [Google Scholar]
[30]Zhou Q, Quan W, Zhong Y, Xiao W, Mou C, Wang Y. Predicting high-risk students using internet access logs. Knowledge and Information Systems. 2018; 55(2):393-413.
[Crossref] [Google Scholar]
[31]Aydoğdu Ş. Predicting student final performance using artificial neural networks in online learning environments. Education and Information Technologies. 2020; 25(3):1913-27.
[Crossref] [Google Scholar]
[32]Karlos S, Kostopoulos G, Kotsiantis S. Predicting and interpreting students’ grades in distance higher education through a semi-regression method. Applied Sciences. 2020; 10(23):1-19.
[Crossref] [Google Scholar]
[33]Iqbal MS, Luo B. Prediction of educational institution using predictive analytic techniques. Education and Information Technologies. 2019; 24(2):1469-83.
[Crossref] [Google Scholar]
[34]Zohair LM. Prediction of students performance by modelling small dataset size. International Journal of Educational Technology in Higher Education. 2019; 16(1):1-8.
[Crossref] [Google Scholar]
[35]Ma X, Zhou Z. Student pass rates prediction using optimized support vector machine and decision tree. In 8th annual computing and communication workshop and conference 2018 (pp. 209-15). IEEE.
[Crossref] [Google Scholar]
[36]Hashim AS, Awadh WA, Hamoud AK. Student performance prediction model based on supervised machine learning algorithms. In IOP conference series: materials science and engineering 2020 (pp. 1-19). IOP Publishing.
[Crossref] [Google Scholar]
[37]Hamsa H, Indiradevi S, Kizhakkethottam JJ. Student academic performance prediction model using decision tree and fuzzy genetic algorithm. Procedia Technology. 2016; 25:326-32.
[Crossref] [Google Scholar]
[38]Pandey M, Taruna S. Towards the integration of multiple classifier pertaining to the students performance prediction. Perspectives in Science. 2016; 8:364-6.
[Crossref] [Google Scholar]
[39]Badr G, Algobail A, Almutairi H, Almutery M. Predicting students performance in university courses: a case study and tool in KSU mathematics department. Procedia Computer Science. 2016; 82:80-9.
[Crossref] [Google Scholar]
[40]Akçapınar G, Altun A, Aşkar P. Using learning analytics to develop early-warning system for at-risk students. International Journal of Educational Technology in Higher Education. 2019; 16(1):1-20.
[Crossref] [Google Scholar]
[41]Hussain M, Zhu W, Zhang W, Abidi SM, Ali S. Using machine learning to predict student difficulties from learning session data. Artificial Intelligence Review. 2019; 52(1):381-407.
[Crossref] [Google Scholar]
[42]Zheng G, Fancsali SE, Ritter S, Berman S. Using instruction-embedded formative assessment to predict state summative test scores and achievement levels in mathematics. Journal of Learning Analytics. 2019; 6(2):153-74.
[Crossref] [Google Scholar]
[43]Rovira S, Puertas E, Igual L. Data-driven system to predict academic grades and dropout. PLoS One. 2017; 12(2).
[Google Scholar]
[44]Rodríguez-muñiz LJ, Bernardo AB, Esteban M, Díaz I. Dropout and transfer paths: what are the risky profiles when analyzing university persistence with machine learning techniques?. Plos One. 2019; 14(6):1-21.
[Google Scholar]
[45]Francis BK, Babu SS. Predicting academic performance of students using a hybrid data mining approach. Journal of Medical Systems. 2019; 43(6):1-5.
[Crossref] [Google Scholar]
[46]Aiken JM, De BR, Hjorth-jensen M, Caballero MD. Predicting time to graduation at a large enrollment American university. Plos One. 2020; 15(11).
[Crossref] [Google Scholar]
[47]Hussain M, Zhu W, Zhang W, Abidi SM. Student engagement predictions in an e-learning system and their impact on student course assessment scores. Computational Intelligence and Neuroscience. 2018:1-21.
[Crossref] [Google Scholar]
[48]Czibula G, Mihai A, Crivei LM. S PRAR: a novel relational association rule mining classification model applied for academic performance prediction. Procedia Computer Science. 2019; 159:20-9.
[Crossref] [Google Scholar]
[49]Matzavela V, Alepis E. Decision tree learning through a predictive model for student academic performance in intelligent M-learning environments. Computers and Education: Artificial Intelligence. 2021.
[Crossref] [Google Scholar]
[50]Viloria A, López JR, Leyva DM, Vargas-mercado C, Hernández-palma H, Llinas NO, et al. Data mining techniques and multivariate analysis to discover patterns in university final researches. Procedia Computer Science. 2019; 155:581-6.
[Crossref] [Google Scholar]
[51]Deng H, Wang X, Guo Z, Decker A, Duan X, Wang C, et al. Performancevis: visual analytics of student performance data from an introductory chemistry course. Visual Informatics. 2019; 3(4):166-76.
[Crossref] [Google Scholar]
[52]Çetinkaya A, Baykan ÖK. Prediction of middle school students programming talent using artificial neural networks. Engineering Science and Technology, an International Journal. 2020; 23(6):1301-7.
[Crossref] [Google Scholar]
[53]Mokhairi M, Nawang H, Wan SN. Analysis on students performance using naïve. Journal of Theoretical and Applied Information Technology. 2017; 31(16):3993-4000.
[Google Scholar]
[54]Hu H, Zhang G, Gao W, Wang M. Big data analytics for MOOC video watching behavior based on Spark. Neural Computing and Applications. 2020; 32(11):6481-9.
[Crossref] [Google Scholar]
[55]Slater S, Joksimović S, Kovanovic V, Baker RS, Gasevic D. Tools for educational data mining: a review. Journal of Educational and Behavioral Statistics. 2017; 42(1):85-106.
[Crossref] [Google Scholar]
[56]Breiman L. Random forests. Machine Learning. 2001; 45(1):5-32.
[Crossref] [Google Scholar]
[57]Yusuf A. Prediction of students’performance in E-learning environment using random forest. Doctoral Dissertation, University of Technology Malaysia.
[Google Scholar]
[58]Noble WS. What is a support vector machine?. Nature Biotechnology. 2006; 24(12):1565-7.
[Crossref] [Google Scholar]
[59]Gupta N. Artificial neural network. Network and Complex Systems. 2013; 3(1):24-8.
[Google Scholar]
[60]Hamoud A, Hashim AS, Awadh WA. Predicting student performance in higher education institutions using decision tree analysis. International Journal of Interactive Multimedia and Artificial Intelligence. 2018; 5:26-31.
[Google Scholar]
[61]Zulfiker MS, Kabir N, Biswas AA, Chakraborty P, Rahman MM. Predicting students performance of the private universities of Bangladesh using machine learning approaches. International Journal of Advanced Computer Science and Applications. 2020; 11(3):672-9.
[Google Scholar]
[62]Sivakumar S, Venkataraman S, Selvaraj R. Predictive modeling of student dropout indicators in educational data mining using improved decision tree. Indian Journal of Science and Technology. 2016; 9(4):1-5.
[Crossref] [Google Scholar]
[63]Rish I. An empirical study of the naive Bayes classifier. In IJCAI workshop on empirical methods in artificial intelligence 2001 (pp. 41-6).
[Google Scholar]
[64]Sokkhey P, Okazaki T. Hybrid machine learning algorithms for predicting academic performance. International Journal of Advanced Computer Science and Applications. 2020; 11(1):32-41.
[Google Scholar]
[65]Dole L, Rajurkar J. A Decision support system for predicting student performance. International Journal of Innovative Research in Computer and Communication Engineering. 2014; 2(12):7232-7.
[Crossref] [Google Scholar]
[66]Mohamad M, Makhtar M, Abd RMN. The reconstructed heterogeneity to enhance ensemble neural network for large data. In international conference on soft computing and data mining 2016 (pp. 447-55). Springer, Cham
[Crossref] [Google Scholar]