TY - GEN
T1 - Insights in Learners’ Behaviour and Early Dropout Detection based on Coursera MOOCs
AU - Frank, Sarah
AU - Perez-Sanagustin, Mar
AU - Hilliger, Isabel
AU - Gütl, Christian
PY - 2021
Y1 - 2021
N2 - Increased utilization of distance learning makes the creation of effective learning tools ever more important. However, while Massive Open Online Courses (MOOCs) are a popular online learning tool, they often suffer from high user attrition. This paper investigated this effect for 6 different MOOCs with more than 35 thousand users, using AdaBoosted decision trees to create a model which was then used for the prediction of dropouts, as well as the calculation of feature importance scores. The resulting model generally scored high accuracy scores which were plotted for each course, and feature importance scores were especially high for the features encompassing the number of user requests, the user's total active time and the average time between clicks. Furthermore, the paper explored the results of the inclusion of forum data in this setting. While the forum features did not lead to a major increase in model accuracy, there was a statistical difference between the number of forum interactions for those who completed the MOOCs and those who did not, which opens up possibilities for future research into utilization of forum interaction data, and forums in MOOCs in general.
AB - Increased utilization of distance learning makes the creation of effective learning tools ever more important. However, while Massive Open Online Courses (MOOCs) are a popular online learning tool, they often suffer from high user attrition. This paper investigated this effect for 6 different MOOCs with more than 35 thousand users, using AdaBoosted decision trees to create a model which was then used for the prediction of dropouts, as well as the calculation of feature importance scores. The resulting model generally scored high accuracy scores which were plotted for each course, and feature importance scores were especially high for the features encompassing the number of user requests, the user's total active time and the average time between clicks. Furthermore, the paper explored the results of the inclusion of forum data in this setting. While the forum features did not lead to a major increase in model accuracy, there was a statistical difference between the number of forum interactions for those who completed the MOOCs and those who did not, which opens up possibilities for future research into utilization of forum interaction data, and forums in MOOCs in general.
KW - Coursera
KW - Dropout Detection
KW - MOOC
UR - http://www.scopus.com/inward/record.url?scp=85130095492&partnerID=8YFLogxK
U2 - 10.1109/ITHET50392.2021.9759784
DO - 10.1109/ITHET50392.2021.9759784
M3 - Conference paper
T3 - 2021 19th International Conference on Information Technology Based Higher Education and Training, ITHET 2021
BT - 2021 19th International Conference on Information Technology Based Higher Education and Training, ITHET 2021
T2 - 19th International Conference on Information Technology Based Higher Education and Training
Y2 - 4 November 2021 through 6 November 2021
ER -