Copy the page URI to the clipboard
Hlosta, Martin; Zdrahal, Zdenek and Zendulka, Jaroslav
(2017).
DOI: https://doi.org/10.1145/3027385.3027449
Abstract
This paper focuses on the problem of identifying students, who are at risk of failing their course. The presented method proposes a solution in the absence of data from previous courses, which are usually used for training machine learning models. This situation typically occurs in new courses. We present the concept of a "self-learner" that builds the machine learning models from the data generated during the current course. The approach utilises information about already submitted assessments, which introduces the problem of imbalanced data for training and testing the classification models.
There are three main contributions of this paper: (1) the concept of training the models for identifying at-risk students using data from the current course, (2) specifying the problem as a classification task, and (3) tackling the challenge of imbalanced data, which appears both in training and testing data.
The results show the comparison with the traditional approach of learning the models from the legacy course data, validating the proposed concept.