Skip to content

PaPJoIN/supervised-learning-class-dropout

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

15 Commits
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Applying Supervised Learning to predict student dropout

Evaluating student demographics, engagement and performance metrics for predicting dropout likelihood using supervised learning techniques - Neural Networks & Decision Trees.

๐Ÿ”ธ Data

The data used in this project was divided into 3 stages, each representing the students' progression through their academic journey, starting with enrolment details, followed by incorporating engagement metrics and finally introducing subject gradings.

๐Ÿ”ธ Methodology

  • Performed data exploration and the necessary preprocessing steps, including encoding and scaling.
  • Identified the key correlations between variables.
  • Defined, compiled and trained the Neural Network models.
  • Performed hyperparameter optimisation using Keras Tuner to further refine accuracy.
  • Instantiated and trained the Extreme Gradient Boosting models.
  • Applied Grid & Random-Search to further tune the XGBoost models.
  • Calculated and compared the models' performances across a range of metrics [Accuracy, AUC, Recall & Precision].

๐Ÿ”ธ Tools

  • Python, NumPy, Pandas
  • Matplotlib, Seaborn, Scikit-learn
  • Keras-Tensorflow & Keras Tuner
  • XGboost & Sklearn's GridSearch Cross Validation

๐Ÿ”ธ Results & Findings

  • The conducted data exploration and analysis provided valuable insights into the factors influencing student dropout. The predictive performance of both XGBoost and Neural Network models progressively improved across the three stages of data inclusion, demonstrating the benefit of incorporating more detailed and performance-based student information (Figures below).
  • To support the goal of improving student retention, the following recommendations are proposed based on the observed patterns in the data:
    • Refined Candidate Vetting and tracking Risk profiles per intake: Conduct further refinement and incorporate predictive indicators identified in application data into the admissions process to better evaluate dropout risk and provide guided support.

    • Enhanced Student Engagement Monitoring: Integrate systems for continuous monitoring of student absences and engagement metrics and develop automated alerts to flag students exhibiting decreasing attendance.

    • Strengthened Academic Support triggers: Design and implement targeted programmes for addressing academic learning challenges, such as academic leads being alerted when students reach dropout-risk thresholds (e.g., >30 unauthorised absences, 1+ failed modules).

image image

About

Evaluating student demographics, engagement and performance metrics for predicting dropout likelihood using supervised learning - Neural Networks & Decision Trees.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors