An Interpretable Cart-Based Framework for Multi-Target Educational Prediction Using Feature Selection and Model Pruning

Authors

Arlene B. Laurel

Faculty of College of Informatics and Computing Sciences, Batangas State University, The National Engineering University Batangas City (Philippines)

Article Information

DOI: 10.47772/IJRISS.2026.10100316

Subject Category: Education

Volume/Issue: 10/1 | Page No: 4014-4059

Publication Timeline

Submitted: 2026-01-16

Accepted: 2026-01-23

Published: 2026-02-05

Abstract

The implementation of the K-12 Basic Education Program in the Philippines introduced Senior High School (SHS) as a critical stage where students must select an academic strand aligned with their interests, abilities and future career goals. This decision is particularly significant because it influences students’ academic readiness, motivation, and long-term educational outcomes. For Grade 10 learners, choosing among SHS strands such as Science, Technology, Engineering, and Mathematics (STEM), Accountancy, Business, and Management (ABM), Humanities and Social Sciences (HUMSS), and General Academic Strand (GAS) often occurs at a formative stage when self-awareness and academic guidance are still developing. Consequently, inaccurate or poorly informed strand choices may lead to academic difficulties, disengagement, or later program shifts, underscoring the importance of informed and evidence-based SHS decision making.
In recent years, educational institutions have increasingly explored the use of machine learning (ML) techniques to support academic advising and student performance prediction. ML-based decision-support systems offer the potential to analyse large volumes of student data and uncover patterns that may not be immediately apparent through traditional counselling approaches. However, despite their predictive power, many existing ML models particularly ensemble and deep learning methods operate as black-box systems. These models generate predictions without providing clear explanations of how decisions are made, limiting their suitability for educational contexts where transparency, accountability, and human oversight are essential.

Keywords

Interpretable, Cart-Based, Framework, Multi-Target, Educational, Prediction

Downloads

References

1. Asor, J. R., Catedrilla, G. M. B., Buama, C. A. C., et al. (2022). Prediction of Senior High School Students’ Performance in a State University: An Educational Data Mining Approach. International Journal of Innovate Engineering & Technology (IJIET), 13(6). Retrieved from https://www.ijiet.org/vol13/IJIET-V13N6-1888.pdf [Google Scholar] [Crossref]

2. Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Machine Learning, 46(1–3), 389–422. [Google Scholar] [Crossref]

3. https://doi.org/10.1023/A:1012487302797 [Google Scholar] [Crossref]

4. Loh, W.-Y. (2014). Fifty years of classification and regression trees. International Statistical Review, 82(3), 329–348. https://doi.org/10.1111/insr.12016 [Google Scholar] [Crossref]

5. Ravi, K. B. (2017). Cost-Complexity Pruning of Random Forests. arXiv. [Google Scholar] [Crossref]

6. https://arxiv.org/abs/1703.05430 [Google Scholar] [Crossref]

7. Romero, C., & Ventura, S. (2007). Educational data mining: A survey (1995–2005). Computers & Education, 51(1), 98–112. https://doi.org/10.1016/j.compedu.2007.05.004 [Google Scholar] [Crossref]

8. Romero, C., & Ventura, S. (2020). Educational data mining and learning analytics: An updated survey. In Learning Analytics in Education (pp. 1–42). [PDF]. Retrieved from https://bookdown.org/chen/la-manual/files/Romero%20and%20Ventura%20-%202020.pdf [Google Scholar] [Crossref]

9. Schmid, L., et al. (2023). Tree-based ensembles for multi-output regression. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2022.120146 [Google Scholar] [Crossref]

10. Vergara, J. R., & Estévez, P. A. (2013). A review of feature selection methods based on mutual information. Neural Computing and Applications, 24(1), 175–186. https://doi.org/10.1007/s00521-013-1368-0 [Google Scholar] [Crossref]

11. “Precision in Progress: Leveraging Data Mining Technique to Empower Career Path Selection for Incoming Senior High School Students.” (2024). International Journal of Research and Innovation in Social Science (IJRISS). Retrieved from https://rsisinternational.org/journals/ijriss/articles/precision-in-progress-leveraging-data-mining-technique-to-empower-career-path-selection-for-incoming-senior-high-school-students/ [Google Scholar] [Crossref]

12. Liu, H., & Park, J. (2024). Comparative analysis of decision tree algorithms for academic outcome prediction. Journal of Educational Data Mining, 16(2), 45–63. [Google Scholar] [Crossref]

13. Gonzales, R., & Lim, M. (2025). Evaluating interpretable decision trees for SHS strand recommendation. Computers & Education, 183, 104585. [Google Scholar] [Crossref]

14. Raman, S., & Das, A. (2023). Predicting at-risk students using CART and pruning techniques. Education and Information Technologies, 28, 1123–1142. [Google Scholar] [Crossref]

15. Kim, S., & Velasco, P. (2024). Hybrid feature selection methods for multi-output educational datasets. Expert Systems with Applications, 209, 118235. [Google Scholar] [Crossref]

16. Ravi, K. B. (2017). Cost-complexity pruning of Random Forests. arXiv:1703.05430. [Google Scholar] [Crossref]

17. Schmid, L., et al. (2023). Tree-based ensembles for multi-output regression. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2022.120146 [Google Scholar] [Crossref]

18. Romero, C., & Ventura, S. (2020). Educational data mining and learning analytics: An updated survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 10(3), e1355. [Google Scholar] [Crossref]

19. Simon, H. A. (2020). Rational decision-making in educational systems. Educational Research Review, 30, 100321. [Google Scholar] [Crossref]

20. Kumar, V., & Singh, K. (2021). Constructivist learning theory in data-driven education. Journal of Educational Technology Systems, 50(1), 3–21. [Google Scholar] [Crossref]

21. Li, J., Cheng, K., Wang, S., et al. (2020). Feature selection: A data perspective. ACM Computing Surveys, 52(5), 94. [Google Scholar] [Crossref]

22. Ahmed, M., & Elaraby, I. (2023). Hybrid feature selection methods for educational data mining. IEEE Access, 11, 33421–33435. [Google Scholar] [Crossref]

Metrics

Views & Downloads

Similar Articles