An Interpretable Cart-Based Framework for Multi-Target Educational Prediction Using Feature Selection and Model Pruning
Authors
Faculty of College of Informatics and Computing Sciences, Batangas State University, The National Engineering University Batangas City (Philippines)
Article Information
DOI: 10.47772/IJRISS.2026.10100316
Subject Category: Education
Volume/Issue: 10/1 | Page No: 4014-4059
Publication Timeline
Submitted: 2026-01-16
Accepted: 2026-01-23
Published: 2026-02-05
Abstract
The implementation of the K-12 Basic Education Program in the Philippines introduced Senior High School (SHS) as a critical stage where students must select an academic strand aligned with their interests, abilities and future career goals. This decision is particularly significant because it influences students’ academic readiness, motivation, and long-term educational outcomes. For Grade 10 learners, choosing among SHS strands such as Science, Technology, Engineering, and Mathematics (STEM), Accountancy, Business, and Management (ABM), Humanities and Social Sciences (HUMSS), and General Academic Strand (GAS) often occurs at a formative stage when self-awareness and academic guidance are still developing. Consequently, inaccurate or poorly informed strand choices may lead to academic difficulties, disengagement, or later program shifts, underscoring the importance of informed and evidence-based SHS decision making.
In recent years, educational institutions have increasingly explored the use of machine learning (ML) techniques to support academic advising and student performance prediction. ML-based decision-support systems offer the potential to analyse large volumes of student data and uncover patterns that may not be immediately apparent through traditional counselling approaches. However, despite their predictive power, many existing ML models particularly ensemble and deep learning methods operate as black-box systems. These models generate predictions without providing clear explanations of how decisions are made, limiting their suitability for educational contexts where transparency, accountability, and human oversight are essential.
Keywords
Interpretable, Cart-Based, Framework, Multi-Target, Educational, Prediction
Downloads
References
1. Asor, J. R., Catedrilla, G. M. B., Buama, C. A. C., et al. (2022). Prediction of Senior High School Students’ Performance in a State University: An Educational Data Mining Approach. International Journal of Innovate Engineering & Technology (IJIET), 13(6). Retrieved from https://www.ijiet.org/vol13/IJIET-V13N6-1888.pdf [Google Scholar] [Crossref]
2. Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Machine Learning, 46(1–3), 389–422. [Google Scholar] [Crossref]
3. https://doi.org/10.1023/A:1012487302797 [Google Scholar] [Crossref]
4. Loh, W.-Y. (2014). Fifty years of classification and regression trees. International Statistical Review, 82(3), 329–348. https://doi.org/10.1111/insr.12016 [Google Scholar] [Crossref]
5. Ravi, K. B. (2017). Cost-Complexity Pruning of Random Forests. arXiv. [Google Scholar] [Crossref]
6. https://arxiv.org/abs/1703.05430 [Google Scholar] [Crossref]
7. Romero, C., & Ventura, S. (2007). Educational data mining: A survey (1995–2005). Computers & Education, 51(1), 98–112. https://doi.org/10.1016/j.compedu.2007.05.004 [Google Scholar] [Crossref]
8. Romero, C., & Ventura, S. (2020). Educational data mining and learning analytics: An updated survey. In Learning Analytics in Education (pp. 1–42). [PDF]. Retrieved from https://bookdown.org/chen/la-manual/files/Romero%20and%20Ventura%20-%202020.pdf [Google Scholar] [Crossref]
9. Schmid, L., et al. (2023). Tree-based ensembles for multi-output regression. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2022.120146 [Google Scholar] [Crossref]
10. Vergara, J. R., & Estévez, P. A. (2013). A review of feature selection methods based on mutual information. Neural Computing and Applications, 24(1), 175–186. https://doi.org/10.1007/s00521-013-1368-0 [Google Scholar] [Crossref]
11. “Precision in Progress: Leveraging Data Mining Technique to Empower Career Path Selection for Incoming Senior High School Students.” (2024). International Journal of Research and Innovation in Social Science (IJRISS). Retrieved from https://rsisinternational.org/journals/ijriss/articles/precision-in-progress-leveraging-data-mining-technique-to-empower-career-path-selection-for-incoming-senior-high-school-students/ [Google Scholar] [Crossref]
12. Liu, H., & Park, J. (2024). Comparative analysis of decision tree algorithms for academic outcome prediction. Journal of Educational Data Mining, 16(2), 45–63. [Google Scholar] [Crossref]
13. Gonzales, R., & Lim, M. (2025). Evaluating interpretable decision trees for SHS strand recommendation. Computers & Education, 183, 104585. [Google Scholar] [Crossref]
14. Raman, S., & Das, A. (2023). Predicting at-risk students using CART and pruning techniques. Education and Information Technologies, 28, 1123–1142. [Google Scholar] [Crossref]
15. Kim, S., & Velasco, P. (2024). Hybrid feature selection methods for multi-output educational datasets. Expert Systems with Applications, 209, 118235. [Google Scholar] [Crossref]
16. Ravi, K. B. (2017). Cost-complexity pruning of Random Forests. arXiv:1703.05430. [Google Scholar] [Crossref]
17. Schmid, L., et al. (2023). Tree-based ensembles for multi-output regression. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2022.120146 [Google Scholar] [Crossref]
18. Romero, C., & Ventura, S. (2020). Educational data mining and learning analytics: An updated survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 10(3), e1355. [Google Scholar] [Crossref]
19. Simon, H. A. (2020). Rational decision-making in educational systems. Educational Research Review, 30, 100321. [Google Scholar] [Crossref]
20. Kumar, V., & Singh, K. (2021). Constructivist learning theory in data-driven education. Journal of Educational Technology Systems, 50(1), 3–21. [Google Scholar] [Crossref]
21. Li, J., Cheng, K., Wang, S., et al. (2020). Feature selection: A data perspective. ACM Computing Surveys, 52(5), 94. [Google Scholar] [Crossref]
22. Ahmed, M., & Elaraby, I. (2023). Hybrid feature selection methods for educational data mining. IEEE Access, 11, 33421–33435. [Google Scholar] [Crossref]
Metrics
Views & Downloads
Similar Articles
- Assessment of the Role of Artificial Intelligence in Repositioning TVET for Economic Development in Nigeria
- Teachers’ Use of Assure Model Instructional Design on Learners’ Problem Solving Efficacy in Secondary Schools in Bungoma County, Kenya
- “E-Booksan Ang Kaalaman”: Development, Validation, and Utilization of Electronic Book in Academic Performance of Grade 9 Students in Social Studies
- Analyzing EFL University Students’ Academic Speaking Skills Through Self-Recorded Video Presentation
- Major Findings of The Study on Total Quality Management in Teachers’ Education Institutions (TEIs) In Assam – An Evaluative Study