Study on Classification Algorithms for Multi-relational Data Mining

Authors

Mahfuza Mallika

Lecturer in CSE, Centre for General Education, Bangladesh Islami University, Dhaka (Bangladesh)

Article Information

DOI: 10.51244/IJRSI.2025.1210000276

Subject Category: Education

Volume/Issue: 12/10 | Page No: 3179-3187

Publication Timeline

Submitted: 2025-10-30

Accepted: 2025-11-06

Published: 2025-11-19

Abstract

Data Mining (DM) constitutes a fundamental stage in the Knowledge Discovery in Databases (KDD) process, emphasizing the systematic analysis of large-scale datasets to uncover meaningful patterns, trends, and relationships. As data continues to grow in volume and complexity, the application of advanced analytical techniques has become essential for transforming raw data into actionable knowledge. DM employs a range of methods to address analytical challenges, including classification algorithms, association rule mining, and neural network approaches. This study investigates the effectiveness of various classification algorithms namely: ID3, C4.5, J48 and the general Decision Tree methodology in solving classification problems within data mining tasks. A benchmark dataset from the UCI Machine Learning Repository is utilized to illustrate the practical application of these algorithms. The WEKA software tool is employed for data preprocessing, model development, and performance evaluation through metrics such as accuracy and predictive power. The experimental results highlight the capability of classification techniques to categorize data points efficiently and extract valuable insights. Overall, the study underscores the critical role of classification-based data mining techniques in enhancing knowledge discovery and supporting informed decision-making across diverse domains.

Keywords

Classification and Classification Algorithm, Data Mining Techniques, Decision Tree, Weka, Application of J48 algorithm, ID3 Algorithm, C4.5 Algorithm, MLP, DRDM, UCI.

Downloads

References

1. Agrawal, R., Imielinski, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. ACM SIGMOD Record, 22(2), 207–216. [Google Scholar] [Crossref]

2. Bishop, C. M. (2006). Pattern recognition and machine learning. Springer. [Google Scholar] [Crossref]

3. Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees. CRC Press. [Google Scholar] [Crossref]

4. Centers for Disease Control and Prevention. (2022). Hepatitis: Overview and statistics. Retrieved from https://www.cdc.gov/hepatitis [Google Scholar] [Crossref]

5. Dua, D., & Graff, C. (2019). UCI machine learning repository. University of California, Irvine. Retrieved from https://archive.ics.uci.edu [Google Scholar] [Crossref]

6. Džeroski, S. (2003). Multi-relational data mining: An introduction. ACM SIGKDD Explorations Newsletter, 5(1), 1–16. [Google Scholar] [Crossref]

7. Han, J., Kamber, M., & Pei, J. (2011). Data mining: Concepts and techniques (3rd ed.). Morgan Kaufmann. [Google Scholar] [Crossref]

8. Han, J., Kamber, M., & Pei, J. (2012). Data mining: Concepts and techniques. Elsevier. [Google Scholar] [Crossref]

9. Habrard, A., Bernard, M., &Jacquenet, F. (2003, October). Multi-relational data mining in medical databases. In Conference on Artificial Intelligence in Medicine in Europe (pp. 365–374). Springer Berlin Heidelberg. [Google Scholar] [Crossref]

10. Haykin, S. (1999). Neural networks: A comprehensive foundation (2nd ed.). Prentice Hall. [Google Scholar] [Crossref]

11. Kaczmarek, B. L. J., &Knobbe, A. J. (2006). Multi-relational data mining (Vol. 145). IOS Press. [Google Scholar] [Crossref]

12. Kanodia, J. (2005). Structural advances for pattern discovery in multi-relational databases. [Google Scholar] [Crossref]

13. Kantardzic, M. (2019). Data mining: Concepts, models, methods, and algorithms. Wiley. [Google Scholar] [Crossref]

14. Knobbe, A., Blockeel, H., &Siebes, A. (1999). Multi-relational decision tree induction. In Proceedings of the Third European Symposium on Principles of Data Mining and Knowledge Discovery (pp. 1–12). [Google Scholar] [Crossref]

15. Morariu, D., Crețulescu, R., &Breazu, M. (2017). The WEKA multilayer perceptron classifier. International Journal of Advanced Statistics and IT&C for Economics and Life Sciences, 7(1), 1–7. [Google Scholar] [Crossref]

16. Murtagh, F. (1991). Multilayer perceptrons for classification and regression. Neurocomputing, 2(5–6), 183–197. [Google Scholar] [Crossref]

17. Padhy, N., &Panigrahi, R. (2012). Multi-relational data mining approaches: A data mining technique. arXiv preprint arXiv:1211.3871. [Google Scholar] [Crossref]

18. Pal, S. K., & Mitra, S. (1992). Multilayer perceptron, fuzzy sets, and classification. IEEE Transactions on Neural Networks, 3(5), 683–697. [Google Scholar] [Crossref]

19. Quinlan, J. R. (1986). Introduction to decision trees. Machine Learning, 1(1), 81–106. [Google Scholar] [Crossref]

20. Quinlan, J. R. (1993). C4.5: Programs for machine learning. Morgan Kaufmann. [Google Scholar] [Crossref]

21. Singhal, S., & Jena, M. (2013). A study on WEKA tool for data preprocessing, classification and clustering. International Journal of Innovative Technology and Exploring Engineering (IJITEE), 2(6), 250–253. [Google Scholar] [Crossref]

22. Tan, P. N., Steinbach, M., & Kumar, V. (2019). Introduction to data mining (2nd ed.). Pearson. [Google Scholar] [Crossref]

23. UCI Machine Learning Repository. (n.d.). Hepatitis dataset. University of California, Irvine. Retrieved from https://archive.ics.uci.edu [Google Scholar] [Crossref]

24. World Health Organization. (2023). Hepatitis. Retrieved from https://www.who.int/health-topics/hepatitis [Google Scholar] [Crossref]

25. Zhang, S., Li, X., & Yang, J. (2019). Multi-relational data mining and neural network integration for efficient knowledge discovery. International Journal of Data Mining and Knowledge Management Process, 9(2), 15–27. [Google Scholar] [Crossref]

Metrics

Views & Downloads

Similar Articles