From Segmentation to Prediction: A Unified RFM–Machine Learning Model for Online Retail Analytics
Authors
Department of Computer Science, The Federal Polytechnic Ado Ekiti, Ekiti State (Nigeria)
Department of Computer Science, The Federal Polytechnic Ado Ekiti, Ekiti State (Nigeria)
Article Information
DOI: 10.51584/IJRIAS.2025.101100117
Subject Category: Social science
Volume/Issue: 10/11 | Page No: 1267-1280
Publication Timeline
Submitted: 2025-12-04
Accepted: 2025-12-11
Published: 2025-12-23
Abstract
Customer analytics features in the digital retail age that provides an understanding of the buying behaviour and makes improvements in marketing practice. As e-commerce platforms have massive volumes of transaction data, predictive analytics can provide an effective customer segmentation and future behavioural prediction methodology in doing business. Most online retailers still do not have a means to identify high-value customers correctly or forecast repeat buy. Even though they have access to detailed transaction logs, they cannot use the information effectively. The lack of this insight can easily lead to ineffective targeting and thus unreached revenue potential. In this paper, the researcher tries to study purchase behaviour segmentation with the help of RFM (Recency, Frequency, Monetary) and employ machine learning capabilities to learn customer purchasing behaviour. Using the Online Retail dataset, this study uses data preprocessing, customer segmentation based on RFM scoring, Logistic Regression and Random Forest models to predict the future purchasing behaviour. In the analysis, the frequency and monetary score of customers provide a strong indication of the ability of customers to make repeat purchases. Of all the classifier models tested, Random Forest was found to be more effective and accurate than the precision of other models confirming its suitability in loud customer tests. The findings are beneficial to e-commerce websites that wish to maximize on the strategy of personalized marketing and management of customers. The opportunity to increase predictive accuracy even more is to address hybrid models and real-time segmentation in the future studies.
Keywords
RFM segmentation, predictive analytics, customer behavior
Downloads
References
1. Adekiigbe, A., Bakar, K. A., & Simeon, O. O. (2011). A review of cluster-based congestion control protocols in wireless mesh networks. International Journal of Computer Science Issues (IJCSI), 8(4), 42. [Google Scholar] [Crossref]
2. Ajibade, S.-S. M., Jasser, M. B., Alebiosu, D. O., Issa, B., San, L. W., & ALDharhani, G. S. (2025). A Comparative Analysis of Detection Methods for Phishing Websites Using Machine Learning. 2025 IEEE International Conference on Automatic Control and Intelligent Systems (I2CACIS), [Google Scholar] [Crossref]
3. Ajibade, S.-S. M., Jasser, M. B., San, L. W., Achick-Muyu, V. A., Moorthy, U., & VE, S. (2025). Intelligent Obesity Pattern Prediction using Machine Learning: Applications in Automated Health Analytics. 2025 8th International Conference on Computing Methodologies and Communication (ICCMC), [Google Scholar] [Crossref]
4. Akande, O. N., Akande, H. B., Asani, E. O., & Dautare, B. T. (2024). Customer Segmentation through RFM Analysis and K-means Clustering: Leveraging Data-Driven Insights for Effective Marketing Strategy. 2024 International Conference on Science, Engineering and Business for Driving Sustainable Development Goals (SEB4SDG), [Google Scholar] [Crossref]
5. Akter, J., Roy, A., Rahman, S., Mohona, S., & Ara, J. (2025). Artificial intelligence-driven customer lifetime value (CLV) forecasting: Integrating RFM analysis with machine learning for strategic customer retention. Journal of Computer Science and Technology Studies, 7(1), 249-257. [Google Scholar] [Crossref]
6. Anand, B., Chakravarty, H., Athalye, M. S. G., Varalaxmi, P., & Mishra, A. K. (2023). Understanding consumer behaviour in the digital age: a study of online shopping habits. Shodha Prabha (UGC Care J.), 48(3), 84-93. [Google Scholar] [Crossref]
7. Bukhari, T. T., Oladimeji, O., Etim, E. D., & Ajayi, J. O. (2022). Customer Lifetime Value Prediction Using Gradient Boosting Machines. Gyanshauryam, International Scientific Refereed Research Journal, 5(4), 488-506. [Google Scholar] [Crossref]
8. Chaudhry, M., Shafi, I., Mahnoor, M., Vargas, D. L. R., Thompson, E. B., & Ashraf, I. (2023). A systematic literature review on identifying patterns using unsupervised clustering algorithms: A data mining perspective. Symmetry, 15(9), 1679. [Google Scholar] [Crossref]
9. Chen, A. H.-L., & Gunawan, S. (2023). Enhancing retail transactions: a data-driven recommendation using modified RFM analysis and association rules mining. Applied Sciences, 13(18), 10057. [Google Scholar] [Crossref]
10. Hooshmand Pakdel, G., He, Y., & Chen, X. (2025). Predicting customer demand with deep learning: An LSTM-based approach incorporating customer information. International Journal of Production Research, 1-13. [Google Scholar] [Crossref]
11. Joung, J., & Kim, H. (2023). Interpretable machine learning-based approach for customer segmentation for new product development from online product reviews. International Journal of Information Management, 70, 102641. [Google Scholar] [Crossref]
12. Nasir, F., Ahmed, A. A., Kiraz, M. S., Yevseyeva, I., & Saif, M. (2024). Data-Driven Decision-Making for Bank Target Marketing Using Supervised Learning Classifiers on Imbalanced Big Data. Computers, Materials & Continua, 81(1). [Google Scholar] [Crossref]
13. Ogunnusi, O. S., Abd Razak, S., & Abdullah, A. H. (2015a). A LIGHTWEIGHT ONE-PASS AUTHENTICATION MECHANISM FOR AGENT COMMUNICATION IN MULTI-AGENT SYSTEM BASED APPLICATIONS. Jurnal Teknologi (Sciences & Engineering), 77(18). [Google Scholar] [Crossref]
14. Ogunnusi, O. S., Abd Razak, S., & Abdullah, A. H. (2015b). A THRESHOLD-BASED CONTROLLER FOR MULTI-AGENT SYSTEMS. Jurnal Teknologi (Sciences & Engineering), 77(18). [Google Scholar] [Crossref]
15. Ogunnusi, O. S., Razak, S., & Adu, M. K. (2018). An approach to secure mobile agent communication in multi-agent systems. Int J Comput Inf Eng, 12, 743-747. [Google Scholar] [Crossref]
16. Paramasivan, A. (2024). Harnessing AI for Behavioral Insights Unlocking the Potential of Transactional Data. IJLRP-International Journal of Leading Research Publication, 5(10). [Google Scholar] [Crossref]
17. Rajesh, M., Rao Chintalapudi, S., & Krishna Prasad, M. (2024). Data-Driven Decisions: Empowering E-Commerce with RFM and Machine Learning-Based Customer Segmentation. International Conference on Intelligent Computing and Communication, [Google Scholar] [Crossref]
18. Sagar, S. (2024). The impact of digital transformation on retail management and consumer behavior. Journal of Business and Management, 26(1), 06-14. [Google Scholar] [Crossref]
19. Şentürk, H., Geçici, E., & Alp, S. (2024). Customer segmentation with clustering methods in the retail industry. İstanbul Aydın Üniversitesi Sosyal Bilimler Dergisi, 16(4), 551-573. [Google Scholar] [Crossref]
20. Shirole, R., Salokhe, L., & Jadhav, S. (2021). Customer segmentation using rfm model and k-means clustering. Int. J. Sci. Res. Sci. Technol, 8(3), 591-597. [Google Scholar] [Crossref]
21. Smith, A. D., & Rupp, W. T. (2003). Strategic online customer decision making: leveraging the transformational power of the Internet. Online information review, 27(6), 418-432. [Google Scholar] [Crossref]
22. Sudha, M., & Mehertaj, S. (2025). Machine Learning For Enhanced Digital Marketing: Strategies, Models, and Interpretability. International Journal of Knowledge Exploration in Computational Intelligence, 1(1), 15-21. [Google Scholar] [Crossref]
23. Suguna, R., Suriya Prakash, J., Aditya Pai, H., Mahesh, T., Vinoth Kumar, V., & Yimer, T. E. (2025). Mitigating class imbalance in churn prediction with ensemble methods and SMOTE. Scientific reports, 15(1), 16256. [Google Scholar] [Crossref]
24. Tabianan, K., Velu, S., & Ravi, V. (2022). K-means clustering approach for intelligent customer segmentation using customer purchase behavior data. Sustainability, 14(12), 7243. [Google Scholar] [Crossref]
25. Theodorakopoulos, L., & Theodoropoulou, A. (2024). Leveraging big data analytics for understanding consumer behavior in digital marketing: A systematic review. Human Behavior and Emerging Technologies, 2024(1), 3641502. [Google Scholar] [Crossref]
26. Usman-Hamza, F. E., Balogun, A. O., Capretz, L. F., Mojeed, H. A., Mahamad, S., Salihu, S. A., Akintola, A. G., Basri, S., Amosa, R. T., & Salahdeen, N. K. (2022). Intelligent decision forest models for customer churn prediction. Applied Sciences, 12(16), 8270. [Google Scholar] [Crossref]
27. Yap, J. J., Yong, Y. L., Jasser, M. B., Ajibade, S.-S. M., & Al-Hadi, I. A. A.-Q. (2024). Improving object detection in videos: a comprehensive evaluation of faster R-CNN employed in partial occlusion handling. 2024 20th IEEE International Colloquium on Signal Processing & Its Applications (CSPA), [Google Scholar] [Crossref]
28. Zaghloul, M., Barakat, S., & Rezk, A. (2025). Enhancing customer retention in Online Retail through churn prediction: A hybrid RFM, K-means, and deep neural network approach. Expert Systems with Applications, 290, 128465. [Google Scholar] [Crossref]
29. Zhu, X., Zhang, F., & Li, H. (2022). Swarm deep reinforcement learning for robotic manipulation. Procedia computer science, 198, 472-479. [Google Scholar] [Crossref]
Metrics
Views & Downloads
Similar Articles
- The Impact of Ownership Structure on Dividend Payout Policy of Listed Plantation Companies in Sri Lanka
- Urban Sustainability in North-East India: A Study through the lens of NER-SDG index
- Performance Assessment of Predictive Forecasting Techniques for Enhancing Hospital Supply Chain Efficiency in Healthcare Logistics
- The Fractured Self in Julian Barnes' Postmodern Fiction: Identity Crisis and Deflation in Metroland and the Sense of an Ending
- Impact of Flood on the Employment, Labour Productivity and Migration of Agricultural Labour in North Bihar