User Behaviour Prediction in E-Commerce Using Logistic Regression
Main Article Content
Abstract
From a psychological perspective, human behaviour reflects underlying thoughts and decision-making patterns, for example, consumer behaviour may correlate with the purchase decisions. In the fast-evolving e-commerce industry, predicting user behaviour is essential for enhancing marketing strategies, improving customer experiences, and increasing sales. However, traditional heuristic (e.g. market basket analysis) approaches to analyse buyer behaviour are often rigid and fail to adapt to complex consumer interactions. This research work develops a predictive model that analyses user behaviour based on data such as historical purchasing patterns and demographic attributes. Based on a review of previous studies, Logistic Regression (LR) is utilized as the primary machine learning algorithm to estimate the likelihood of user performing specific actions including churning and conversion rate. The dataset undergoes preprocessing steps, including data cleaning, feature selection, and normalization, to enhance model accuracy. Evaluation metrics, including accuracy, confusion matrix, precision, recall and F1-Score are used to ensure the model’s performance is reliable and effective. Unlike traditional heuristic approaches, this data-driven method offers a scalable and adaptable solution for behaviour prediction. The findings of this research have the potential to revolutionize e-commerce by providing businesses with actionable insights into consumer behaviour. By leveraging predictive analytics, companies can implement targeted marketing campaigns, personalize recommendations, and improve customer retention strategies. Additionally, this study highlights the significance of behavioural modelling in detecting early signs of customer churn, allowing businesses to take proactive measures. Ultimately, this research contributes to the growing field of data-driven decision-making, offering a scalable and adaptable solution for understanding and predicting user behaviour in online shopping environments.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
All articles published in JIWE are licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) License. Readers are allowed to
- Share — copy and redistribute the material in any medium or format under the following conditions:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use;
- NonCommercial — You may not use the material for commercial purposes;
- NoDerivatives — If you remix, transform, or build upon the material, you may not distribute the modified material.
References
R. Ketipov, V. Angelova, L. Doukovska, and R. Schnalle, “Predicting user behavior in e-commerce using machine learning”, Cybernetics and Information Technologies, vol. 23, no. 3, p. 89-101, 2023, doi: 10.2478/cait-2023-0026
M. Romzi, N.A. Nabila, S.-C. Haw, W. E. Kong, H. A. Santoso, and G. K. Tong, “Generative AI recommender system in E-commerce,” International Journal on Advanced Science, Engineering and Information Technology, vol. 14, no. 6, pp. 1823–1835, Dec. 2024, doi: 10.18517/ijaseit.14.6.10509.
T.E. Tong, S.-C. Haw, K.W. Ng, M. Al-Tarawneh, and G.K. Tong, “Performance evaluation on resolution time prediction using machine learning techniques,” International Journal on Informatics Visualization, vol. 8, no. 2, pp. 583-590, May 2024, doi: 10.62527/joiv.8.2.2305.
X. Zhao and P. Keikhosrokiani, “Sales prediction and product recommendation model through user behavior analytics,” Computers, Materials and Continua, vol. 70, no. 2, pp. 3855-3874, 2022, doi: 10.32604/cmc.2022.019750.
J. Jayaram, S.-C. Haw, N. Palanichamy, E. Anaam, and S.K. Thillaigovindhan, “A systematic review on effectiveness and contributions of machine learning and deep learning methods in lung cancer diagnosis and classifications,” International Journal of Computing and Digital Systems, vol. 17, no. 1, pp. 1–12, Jan. 2025, doi: 10.12785/ijcds/1571032811.
Z.-B. Phang, S.-C. Haw, T.-E. Tai, and K.-W. Ng, “Interactive data visualization to optimize decision-making process,” in International Symposium on Parallel Computing and Distributed Systems (PCDS), IEEE, pp. 1–6, Sep. 2024, doi: 10.1109/PCDS61776.2024.10743427.
P. A. Sunarya, U. Rahardja, S. C. Chen, Y. M. Lic, and M. Hardini, “Deciphering digital social dynamics: A comparative study of logistic regression and random forest in predicting e-commerce customer behavior,” Journal of Applied Data Sciences, vol. 5, no. 1, pp. 100-113, 2024, doi: 10.47738/jads.v5i1.155.
N. Sharma, “Analyzing customer behavior patterns & predicting online product return intentions: a data mining approach,” Master’s Thesis, Faculty of Communication and Environment, Rhine-Waal University of Applied Sciences, Kleve, Germany, 2024.
C.J. Liu, T.S. Huang, P.T. Ho, J.C. Huang, and C.T. Hsieh, “Machine learning-based E-commerce platform repurchase customer prediction model,” PLoS One, vol. 15, no. 12, 2020, doi: 10.1371/journal.pone.0243105.
X. Hu, Y. Yang, S. Zhu and L. Chen, “Research on a Hybrid Prediction Model for Purchase Behavior Based on Logistic Regression and Support Vector Machine,” 2020 3rd International Conference on Artificial Intelligence and Big Data (ICAIBD), Chengdu, China, 2020, pp. 200-204, doi: 10.1109/ICAIBD49809.2020.9137484.
X. Chen, S. Ding, Y. Xiang, and L. Liu, “Research on prediction of online purchasing behavior based on hybrid model,” in Journal of Physics: Conference Series, 2021, vol. 1827, no. 1, 2021, doi: 10.1088/1742-6596/1827/1/012075.
S. Xiao and W. Tong, “Prediction of user consumption behavior data based on the combined model of TF-IDF and logistic regression,” in Journal of Physics: Conference Series, vol. 1757, no. 1, 2021, doi: 10.1088/1742-6596/1757/1/012089.
Y. Jiang, “Research on prediction of E-commerce repurchase behavior based on multiple fusion models,” Applied and Computational Engineering, vol. 2, no. 1, pp. 868-882, 2023, doi: 10.54254/2755-2721/2/20220555.
P. Rajapaksha, and D. Asanka, “Recommendations to increase the customer interaction of E-commerce applications with web usage mining,” in Proceedings of IEEE InC4 2023 - 2023 IEEE International Conference on Contemporary Computing and Communications, pp. 1-6, 2023, doi: 10.1109/InC457730.2023.10263131.
J. Guo, “Research on mobile E-commerce recommendation algorithms based on logistic regression improved model features,” Academic Journal of Engineering and Technology Science, vol. 7, no. 5, 2024, doi: 10.25236/AJETS.2024.070514.
L. Feng, “Online shopping consumer perception analysis and future network security service technology using logistic regression model,” PeerJ Computer Science, vol. 10, 2024, doi: 10.7717/peerj-cs.1777.
Y. Qiu, and C. Li, “Research on E-commerce user churn prediction based on logistic regression,” 2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chengdu, China, pp. 87-91, 2017, doi: 10.1109/ITNEC.2017.8284914.
A.R. Lubis, S. Prayudani, Julham, O. Nugroho, Y.Y. Lase, and M. Lubis, "Comparison of model in predicting customer churn based on users' habits on e-commerce", 2022 5th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI), pp. 300-305, 2022, https://doi.org/10.1109/isriti56927.2022.10052834.
C. Sunita, “Artificial Intelligence in E-commerce: Exploring the purchase decisions through logistic regression analysis,” Quing: International Journal of Commerce and Management, vol. 3, no. 3, pp. 301-309, 2023, doi: 10.54368/qijcm.3.3.0008.
O. Alkan and S. Unver, “Determination of factors that affect use of E-Commerce in Eastern Turkey through categorical data analysis,” Toros University FEASS Journal of Social Sciences, 8(Special Issue), pp. 22-36, 2021.
V. Malik, R. Mittal, and S.V. Singh, “EPR-ML: E-commerce product recommendation using NLP and machine learning algorithm,” in Proceedings of 5th International Conference on Contemporary Computing and Informatics (IC3I), pp. 1778-1783, 2022, doi: 10.1109/IC3I56241.2022.10073224.
K. Ariansyah, E.R.E. Sirait, B.A. Nugroho, and M. Suryanegara, “Drivers of and barriers to E-commerce adoption in Indonesia: Individuals’ perspectives and the implications,” Telecommunications Policy, vol. 45, no. 8, 2021, doi: 10.1016/j.telpol.2021.102219.
S. Unver, and O. Alkan, “Determinants of E-commerce use at different educational levels: Empirical evidence from Turkey E-commerce use at different educational levels,” International Journal of Advanced Computer Science and Applications, vol. 12, no. 3, 2021, doi: 10.14569/IJACSA.2021.0120305.
S.A. Alquhtani, and A. Muniasamy, “Analytics in Support of E-Commerce Systems Using Machine Learning,” 2022 International Conference on Electrical, Computer and Energy Technologies (ICECET), Prague, Czech Republic, pp. 1-5, 2022, doi: 10.1109/ICECET55527.2022.9872592.
Y. Li, J. Shi, F. Cao, and A. Cui, “Product Reviews Analysis of E-commerce Platform Based on Logistic-ARMA Model,” 2021 IEEE International Conference on Power, Intelligent Computing and Systems (ICPICS), Shenyang, China, pp. 714-717, 2021, doi: 10.1109/ICPICS52425.2021.9524238.
P. Kathuria, P. Sethi, and R. Negi, “Sentiment analysis on e-commerce reviews and ratings using ml & nlp models to understand consumer behavior”, 2022 International Conference on Recent Trends in Microelectronics, Automation, Computing and Communications Systems (ICMACC), 2022. https://doi.org/10.1109/icmacc54824.2022.10093674
S. Zulfiker, A. Chowdhury, D. Roy, S. Datta, and S. Momen, “Bangla E-Commerce Sentiment Analysis Using Machine Learning Approach,” 2022 4th International Conference on Sustainable Technologies for Industry 4.0 (STI), Dhaka, Bangladesh, pp. 1-5, 2022, doi: 10.1109/STI56238.2022.10103350.
I.A. Shah, “Comprehensive synthetic e-commerce dataset,” Kaggle. Accessed: Dec. 12, 2024. [Online]. Available: https://www.kaggle.com/datasets/imranalishahh/comprehensive-synthetic-e-commerce-dataset.