Design of a CNN–NLP Based Visual Chatbot for e-Commerce Systems
Main Article Content
Abstract
E-commerce platforms have limitations to assist interactive and highly personalized experiences using conventional text or voice-based chatbots, particularly in context-aware areas where product discovery and visual identification are necessary. Hence, we propose to integrate an image as a visual-based chatbot. We present a conceptual framework that combines Convolutional Neural Networks (CNN) for image recognition with Natural Language Processing (NLP) for dynamic, context-aware search in a chatbot. The CNN model enables the recognition of products from user-uploaded images, while the NLP component processes and generates appropriate responses to enhance the shopping experience by providing suggestions. This hybrid system will be developed using the Python programming language, which uses libraries such as Keras, OpenCV, and Flask. By allowing users to interact with a chatbot that uses visual inputs, this system aims to create a more intuitive, personalized e-commerce experience that could lead to high engagement and customer satisfaction. The evaluation metrics are measured in terms of the usage of this system by three users in real-world e-commerce applications. This small-scale test may offer insights into how image-based communication can revolutionize the online shopping experience by analysing usability, interaction efficiency, user satisfaction, engagement behaviour, and system responsiveness in practical scenarios during testing.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
All articles published in JIWE are licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) License. Readers are allowed to
- Share — copy and redistribute the material in any medium or format under the following conditions:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use;
- NonCommercial — You may not use the material for commercial purposes;
- NoDerivatives — If you remix, transform, or build upon the material, you may not distribute the modified material.
References
D. Patil, "Artificial intelligence in retail and e-commerce: enhancing customer experience through personalization, predictive analytics, and real-time engagement," Nov. 26, 2024. [Online]. Available: http://dx.doi.org/10.2139/ssrn.5057420.
J. Sidlauskiene, Y. Joye, and V. Auruskeviciene, “AI-based chatbots in conversational commerce and their effects on product and price perceptions,” Electronic Markets, vol. 33, no. 1, pp. 24, 2023, doi: 10.1007/s12525-023-00633-8.
R. Bansal, A. H. Ngah, A. Chakir, and N. Pruthi, “Leveraging ChatGPT and artificial intelligence for effective customer engagement,” in Advances in Human-Computer Interaction and Systems Design, IGI Global, Hershey, PA, USA, pp. 1–27, 2024, doi: 10.4018/979-8-3693-0815-8.
M. F. McTear, “The rise of the conversational interface: A new kid on the block?” in Future and Emerging Trends in Language Technology: Machine Learning and Big Data - 2nd International Workshop, FETLT 2016, Revised Selected Papers, J. F. Quesada, F. J. Martin Mateos, and T. Lopez-Soto, Eds. Cham, Switzerland: Springer Verlag, 2017, pp. 38–49, 2017, doi: 10.1007/978-3-319-69365-1_3.
D. Adiwardana, M. T. Luong, D. R. So, J. Hall, N. Fiedel, R. Thoppilan, and Q. V. Le, “Towards a human-like open-domain chatbot,” arXiv, arXiv:2001.09977, 2020.
R. Kaur, A. Kumar, and P. Singh, “SignBot: A multimodal sign language-enabled chatbot for the hearing impaired,” Internaltional. Journal of Human-Computer Interaction, vol. 37, no. 17, pp. 1592–1606, 2021.
J. Weizenbaum, “ELIZA—a computer program for the study of natural language communication between man and machine”, Communications of the ACM, vol. 9, no. 1, pp. 36-45, 1966, doi: 10.1145/365153.365168.
S. La Bua, “Latent semantic analysis and its application in conversational chatbots,” unpublished, 2015.
mySimon, MySimon price comparison website, 2006. [Online]. Available: https://web.archive.org
Inktomi Corp., Inktomi search engine, 2006. [Online]. Available: https://web.archive.org
J. Hauswald, M. Laurenzano, Y. Zhang, C. Li, A. Rovinski, A. Khurana et al., “Sirius: An open end-to-end voice and vision personal assistant and its implications for future warehouse scale computers”, ACM SIGARCH Computer Architecture News, vol. 43, no. 1, pp. 223-238, 2015, doi: 10.1145/2786763.2694347.
Google Goggles, “Google Goggles overview and requirements,” Google Inc., 2012. [Online]. Available: https://en.wikipedia.org/wiki/Google_Goggles
L. Rao, “Image recognition startup SnapTell acquired by Amazon subsidiary A9.com,” TechCrunch, Jun. 16, 2009. [Online]. Available: https://techcrunch.com/2009/06/16/image-recognition-startup-snaptell-acquired-by-amazon-subsidiary-a9com/
Nokia Point & Find, “Nokia Point & Find,” Wikipedia, the free encyclopedia, 2012. [Online]. Available: https://en.wikipedia.org/wiki/Nokia_Point_%26_Find
Slyce, “Slyce image recognition and barcode scanning,” Slyce Inc., 2013. [Online]. Available: https://en.wikipedia.org/wiki/Slyce
N. Zhang, “To improve image recognition and product recommendation in e-commerce systems,” in Proceedings of E-commerce conferences, 2024.
N. Goel, “Shopbot: An image-based search application for e-commerce domain,” Master's Projects, 516, 2017, doi: 10.31979/etd.r7a5-6dzf.
R. Dahal, S. Dhakal, R. Timalsina, and S. Neupane, “Re-commerce site with image processing and voice recognition,” in Innovations in Retail AI, 2024.
P. Badave, B. Bhomaj, B. Bindu, R. Shivarkar, and P. N. Dhavase, “E-commerce website with recommendation system including chatbot and reverse image search,” Ijraset Journal For Research in Applied Science and Engineering Technology, 2022, doi: 10.22214/ijraset.2022.46904.
M. Kuo, H.-T. Chan, and C.-H. Hsia, “Study on Mask R-CNN with data augmentation for retail product detection,” in 2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), Hualien City, Taiwan, pp. 1-2, 2021, doi: 10.1109/ISPACS51563.2021.9651028.
H. Wu, Y. Gao, X. Guo, Z. Al-Halah, S. Rennie, K. Grauman et al., “Fashion IQ: A new dataset towards retrieving images by natural language feedback", 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11302-11312, 2021, doi: 10.1109/cvpr46437.2021.01115.
D. Jain, E. M. Thazhathu, I. Adiraju, J. Bhattacharya, and D. Singh, “FashionAI: image-based clothing detection and shopping recommendation,” 2024 3rd International Conference for Innovation in Technology (INOCON), Bangalore, India, pp. 1-6, 2024, doi: 10.1109/INOCON60754.2024.10512219.
C. J. Li, “A personalized product recommendation system for e-commerce platforms based on artificial intelligence and image processing technologies”, Traitement Du Signal, vol. 41, no. 6, pp. 2961-2971, 2024, doi: 10.18280/ts.410615.
Z. B. Ter, P. Naveen, and J. Jayapradha, “Generative AI-based meal recommender system,” Journal of Informatics and Web Engineering, vol. 4, no. 2, pp. 20–35, 2025, doi: 10.33093/jiwe.2025.4.2.20.
Y.-X. Lim, S.-C. Haw, and J. Jayapradha, “Optimizing reviewer assignment with recommender systems: Models, related work, and evaluation,” International Journal on Robotics Automation and Sciences, vol. 7, no. 2, pp. 56-76, 2025, doi: 10.33093/ijoras.2025.7.2.6.
T. Natadirja, H. Akbar, G. Firmansyah, and B. Tjahjono, “E-commerce product image-based recommendation system Kalcare.com using deep learning,” Jurnal Indonesia Sosial Teknologi, vol. 4, no. 8, pp. 930–940, 2023, doi: 10.59141/jist.v4i8.669.
T. K. Hanchinal, V. D. Bhavani, and V. B. Mindolli, “Intelligent beauty product recommendation using deep learning,” in 2024 1st International Conference on Cognitive, Green and Ubiquitous Computing (IC-CGU), Bhubaneswar, India, pp. 1–5, 2024, doi: 10.1109/IC-CGU58078.2024.10530808.
A. Serban and F. Bota, “A conceptual framework for software fault prediction using neural networks,” in Simian, D., Stoica, L. (eds) Modelling and Development of Intelligent Systems. MDIS 2019. Communications in Computer and Information Science, vol 1126. Springer, Cham., vol. 1126, Cham, Switzerland: Springer, pp. 137–148, 2020, doi: 10.1007/978-3-030-39237-6_12.