Enhancing Fraud Detection in Financial Transactions using LightGBM and Random Forest
Main Article Content
Abstract
In recent years, the frequency and complexity of financial fraud have been rising and have become an urgent challenge for the global financial system. Traditional rule-based detection methods struggle to cope with new types of fraud, especially in terms of real-time detection, generalization ability, and accuracy. To overcome these limitations, machine learning techniques have gradually emerged as a promising solution for identifying fraudulent transactions with better flexibility and scalability. Based on the publicly available European credit card fraud transaction dataset, this study proposes a hybrid model that combines the advantages of LightGBM and Random Forest, aiming to improve the accuracy, robustness, and interpretability of fraud detection. To handle the severe data imbalance problem (fraud cases accounting for only 0.17%), this study applies a RandomUnderSampling strategy and further enhances model performance through systematic parameter tuning using RandomizedSearchCV and decision threshold optimization. Stratified K-Fold cross-validation is also used to validate model stability. In addition, the model is compared with alternative resampling methods including SMOTE and ADASYN, and the results reaffirm the effectiveness and practicality of the undersampling approach. The final model achieves an overall accuracy of 99%, a recall of 86% on the fraud class, ROC-AUC of 0.9746, and PR-AUC of 0.6639. While the precision is relatively low (13%), it reflects a deliberate strategy of prioritizing fraud detection over false positives. This hybrid approach demonstrates a good balance between detection performance and practicality, offering better interpretability and lower computational cost compared to many deep learning models.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
All articles published in JIWE are licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) License. Readers are allowed to
- Share — copy and redistribute the material in any medium or format under the following conditions:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use;
- NonCommercial — You may not use the material for commercial purposes;
- NoDerivatives — If you remix, transform, or build upon the material, you may not distribute the modified material.
References
INTERPOL, “INTERPOL Financial Fraud assessment: A global threat boosted by technology,” INTERPOL. Accessed: Jul. 17, 2025. [Online]. Available: https://www.interpol.int/en/News-and-Events/News/2024/INTERPOL-Financial-Fraud-assessment-A-global-threat-boosted-by-technology?utm_source=chatgpt.com.
L. Levy, “Inside the billion dollar identity fraud ecosystem,” TechRadar Pro. Accessed: Jul. 16, 2025. [Online]. Available: https://www.techradar.com/pro/inside-the-billion-dollar-identity-fraud-ecosystem?utm_source=chatgpt.com.
O. L. Poidevin, “UN Report Urges Stronger Measures to Detect AI-Driven Deepfakes,” Reuters. Accessed: Jul. 17, 2025. [Online]. Available: https://www.reuters.com/business/un-report-urges-stronger-measures-detect-ai-driven-deepfakes-2025-07-11/?.
S. Lalchand, V. Srinivas, B. Maggiore, and J. Henderson, “Generative AI is Expected to Magnify the Risk of Deepfakes and Other Fraud in Banking,” Deloitte Insights. Accessed: Jul. 17, 2025. [Online]. Available: https://www.deloitte.com/us/en/insights/industry/financial-services/deepfake-banking-fraud-risk-on-the-rise.html.
N. Conte, “Visualizing Global Losses from Financial Scams.” Accessed: Jul. 17, 2025. [Online]. Available: https://www.visualcapitalist.com/global-losses-from-financial-scams/#google_vignette.
B. M. Naman, and A. M. Abdulazeez, “Credit card fraud detection based on machine learning classification algorithm”, Indonesian Journal of Computer Science, vol. 13, no. 3, 2024, doi: 10.33022/ijcs.v13i3.3996.
Y. Chen, C. Zhao, Y. Xu, and C. Nie, “Year-over-year developments in financial fraud detection via deep learning: a systematic literature review,” arXiv, 2025, doi: 10.48550/arXiv.2502.00201.
J. Xu, T. Yang, S. Zhuang, H. Li, and W. Lu, “AI-based financial transaction monitoring and fraud prevention with behaviour prediction”, Preprints, 2024, doi: 10.20944/preprints202407.1107.v1
S. Simaiya, U. K. Lilhore, S. K. Sharma, and N. K. Trivedi, “An efficient credit card fraud detection model based on machine learning methods,” International Journal of Advanced Science and Technology, vol. 29, no. 5, pp. 3414–3424, Jan. 2020.
P. Hajek, M. Abedin, and U. Sivarajah, “Fraud detection in mobile payment systems using an XGBoost-based framework”, Information Systems Frontiers, vol. 25, no. 5, pp. 1985-2003, 2022, doi: 10.1007/s10796-022-10346-6.
H. O. Bello, C. Idemudia, and T. V. Iyelolu, “Integrating machine learning and blockchain: Conceptual frameworks for real-time fraud detection and prevention,” World Journal of Advanced Research and Reviews, vol. 23, no. 1, pp. 056–068, Jul. 2024, doi: 10.30574/wjarr.2024.23.1.1985.
I. D. Mienye, and T. G. Swart, “A hybrid deep learning approach with Generative Adversarial Network for credit card fraud detection,” Technologies (Basel), vol. 12, no. 10, Oct. 2024, doi: 10.3390/technologies12100186.
V. C. Maheshwari, N. A. Osman, and N. Aziz, “A hybrid approach adopted for credit card fraud detection based on deep neural networks and attention mechanism,” Journal of Advanced Research in Applied Science and Engineering Technology, Sep. 2023, doi: 10.37934/araset.32.1.315331.
A. Dal Pozzolo, O. Caelen, R. A. Johnson, and G. Bontempi, “Credit Card Fraud Detection Dataset,” Kaggle.
N. Chawla, K. Bowyer, L. Hall, and W. Kegelmeyer, “SMOTE: Synthetic minority over-sampling technique”, Journal of Artificial Intelligence Research, vol. 16, pp. 321-357, 2002, doi: 10.1613/jair.953.
H. He, and E. A. Garcia, “Learning from imbalanced data," in IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 9, pp. 1263-1284, Sept. 2009, doi: 10.1109/TKDE.2008.239.
F. K. Alarfaj, I. Malik, H. U. Khan, N. Almusallam, M. Ramzan, and M. Ahmed, “Credit card fraud detection using state-of-the-art machine learning and deep learning algorithms,” in IEEE Access, vol. 10, pp. 39700-39715, 2022, doi: 10.1109/ACCESS.2022.3166891.
D. Varmedja, M. Karanovic, S. Sladojevic, M. Arsenovic, and A. Anderla, “Credit card fraud detection - machine learning methods,” 2019 18th International Symposium INFOTEH-JAHORINA (INFOTEH), East Sarajevo, Bosnia and Herzegovina, pp. 1-5, 2019, doi: 10.1109/INFOTEH.2019.8717766.
M. A. Talukder, M. Khalid, and M. A. Uddin, “An integrated multistage ensemble machine learning model for fraudulent transaction detection”, Journal of Big Data, vol. 11, no. 1, 2024, doi: 10.1186/s40537-024-00996-5.
Y. Wang, “A data balancing and ensemble learning approach for credit card fraud detection," 2025 4th International Symposium on Computer Applications and Information Technology (ISCAIT), Xi'an, China, pp. 386-390, 2025, doi: 10.1109/ISCAIT64916.2025.11010591.
V. V. K. Reddy, R. V. K. Reddy, M. S. K. Munaga, B. Karnam, S. K. Maddila, and C. S. Kolli, “Deep learning-based credit card fraud detection in federated learning”, Expert Systems With Applications, vol. 255, pp. 124493, 2024, doi: 10.1016/j.eswa.2024.124493.
D. Breskuviene, and G. Dzemyda, “Enhancing credit card fraud detection: highly imbalanced data case”, Journal of Big Data, vol. 11, no. 1, 2024, doi: 10.1186/s40537-024-01059-5.