ENHANCING RETAIL FRAUD DETECTION WITH ISOLATION FORESTS AND AUTOENCODERS: OVERCOMING DATA LIMITATIONS AND REGULATORY CHALLENGES

Authors

  • Bhupendrasinh Thakre Walmart, USA Author

Keywords:

Retail Fraud Detection, Isolation Forest, Autoencoder, Feature Engineering, Regulatory Compliance

Abstract

 

Retail fraud results in significant financial losses for businesses worldwide, and the challenges are compounded by limited access to data and strict regulatory constraints. This article proposes a novel approach that combines isolation forests, autoencoders, and strategic feature engineering to enhance fraud detection in retail settings under these limitations. The proposed methodology leverages the strengths of isolation forests in detecting anomalies in high-dimensional data, autoencoders in learning normal transaction patterns, and feature engineering in capturing domain-specific insights. Experiments conducted on a real-world retail transaction dataset demonstrate the superior performance of the proposed approach compared to traditional anomaly detection techniques. The isolation forest and autoencoder models achieve high precision and recall, outperforming benchmark studies. The strategic feature engineering approach further enhances the models' ability to capture the intricacies of retail fraud. The proposed methodology offers a scalable and adaptable solution for retailers to combat fraud effectively while adhering to data limitations and regulatory requirements.

References

National Retail Federation. (2021). 2020 Organized Retail Crime Survey. Available: https://nrf.com/research/2020-organized-retail-crime-survey

A. Abdallah, M. A. Maarof, and A. Zainal, "Fraud detection system: A survey," Journal of Network and Computer Applications, vol. 68, pp. 90-113, 2016, doi: 10.1016/j.jnca.2016.04.007.

S. Bhattacharyya, S. Jha, K. Tharakunnel, and J. C. Westland, "Data mining for credit card fraud: A comparative study," Decision Support Systems, vol. 50, no. 3, pp. 602-613, 2011, doi: 10.1016/j.dss.2010.08.008.

Association of Certified Fraud Examiners. (2020). Report to the Nations: 2020 Global Study on Occupational Fraud and Abuse. Available: https://www.acfe.com/report-to-the-nations/2020/

European Parliament and Council of the European Union. (2016). Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation). Official Journal of the European Union, L119, 1-88.

S. Bhattacharyya, S. Jha, K. Tharakunnel, and J. C. Westland, "Data mining for credit card fraud: A comparative study," Decision Support Systems, vol. 50, no. 3, pp. 602-613, 2011, doi: 10.1016/j.dss.2010.08.008.

A. Dal Pozzolo, O. Caelen, Y. A. Le Borgne, S. Waterschoot, and G. Bontempi, "Learned lessons in credit card fraud detection from a practitioner perspective," Expert Systems with Applications, vol. 41, no. 10, pp. 4915-4928, 2014, doi: 10.1016/j.eswa.2014.02.026.

F. T. Liu, K. M. Ting, and Z. H. Zhou, "Isolation Forest," in 2008 Eighth IEEE International Conference on Data Mining, 2008, pp. 413-422, doi: 10.1109/ICDM.2008.17.

J. Weston, F. Ratle, H. Mobahi, and R. Collobert, "Deep learning via semi-supervised embedding," in Neural Networks: Tricks of the Trade, Springer, 2012, pp. 639-655.

F. T. Liu, K. M. Ting, and Z. H. Zhou, "Isolation Forest," in 2008 Eighth IEEE International Conference on Data Mining, 2008, pp. 413-422, doi: 10.1109/ICDM.2008.17.

S. Hariri, M. C. Kind, and R. J. Brunner, "Extended Isolation Forest," IEEE Transactions on Knowledge and Data Engineering, vol. 33, no. 4, pp. 1479-1489, 2021, doi: 10.1109/TKDE.2019.2947676.

S. Ghosh and D. L. Reilly, "Credit card fraud detection with a neural-network," in Proceedings of the 27th Annual Hawaii International Conference on System Sciences, 1994, pp. 621-630, doi: 10.1109/HICSS.1994.323314.

A. Dal Pozzolo, O. Caelen, Y. A. Le Borgne, S. Waterschoot, and G. Bontempi, "Learned lessons in credit card fraud detection from a practitioner perspective," Expert Systems with Applications, vol. 41, no. 10, pp. 4915-4928, 2014, doi: 10.1016/j.eswa.2014.02.026.

G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006, doi: 10.1126/science.1127647.

J. An and S. Cho, "Variational Autoencoder based Anomaly Detection using Reconstruction Probability," Special Lecture on IE, vol. 2, pp. 1-18, 2015.

M. Schreyer, T. Sattarov, D. Borth, A. Dengel, and B. Reimer, "Detection of anomalies in large scale accounting data using deep autoencoder networks," arXiv preprint arXiv:1709.05254, 2017.

V. Hanagandi, A. Dhar, and K. Buescher, "Density-based clustering and radial basis function modeling to generate credit card fraud scores," in Proceedings of the IEEE/IAFE Conference on Computational Intelligence for Financial Engineering, 1996, pp. 247-251, doi: 10.1109/CIFER.1996.501843.

A. Correa Bahnsen, D. Aouada, A. Stojanovic, and B. Ottersten, "Feature engineering strategies for credit card fraud detection," Expert Systems with Applications, vol. 51, pp. 134-142, 2016, doi: 10.1016/j.eswa.2015.12.030.

S. Bhattacharyya, S. Jha, K. Tharakunnel, and J. C. Westland, "Data mining for credit card fraud: A comparative study," Decision Support Systems, vol. 50, no. 3, pp. 602-613, 2011, doi: 10.1016/j.dss.2010.08.008.

A. Dal Pozzolo, G. Boracchi, O. Caelen, C. Alippi, and G. Bontempi, "Credit card fraud detection: A realistic modeling and a novel learning strategy," IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 8, pp. 3784-3797, 2018, doi: 10.1109/TNNLS.2017.2736643.

R. J. Bolton and D. J. Hand, "Statistical fraud detection: A review," Statistical Science, vol. 17, no. 3, pp. 235-249, 2002, doi: 10.1214/ss/1042727940.

A. Dal Pozzolo, O. Caelen, Y. A. Le Borgne, S. Waterschoot, and G. Bontempi, "Learned lessons in credit card fraud detection from a practitioner perspective," Expert Systems with Applications, vol. 41, no. 10, pp. 4915-4928, 2014, doi: 10.1016/j.eswa.2014.02.026.

S. Bhattacharyya, S. Jha, K. Tharakunnel, and J. C. Westland, "Data mining for credit card fraud: A comparative study," Decision Support Systems, vol. 50, no. 3, pp. 602-613, 2011, doi: 10.1016/j.dss.2010.08.008.

A. Pumsirirat and L. Yan, "Credit card fraud detection using deep learning based on auto-encoder and restricted Boltzmann machine," International Journal of Advanced Computer Science and Applications, vol. 9, no. 1, pp. 18-25, 2018, doi: 10.14569/IJACSA.2018.090103.

A. Roy, J. Sun, R. Mahoney, L. Alonzi, S. Adams, and P. Beling, "Deep learning detecting fraud in credit card transactions," in 2018 Systems and Information Engineering Design Symposium (SIEDS), 2018, pp. 129-134, doi: 10.1109/SIEDS.2018.8374722.

A. C. Bahnsen, D. Aouada, A. Stojanovic, and B. Ottersten, "Feature engineering strategies for credit card fraud detection," Expert Systems with Applications, vol. 51, pp. 134-142, 2016, doi: 10.1016/j.eswa.2015.12.030.

R. J. Bolton and D. J. Hand, "Statistical fraud detection: A review," Statistical Science, vol. 17, no. 3, pp. 235-249, 2002, doi: 10.1214/ss/1042727940.

S. Wang, C. Liu, X. Gao, H. Qu, and W. Xu, "Session-based fraud detection in online e-commerce transactions using recurrent neural networks," in Joint European Conference on Machine Learning and Knowledge Discovery in Databases, 2017, pp. 241-252, doi: 10.1007/978-3-319-71246-8_15.

F. T. Liu, K. M. Ting, and Z. H. Zhou, "Isolation Forest," in 2008 Eighth IEEE International Conference on Data Mining, 2008, pp. 413-422, doi: 10.1109/ICDM.2008.17.

Downloads

Published

2024-06-13