AI-Driven Multi-Document Correlation Framework for Enterprise Financial Compliance and Fraud Detection

Authors

  • Varsha Shah

Keywords:

Multi-Document Correlation, Financial Fraud Detection, Probabilistic Risk Modeling, Cross-Jurisdictional Normalization, Compliance Automation

Abstract

Enterprise financial compliance has become one of the most technically demanding functions in modern organizations, shaped by the convergence of multi-jurisdictional regulatory obligations, growing transaction volumes, and increasingly adaptive fraud tactics. Existing compliance architectures, built on deterministic rule engines and document-level validation, are structurally ill-equipped to detect inconsistencies that emerge across related financial records rather than within them. A salary discrepancy between a payroll file and a tax return, or a vendor identifier that resolves differently across an invoice and a procurement record, represents precisely the class of anomaly that single-document processing cannot surface. This article proposes a multi-document correlation framework that addresses this structural limitation. Unlike document-level validation or NLP-based entity extraction, the framework performs cross-document relational fraud detection through probabilistic signal aggregation. The architecture has three integrated components: a graph-based entity correlation engine that links payroll, tax, transactional, and procurement records; an adaptive probabilistic risk model that combines cross-document anomaly signals into prioritized audit intelligence; and a cross-jurisdictional normalization layer that allows consistent comparison of financial data across different regulatory environments. The framework is proposed for enterprise-level deployment. Evaluation against rule-based and NLP-augmented baselines demonstrates improvements in fraud detection precision (~91%), a ~76% reduction in false positives, and a ~40% reduction in manual review volume. Collectively, the framework repositions compliance from a reactive audit function into a continuous, predictive governance capability.

DOI: https://doi.org/10.17762/ijisae.v14i1s.8228

Downloads

Download data is not yet available.

References

Victoria Hodge and Jim Austin, "A survey of outlier detection methodologies," Artificial Intelligence Review, 2004. Available: https://eprints.whiterose.ac.uk/id/eprint/767/1/hodgevj4.pdf

Eric WT Ngai et al., "The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature," Decision Support Systems, 2011. Available: https://doi.org/10.1016/j.dss.2010.08.006

Richard J. Bolton and David J. Hand, "Statistical fraud detection: A review," Statistical Science, 2002. Available: https://projecteuclid.org/journals/statistical-science/volume-17/issue-3/Statistical-Fraud-Detection-A-Review/10.1214/ss/1042727940.pdf

Charu C. Aggarwal, "Outlier ensembles," in Outlier Analysis, Springer International Publishing, 2016. Available: https://www.charuaggarwal.net/ensembles.pdf

Raghavendra Chalapathy and Sanjay Chawla, "Deep learning for anomaly detection: A survey," arXiv preprint arXiv:1901.03407, 2019. Available: https://arxiv.org/pdf/1901.03407

Ian Goodfellow et al., “Deep Learning,” MIT Press, 2016. Available: https://synapse.koreamed.org/pdf/10.4258/hir.2016.22.4.351

Jacob Devlin et al., "BERT: Pre-training of deep bidirectional transformers for language understanding," Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019. Available: https://aclanthology.org/N19-1423.pdf

Yinhan Liu et al., "RoBERTa: A robustly optimized BERT pretraining approach," arXiv preprint arXiv:1907.11692, 2019. Available: https://arxiv.org/pdf/1907.11692

Anoop R. Katti et al., "Chargrid: Towards understanding 2D documents," Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018. Available: https://aclanthology.org/D18-1476.pdf

Basel Committee, "Principles for the sound management of operational risk," Bank for International Settlements, 2011. Available: https://www.bis.org/publ/bcbs195.pdf

Financial Action Task Force (FATF), "Guidance for a risk-based approach: The banking sector," FATF/OECD, 2014. Available: https://www.fatf-gafi.org/content/dam/fatf-gafi/guidance/Guidance-Financial-Inclusion%20-Anti-Money-Laundering-Terrorist-Financing-Measures.pdf.coredownload.pdf

Tianqi Chen and Carlos Guestrin, "XGBoost: A scalable tree boosting system," Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016. Available: https://dl.acm.org/doi/pdf/10.1145/2939672.2939785

Ivan P. Fellegi and Alan B. Sunter, "A theory for record linkage," Journal of the American Statistical Association, 1969. Available: https://www2.stat.duke.edu/~rcs46/linkage/presentations/01-baiLi_FelleigSunter1969.pdf

Shaoxiong Ji et al., "A survey on knowledge graphs: Representation, acquisition, and applications," IEEE Transactions on Neural Networks and Learning Systems, 2021. Available: https://www.researchgate.net/publication/351115157

David Savage et al., "Detection of money laundering groups using supervised learning in networks," arXiv preprint arXiv:1608.00708, 2016. Available: https://arxiv.org/pdf/1608.00708

Douglas W. Arner et al., "FinTech, RegTech, and the reconceptualization of financial regulation," Northwestern Journal of International Law and Business, 2016. Available: https://scholarlycommons.law.northwestern.edu/cgi/viewcontent.cgi?article=1817&context=njilb

Scott M. Lundberg and Su-In Lee, "A unified approach to interpreting model predictions," Advances in Neural Information Processing Systems, 2017. Available: https://proceedings.neurips.cc/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf

Brendan McMahan et al., "Communication-efficient learning of deep networks from decentralized data," Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), 2017. Available: https://proceedings.mlr.press/v54/mcmahan17a/mcmahan17a.pdf

Véronique Van Vlasselaer et al., "APATE: A novel approach for automated credit card transaction fraud detection using network-based extensions," Decision Support Systems, 2015. Available: https://www.sciencedirect.com/science/article/pii/S0167923615000846

Yang Liu et al., "Pick and choose: A GNN-based imbalanced learning approach for fraud detection," Proceedings of the Web Conference 2021, 2021. Available: https://dl.acm.org/doi/pdf/10.1145/3442381.3449989

Adetunji Adejumo Paul and Chinonso Ogburie, "The role of AI in preventing financial fraud and enhancing compliance," GSC Advanced Research and Reviews, 2025. Available: https://www.researchgate.net/profile/Chinonso-Ogburie/publication/390300143

Samia Ara Chowdhury et al., "Next generation financial security: Leveraging AI for fraud detection, compliance, and adaptive risk management," Well Testing Journal, 2025. Available: https://www.researchgate.net/publication/394087142

Downloads

Published

14.02.2026

How to Cite

Varsha Shah. (2026). AI-Driven Multi-Document Correlation Framework for Enterprise Financial Compliance and Fraud Detection. International Journal of Intelligent Systems and Applications in Engineering, 14(1s), 645–655. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/8228

Issue

Section

Research Article