MAB-scriptieprijs |
|
Corresponding author: Long Le ( ledoanhlong3@gmail.com ) Academic editor: Ralph ter Hoeven
© 2024 Long Le, Eric Mantelaers .
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits to copy and distribute the article for non-commercial purposes, provided that the article is not altered or modified and the original author and source are credited.
Citation:
Le L, Mantelaers E (2024) Benford’s Law and Beyond: A framework for auditors. Maandblad voor Accountancy en Bedrijfseconomie 98(7): 427-438. https://doi.org/10.5117/mab.98.134061
|
We propose a systematic framework that integrates Benford’s Law – a mathematical principle predicting the frequency distribution of leading digits – with advanced statistical and machine learning techniques for enhanced anomaly detection in financial auditing, especially highlighting actionable next steps for implementation. This framework combines Benford’s Law with K-means clustering and multi-digit analysis (First-two and First-three digits) to effectively distinguish between errors, benign anomalies, and fraudulent activities. Empirical validation on financial transaction data demonstrates significant improvements in fraud detection accuracy and reliability, offering practical insights and guidance for auditors on adopting a more robust approach to anomaly detection in modern auditing practices.
Benford’s Law, Fraud Detection, K-Mean Clustering, Multiple Digits Analyses, Financial Auditing
This framework equips auditors with an advanced toolset, integrating Benford’s Law with statistical and machine learning techniques to enhance anomaly detection accuracy. Beyond initial detection, it offers actionable insights and practical guidance, empowering auditors to reliably address financial irregularities and improve the precision of financial reporting.
In today’s globalized economy, financial markets are crucial for capital flow, risk management, and supporting business operations. Yet, the increasing complexity of transactions and the massive data generated by digitized systems have made fraud detection and ensuring the accuracy of financial statements more challenging (
As financial data becomes more complex, auditors need advanced tools for effective anomaly detection. Benford’s Law has emerged as a key tool, leveraging statistical principles to predict leading digit frequencies in naturally occurring datasets. In such datasets, the number 1 appears as the leading digit roughly 30% of the time – far more than the expected 11.1% if digits were uniformly distributed (
However, detecting anomalies is only the first step. Auditors must determine whether deviations result from simple errors, like data entry mistakes, or more serious fraud. The increasing sophistication of fraud schemes requires a more refined approach to post-anomaly detection, as fraudsters find ways to manipulate data that conform to traditional models.
This paper aims to bridge the gap by developing a systematic framework for auditors to follow after identifying anomalies using Benford’s Law. By integrating advanced statistical and machine learning techniques like K-means clustering and multi-digit analysis, the framework aims to improve the accuracy of fraud detection. Despite the extensive use of Benford’s Law, there is limited guidance on subsequent investigative procedures. This article addresses this gap by providing auditors with a clear set of steps to distinguish between errors and fraud more accurately.
As the auditing profession evolves with technological advancements, integrating modern tools such as Benford’s Law into standard practices has become increasingly important (
“What auditing procedures should auditors perform after identifying possible anomalies using Benford’s Law?”
The paper is structured as follows: the introduction has outlined the role of Benford’s Law and the need for systematic post-detection procedures. The literature review in section 2 examines current methodologies, while the methodology section (section 3) details the approach used in this study. Section 4 (findings) and section 5 (discussion) present the results and analyze their implications for auditors. Section 6 (conclusion) summarizes key insights, discusses study limitations, and suggests future research directions.
By focusing on post-anomaly detection actions, this paper provides practical guidance and a robust framework to enhance the effectiveness of financial audits. It contributes to the field of fraud detection and anomaly analysis, offering a practical solution tailored to the complexities of today’s financial environment.
Integrating Benford’s Law into auditing has significantly advanced the detection of anomalies and potential fraud. Auditors use Benford’s Law as a diagnostic tool to identify irregularities in financial records, such as income statements and balance sheets, which may suggest fraudulent activities or errors (
Beyond the First Leading Digit Test, Benford’s Law includes further methods like the Second-Digit Test, First-Two Digit Test, and Last-Two Digit Test, adding depth to anomaly detection (
While these tests can detect subtle irregularities, they have limitations. The First-Two Digit Test requires large, diverse datasets and statistical expertise, which may challenge smaller audit practices (
The International Auditing and Assurance Standards Board (
Machine learning (ML) and artificial intelligence (AI) have significantly expanded the scope of anomaly detection in auditing.
Challenges in using ML for fraud detection include training data imbalances, where fraudulent cases are far fewer than legitimate transactions, leading to less accurate models (
Despite the widespread use of Benford’s Law for anomaly detection in auditing, significant gaps remain in guiding auditors on subsequent investigative procedures and integrating advanced analytical tools (
Auditors’ methods range from traditional manual audits to complex machine learning techniques (
The analysis of existing research emphasizes Benford’s Law’s potential as a cost-effective and versatile tool for detecting irregularities across various datasets. Yet, there is no consensus on best practices for investigating anomalies post-detection, with significant variation across studies. This highlights the need for standardized guidelines that align with the complexities of modern financial systems. Furthermore, a lack of empirical studies confirming the synergy of advanced statistical and machine learning tools with Benford’s Law limits their practical application. A multidisciplinary approach that integrates these advanced methods with traditional auditing practices could significantly improve auditors’ ability to differentiate between benign anomalies, errors, and fraud.
This research seeks to address these gaps by developing a systematic framework for auditors to follow after detecting anomalies using Benford’s Law. By combining empirical validation with the integration of advanced technologies and conventional methods, the study aims to enhance both audit quality and financial reporting reliability. The findings have practical implications for audit practices, offering a foundation for a more refined approach to fraud detection and setting a new standard for anomaly detection in financial audits.
This paper aims to develop a framework to assist auditors beyond the anomaly’s detection of Benford’s Law. The independent variable “Benford’s Law” is analyzed on how it will affect the dependent variable, “Potential Fraud Detection” with the moderator being “Types of Analysis”. The conceptual framework can be seen in Figure
Figure
According to
One of the most common tests to see if a dataset is conformed to Benford’s Law is the Chi-square test due to its robustness in analyzing distribution of data (
First-Two-Digits Test: this test is a refined tool for auditors following initial anomalies detected with Benford’s Law, particularly when deviations involve specific starting digits, such as ‘4’.
First-Three-Digits Test: extending the First-Two-Digits analysis, the First-Three-Digits test delves deeper into transactional data to uncover finer details of potential manipulation (
77 datasets of financial transactions were analyzed using Benford’s Law First Digit. Of which only 36 datasets satisfy the Benford’s requirements:
This paper will show the detailed analysis of 3 financial datasets:
These datasets contained financial transactions of the above-mentioned companies (dataset 1, dataset 2 and dataset 3). As a part of the data cleaning process, all the transactions starting with 0 such as 0,5; 0,99; etc. were removed. They were not rounded up to 1 since it will skew the distribution. Furthermore, all the transactions were taken at absolute value, meaning that all the negative values (credit entries) are ignored (
| Dataset | Possible anomalies |
|---|---|
| Dataset 1 (Perfect Dataset) | 40 (Small) |
| 50 (Small) | |
| 100 (Medium) | |
| 103 (High) | |
| Dataset 2 (Higher) | 100 |
| Dataset 3 (Lower) | 499 |
Initially, dataset 1 appeared to follow Benford’s Law. However, as we divided the dataset into 3 subsets using K-Mean Clustering, statistically significant deviations were found in each subset. In contrast, the dataset for Client B revealed a substantial number of deviations from the expected Benford distribution. Both the Z-test and MAD test identified significant irregularities, with certain digits, particularly those starting with ‘1’. Using the First-Two and First-Three Digit Tests, it was found that this was cost by a large amount of transaction starting with figure 100. This is the same case for dataset 3 with figure 499. For more detailed analysis, see the Appendix
Manual audit was performed by an auditor by examining the transactions closely to validate the finding of this research. For dataset 2, out of 664 transactions starting with 100, 604 of them were transactions with value 100. Among those 604 transactions, 310 of them were labelled: “Inhouding eigen bijdrage auto”, meaning “Deduction of personal contribution for car” and 236 of them were labelled: “Netto vergoeding”, meaning: “Net compensation”. These 2 types of transaction combined make up the majority of transactions with a value of 100 and is the reason why digit 100 is significantly higher than expected. The auditor confirmed that these transactions are not fraudulent since the company in questioned deducted 100 euros from every employee’s yearly salary as a part of personal contribution to company’s cars and some employees are then compensated these 100 euros back. This explains why there are so many transactions starting with 100, since the company has to do this for every single employee.
For dataset 3, out of 516 transactions starting with 499, 442 of them were transactions with value 499. Among those 442 transactions, 164 of them were labelled: “Betalingen via iDEAL”, meaning “Payment by IDEAL”, and 198 of them were labelled: “Geld Onderweg”, meaning “Money on the way”. These 2 types of transaction combined make up most of transactions with a value of 499, and is the reason why digit 499 is significantly higher than expected. However, the auditor also confirmed that these transactions are not fraudulent since the company in questioned sell a lot of products online which have a price of 499. They seem to sell a lot of this products and most of the payment were made by cards, which explains why there are so many transactions with the value of 499 labelled “Payment via Ideal” and “Money on the way”.
For dataset 1, the result was inconclusive with just examining the transactions and the auditor did not have enough time to conduct a full manual audit. Therefore, further investigation is needed for those anomalies (
Figure
The process of sub-setting the data before applying Benford’s Law can help us reveal new insights that were hidden. However, the segmentation of the dataset into three subsets: small, medium and high is tricky and can be time consuming. Therefore, it should only be done when the dataset seems to follow Benford’s Law perfectly. If there is a significant deviation between Benford’s law and the actual data, then the further digit tests (First-Two and First-Three) are enough to significantly reduce the manual audit sample size. After the audit sample size has been determined, a detailed manual audit must be done. The procedures include interviewing the managers and employees responsible for the transactions, looking for patterns in the anomalies and examine the transactions carefully. This provides an answer to our main research question: “What auditing procedures should auditors perform after identifying possible anomalies using Benford’s Law?”
Based on the framework suggested in Figure
Despite the substantial discoveries and contributions of this research, it is essential to point out several limitations. First, the applicability of Benford’s Law necessitates that datasets satisfy specific criteria, including size, natural occurrence, and diversity. The generalizability of the findings to other types of financial data that do not inherently conform to Benford’s Law may be restricted by the fact that this study exclusively examined datasets that met these criteria. Additionally, although effort was made to ensure that only datasets of sufficient size were chosen, certain subsets may still be affected by the limited sample sizes, particularly when they are divided into smaller groups. This has the potential to impact the reliability and robustness of anomaly detection results. Finally, the auditor’s judgment remains a significant factor in the interpretation of anomalies. Although statistical tools serve as a foundation for identifying potential issues, the ultimate determination of fraud versus error demands an in-depth understanding of the context, which can differ significantly among auditors.
Building on the findings of this study, future research could further investigate the combination of K-Means Clustering with Benford’s Law. The results from this study showed that dividing data into clusters and applying Benford’s Law significantly improved fraud detection accuracy. Future studies could explore other clustering algorithms and their impact on fraud detection efficiency. Furthermore, the datasets used in this paper did not contain fraudulent data. Therefore, future research could use the research framework proposed by this paper and apply it to data that contained fraudulent data to validate the accuracy and reliability of the algorithms.
In addition, future study could extend this model to include the Last-Two Digit test. The Last-Two- Digits test is particularly valuable for identifying falsifications related to financial rounding or the creation of fictitious numbers (
The integration of Benford’s Law into auditing practices, as explored in this article, underscores its potential as a powerful tool for anomaly detection in financial datasets. This research not only reaffirms the utility of Benford’s Law in identifying irregularities but also addresses a critical gap in the literature by proposing a systematic framework for post-detection auditing procedures. The empirical validation of this framework, through case studies and advanced statistical methods such as K-mean clustering, demonstrates its efficacy in distinguishing between benign anomalies and potential fraud. Moreover, the incorporation of machine learning techniques alongside traditional auditing methods enhances the precision and reliability of fraud detection, offering a robust toolkit for auditors. This multidisciplinary approach, blending insights from forensic accounting, data science, and risk management, represents a significant advancement in auditing methodologies. By providing actionable guidelines for auditors, this paper contributes to the enhancement of audit quality and the integrity of financial reporting. Future research should continue to explore the integration of emerging technologies to further refine and validate these procedures, ensuring their adaptability to the evolving landscape of financial fraud and complexity of modern financial systems.
L. Le – Long
Dr. E.J.H.J. Mantelaers RA AA CFE CISA C|CISO – Eric is a partner at the Center of Digital Innovation (CODI) of RSM Netherlands. He is also a professor of applied sciences at the Future-proof Auditor research group of Zuyd University of Applied Sciences and a senior lecturer in the Audit & Assurance program at Maastricht University. At Erasmus University, Eric is affiliated with the Executive Program Financial Forensic Expert.
This article is based on Long Le’s master thesis. This makes him one of the winners of the MAB Thesis Award 2024.
Dataset 1 analysis
Dataset 2 analysis
Dataset 3 analysis