MAB-scriptieprijs |
|
Corresponding author: Stephan Kramer ( skramer@rsm.nl ) Academic editor: Paula Dirks
© 2026 Lizi Burduli, Stephan Kramer.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits to copy and distribute the article for non-commercial purposes, provided that the article is not altered or modified and the original author and source are credited.
Citation:
Burduli L, Kramer S (2026) Hidden in plain sight: Unraveling compensation disclosure bloat with generative AI and its impact on executive compensation. Maandblad voor Accountancy en Bedrijfseconomie 100(2): 69-78. https://doi.org/10.5117/mab.100.169964
|
Whether compensation contract design reflects efficient contracting or rent extraction is an ongoing debate in academic research and public discourse. We contribute to this debate by examining whether textual bloat in compensation contract disclosures is associated with excess CEO compensation. We construct a measure of bloat, defined as irrelevant, boilerplate, and redundant content, by summarizing firms’ Compensation Discussion and Analysis sections with a large language model for a sample of S&P 1500 firms during 2011–2018. In line with our hypotheses, we find a positive association between bloat and excess CEO compensation. We find no empirical evidence that governance characteristics explain the magnitude of bloat in firms’ compensation disclosures. Our findings suggest that bloated disclosures can be used as an instrument to obscure compensation levels that are unrelated to the economics of the firm.
Generative, AI, LLM, executive compensation, bloat, corporate governance
As reflected in the EU Shareholder Rights Directive (Directive 2007/36/EC), European regulators seek to increase transparency and shareholder involvement, improve the oversight of directors’ remuneration, and facilitate the flow of information. The findings of this study support these objectives by demonstrating that the substance and understandability of compensation disclosures, rather than their length, matter more for effective monitoring. In the Netherlands’ stakeholder-oriented governance model, generative AI tools can help boards, auditors, and investors in evaluating whether compensation disclosures genuinely communicate important information.
The widening pay gap between CEOs and workers in recent decades underscores a central debate in corporate governance research: whether executive compensation reflects efficient contracting or managerial rent extraction. The efficient contracting view is based on classical agency theory (
One potential mechanism to achieve this is obfuscation through bloat, i.e., adding redundant, overly complex, and irrelevant information to disclosures about compensation. In the United States, the Compensation Discussion and Analysis (CD&A) section of the proxy statement is the primary source of investors’ information about executive compensation packages.
Prior research has attempted to capture obfuscation in other disclosure contexts by using traditional textual analysis methods, such as readability or sentiment (e.g.,
Recent advancements in generative Artificial Intelligence (AI) and Large Language Models (LLMs) offer an innovative methodological solution to this problem. Unlike older models, LLMs are pre-trained on vast datasets, enabling them to grasp the contextual significance and filter out irrelevant content in a manner that approximates human-like judgment. This allows for a novel and more precise measure of obfuscation. By tasking an LLM with summarizing a document to its essential core, we can quantify bloat as the proportion of the original text that is discarded as irrelevant or redundant. To date, no study has applied an LLM-based bloat measure to compensation disclosures to examine its link with excess pay. We address this gap by answering the following research question:
How can a measure of compensation disclosure bloat be developed using generative AI, and what is its relationship with excess executive compensation?
Using 7,786 firm-year observations from S&P 1500 companies between 2011 and 2018, we use a Large Language Model to construct summaries of CD&A sections and measure bloat as the difference between the length of the original document and its summary, scaled over the length of the original document. We quantify excess compensation using the methodology of
Our results show a significant positive association between bloat and excess compensation. This association remains robust after controlling for traditional textual metrics, which suggests our measure provides an incrementally informative dimension to the study of compensation design. We find no empirical evidence that governance mechanisms, as proxied by board gender diversity percentage and the CEO serving as chair of the board, are significant determinants of bloat. Taken at face value, this opposes the idea that firms that award the CEO with abnormally high pay packages and are poorly governed should have more bloated compensation disclosures (
We contribute to the literature as follows. First, this study is one of the first attempts to develop a bloat measure from CD&A sections using generative AI. While
A central question in corporate governance research is whether executive pay reflects efficient contracting or managerial rent extraction. The efficient contracting view, rooted in classical agency theory (
In contrast, the managerial power or rent extraction view (
A central challenge to the rent extraction view is explaining how this practice persists in the face of mandatory compensation disclosures to enable investor monitoring. One potential mechanism is obfuscation, i.e., increasing the redundancy, complexity, and length of the provided information to draw attention away from key details and making it difficult for investors with limited attention and cognitive processing constraints to draw accurate conclusions (
In the United States, investors mainly rely on the CD&A section of the proxy statement to understand executive compensation packages. While mandated by the SEC to improve clarity and transparency (
Financial disclosures have grown substantially in length over the past two decades (
Traditional textual analysis methods, such as document readability and length (e.g.,
The introduction of generative AI represents a methodological shift and overcomes existing limitations through contextual understanding and reasoning. Unlike traditional NLP models, LLMs are pre-trained on a large set of data, meaning that the model learns patterns in language, such as grammar, word associations, sentence structures, and facts (
Hence, LLMs’ main advantage over traditional models is their ability to perform tasks that involve human-like judgment by understanding the context surrounding each word and the relationships between sentences. These advantages of LLMs are expected to offer a new way of quantifying disclosure bloat in executive compensation disclosures and generate summaries that contain incrementally informative content associated with excess compensation beyond traditional textual measures.
Although the structure of performance-based incentives embedded within executive compensation contracts often serves as a mechanism to align managerial actions with shareholder interests (
H1: Disclosure bloat in the CD&A sections of proxy statements is positively associated with excess executive compensation.
This hypothesis is not without tension, because not all bloat is necessarily opportunistic, as compliance with regulatory requirements and legal risk management may require firms to include lengthy and redundant text to their disclosures.
While high executive compensation may be concealed by bloat, executives’ power over their compensation may be influenced by the strength of the corporate governance systems in place. Although monitoring and higher shareholder power can limit executives’ opportunistic behaviour (
Thus, governance mechanisms can be possible determinants of disclosure bloat. Particularly, strong governance mechanisms, such as board gender diversity, should encourage more concise and transparent reporting, hence reducing bloat, while weaker governance, such as CEOs holding both CEO and board chair roles, can increase the opportunity for obfuscation and bloat. These arguments lead to the second hypothesis:
H2a: Board gender diversity is negatively associated with disclosure bloat.
H2b: CEO-Chair duality is positively associated with disclosure bloat.
The dataset comprises CD&A sections extracted from DEF 14A proxy filings in the SEC’s EDGAR database for S&P 1500 firms between 2011 and 2018, 7,786 firm-year observations. Data regarding the executive compensation and control variables are obtained from Compustat, ExecuComp, and BoardEx. Missing values are further retrieved from Refinitiv Eikon and the company’s annual reports.
Variable measurement consists of three main sets of variables and the procedure to construct the summaries. All continuous variables are winsorized at the 1st and 99th percentiles to mitigate the influence of outliers, except when bounded between 0 and 1.
Excess executive compensation is measured using the residual of actual compensation compared to the predicted compensation based on the economic determinants, following prior literature by
Recent advances in machine learning have improved natural language processing, with Transformer-based models becoming some of the most successful architectures to date. The Transformer is a type of deep learning model that uses a neural network architecture, processing sequences through multi-head self-attention to capture relationships between all parts of the input simultaneously (
LLMs generate summaries by rephrasing ideas based on context and instruction (prompt), not by copying text. The model writes a shorter version using new sentences, and because the model is trained on vast amounts of data, including financial documents, it can infer what is relevant to investors, even when the language is complex. Therefore, LLMs can tailor summaries for investors and help them understand the CD&A sections more quickly and easily.
Summaries are generated using the DeepSeek-r1-distill-qwen-7b model hosted on a local machine by dividing each CD&A section into 30,000-character chunks and using the following prompt: Write an investor-friendly summary of all relevant information in this document. The summary should be highly informative and detailed. Do not skip important data or context – include all major figures, policies, and justifications.”
The process works as follows: the CD&A files are split into chunks on a local machine. These chunks, accompanied by a standardized prompt and the model’s role, are then sent individually to the LLM. The communication and data exchange between the local and hosted machines occur using Python and an Application Programming Interface (API), which is a set of rules and protocols that allows different applications to interact in a well-documented way (
Bloat is measured following the methodology by
To control for the variables that may impact bloat and influence the relationship between the variables of interest, building on prior literature, several controls are incorporated: firm size (
To answer the research question and test H1, whether bloat is positively associated with excess compensation, the following OLS panel regression with fixed effects is employed:
Excess(Comp)it = β0 + β1Bloatit + β2TraditionalMethods + β3FirmControlsit + αi + δt + θi + ɛit (1)
where αi is firm fixed effect, δt is year fixed effect, θi is industry fixed effects using the
To address H2a and H2b and find possible determinants of bloat, board gender diversity and CEO-chair duality are regressed with CEO pay slice (
Bloatit = β0 + β1GovernanceMechanismsit + β2OtherFactorsit + β3FirmFactorsit + δt + θi + ɛit (2)
Table
| Variable | Number of Observations | Mean | Standard Deviation | Q1 | Median | Q3 |
|---|---|---|---|---|---|---|
| ExcessComp1 | 6,668 | 0.000 | 0.662 | –0.297 | 0.066 | 0.381 |
| ExcessComp2 | 6,668 | 0.000 | 0.790 | –0.428 | 0.029 | 0.464 |
| ExcessPay | 7,786 | –0.942 | 1.001 | –0.362 | 0.000 | 0.315 |
| Bloat | 7,786 | 0.874 | 0.033 | 0.862 | 0.888 | 0.893 |
| Fog | 7,786 | 22.100 | 1.760 | 20.900 | 22.000 | 23.100 |
| File_Length | 7,786 | 14,017 | 6,882 | 9,356 | 12,998 | 17,628 |
| File_Size_kb | 7,786 | 107.699 | 49.903 | 72.987 | 101.836 | 134.690 |
| SentimentLM | 7,786 | –0.000 | 0.004 | –0.003 | 0.000 | 0.003 |
| Redundancy | 7,786 | 0.189 | 0.058 | 0.150 | 0.184 | 0.224 |
| Boilerplate | 7,786 | 0.016 | 0.005 | 0.013 | 0.015 | 0.018 |
| Leverage | 7,786 | 0.200 | 0.195 | 0.033 | 0.153 | 0.306 |
| Firm_Size | 7,786 | 7.651 | 1.637 | 6.538 | 7.590 | 8.700 |
| Sales_Growth | 7,786 | 0.089 | 0.229 | -0.009 | 0.062 | 0.147 |
| R&D_Assets | 7,786 | 0.038 | 0.078 | 0.000 | 0.000 | 0.039 |
| ROE | 7,786 | 0.075 | 0.497 | 0.032 | 0.108 | 0.187 |
| BoardDiversity | 7,786 | 0.099 | 0.136 | 0.000 | 0.000 | 0.200 |
| CEO_Duality | 7,786 | 0.459 | 0.498 | 0.000 | 0.000 | 1.000 |
| CEO_Pay_Slice | 7,786 | 0.397 | 0.120 | 0.331 | 0.404 | 0.466 |
| Board_Size | 7,786 | 5.698 | 1.066 | 5.000 | 5.000 | 6.000 |
| Loss | 7,786 | 0.182 | 0.390 | 0.000 | 0.000 | 0.000 |
| Volatility | 7,526 | 0.335 | 0.160 | 0.223 | 0.301 | 0.402 |
| Growth_Options | 7,786 | 2.735 | 2.346 | 1.361 | 1.959 | 3.108 |
The first analysis tests whether AI-generated summaries contain incrementally informative content beyond traditional textual measures in explaining excess compensation. Using a regression model in Equation (1), Table
| Dependent Variable = | ExcessComp1 | |||
|---|---|---|---|---|
| (1) | (2) | (3) | (4) | |
| Bloat | 0.520*** | 0.152 | ||
| (0.200) | (0.262) | |||
| Fog_Index | –0.000 | –0.000 | 0.009 | 0.009 |
| (0.009) | (0.009) | (0.007) | (0.007) | |
| Redundancy | 0.340 | 0.326 | 0.304 | 0.371 |
| (0.249) | (0.247) | (0.372) | (0.299) | |
| SentimentLM | 8.518*** | 8.285*** | –4.184 | –4.233 |
| (3.148) | (3.160) | (3.530) | (3.504) | |
| File_Length | 0.007*** | 0.007*** | 0.016*** | 0.015*** |
| (0.002) | (0.005) | (0.004) | (0.004) | |
| Boilerplate | 0.200 | 1.047 | –1.233 | –1.075 |
| (3.188) | (3.223) | (3.311) | (3.304) | |
| Firm_Size | 0.074* | 0.074* | 0.024 | 0.024 |
| (0.045) | (0.045) | (0.017) | (0.017) | |
| Leverage | –0.257*** | –0.259*** | 0.181 | 0.181* |
| (0.088) | (0.089) | (0.091) | (0.091) | |
| Res_File_Size | 0.002*** | 0.002*** | 0.007*** | 0.007*** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Sales_Growth | 0.268*** | 0.267*** | ||
| (0.038) | (0.038) | |||
| R&D_Assets | 0.523 | 0.523 | ||
| (0.542) | (0.542) | |||
| ROE | 0.005 | 0.005 | ||
| (0.014) | (0.014) | |||
| Firm FE | Yes | Yes | No | No |
| Year FE | Yes | Yes | Yes | Yes |
| Industry FE | No | No | Yes | Yes |
| Adj. R2 | 0.606 | 0.606 | 0.094 | 0.094 |
| Within R2 | 0.015 | 0.016 | 0.074 | 0.074 |
| N | 6,668 | 6,668 | 6,668 | 6,668 |
The positive direction of File_Length (p < 0.001) and Res_File_Size (p < 0.001) coefficients is not surprising since larger file lengths and sizes suggest higher textual complexity. Every additional 1,000 words is associated with a 0.7% increase in excess compensation. Moreover, the significance of the Res_File_Size implies that firms that use more complex formatting and other textual characteristics beyond the word count are also more likely to provide longer and more detailed information, not for the sake of transparency, but to overwhelm the readers and obscure important information (
The coefficient of Leverage (–0.257, p < 0.001) indicates that higher debt constrains executives’ ability to inflate pay because creditors, particularly banks, act as additional monitors when they provide substantial financing. Hence, increased oversight limits executives’ ability to earn excess compensation. The Fog_Index coefficient is insignificant (p > 0.1), meaning that it cannot explain excess compensation. This conclusion opposes the initial expectation based on
When industry-fixed effects replace firm-fixed effects in Columns (3) and (4), SentimentLM, Leverage, and Bloat lose significance, and the adjusted R2 decreases from 60.6% to 9.4%, implying that industry fixed effects provide a much weaker control structure than firm fixed effects. Firm fixed effects absorb all stable, unobserved firm-level heterogeneity, whereas industry fixed effects only control for broad sector differences. As a result, switching to industry fixed effects introduces additional noise and reduces explanatory power, making coefficient estimates less precise.
Table
| Dependent Variable = | ExcessComp2 | |||
|---|---|---|---|---|
| (1) | (2) | (3) | (4) | |
| Bloat | 0.944*** | 0.317 | ||
| (0.278) | (0.370) | |||
| Fog_Index | 0.009 | 0.010 | 0.022** | 0.022** |
| (0.012) | (0.012) | (0.008) | (0.008) | |
| Redundancy | –0.040 | –0.066 | 0.231 | 0.222 |
| (0.292) | (0.289) | (0.497) | (0.494) | |
| SentimentLM | 10.302** | 9.880** | –3.990 | –4.093 |
| (4.025) | (4.030) | (3.909) | (3.822) | |
| File_Length | 0.010*** | 0.009*** | 0.015*** | 0.014*** |
| (0.002) | (0.002) | (0.004) | (0.004) | |
| Boilerplate | –1.468 | 0.689 | –1.868 | –1.538 |
| (3.822) | (3.852) | (3.473) | (3.427) | |
| Firm_Size | 0.090** | 0.090** | 0.023 | 0.023 |
| (0.040) | (0.040) | (0.017) | (0.017) | |
| Leverage | –0.218** | –0.222** | 0.186* | 0.186* |
| (0.104) | (0.104) | (0.104) | (0.104) | |
| Res_File_Size | 0.002*** | 0.003*** | 0.007*** | 0.007*** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Sales_Growth | 0.333*** | 0.333*** | ||
| (0.039) | (0.039) | |||
| R&D_Assets | 0.200 | 0.200 | ||
| (0.436) | (0.436) | |||
| ROE | 0.015 | 0.015 | ||
| (0.020) | (0.020) | |||
| Firm FE | Yes | Yes | No | No |
| Year FE | Yes | Yes | Yes | Yes |
| Industry FE | No | No | Yes | Yes |
| Adj. R2 | 0.498 | 0.499 | 0.069 | 0.069 |
| Within R2 | 0.012 | 0.014 | 0.056 | 0.056 |
| N | 6,668 | 6,668 | 6,668 | 6,668 |
Similarly, Columns (3) and (4) in Table
The results align with theoretical expectations. As argued before, most traditional measures rely on keywords and rules, while ignoring contextual meaning (
Following this rationale, the added explanatory power of the Bloat variable is a reasonable outcome of using a technology that more accurately replicates human judgment and intent when determining relevant content (
To address whether summaries predict excess compensation, given that Bloat varies over time within firms, I use column (2) in Tables
The result is consistent with expectations from prior literature. According to agency theory, executives are incentivized to exploit their managerial power and camouflage the structure of their compensation (
Similar to the results in
Untabulated tests on sentiment tone reveal that the negative sentiment coefficient is insignificant (p > 0.1), while the positive sentiment coefficient is significant (p < 0.001) in the main regression results. This suggests that firms use a more positive tone to obscure information since highlighting positive events can make investors judge higher executive compensation less, consistent with
The robustness check, where excess executive compensation is calculated using an alternative proxy of the
Regarding the economic magnitude of the Bloat variable, using the estimates from column (2) of Table
We find no significant relationship between BoardDiversity or CEO_Duality and Bloat, suggesting that formal governance structures may not effectively constrain redundancy. Hence, H2a and H2b are rejected. Overall, the model explains only 0.9% of the variation in Bloat, further indicating the heterogeneous complexity of disclosure practices. However, the findings align with managerial power theory (
This research explored whether large language models can identify redundant content in the CD&A sections of proxy statements and whether this redundancy is associated with excess executive compensation. The study introduces a new way of understanding excess executive compensation using a quantifiable, verbose content measure – bloat – beyond traditional textual metrics. To address the research question of how a measure of compensation disclosure bloat can be developed using generative AI and its relationship with excess executive compensation, fixed-effects panel regression models were created for 7,786 firm-year observations covering S&P 1500 companies between 2011–2018.
The results show that the bloat measure significantly explains variation in excess executive compensation, and it captures the part of excess compensation that traditional natural language processing models cannot explain. The regression results from Equation (1) are consistent with H1, which suggests that bloated CD&A sections are associated with higher excess executive compensation. The rationale is that according to agency theory, executives’ goals are to maximize personal benefits, and bloat allows them to obscure key pay details by overwhelming investors with irrelevant information and thus justifying excess compensation because investors have limited cognitive resources and restricted attention to analyzing full financial data (
Interestingly, including the industry-fixed effects rather than the firm-fixed effects turned a significant bloat coefficient into an insignificant one. This implies that the effect of the bloat variable is driven by within-firm variation over time, and a large portion of the variation in excess compensation is explained by firm-specific factors that do not vary much over time.
Contrary to H2a and H2b expectations, we did not find strong evidence that governance mechanisms are associated with less bloat. This may be due to measurement limitations or the choice of governance proxies. In addition, compensation disclosure practices may be shaped more by institutional or advisory norms than by formal governance structures, which can be a focus for future studies.
This research improves both social and scientific understanding of corporate disclosure by demonstrating the value of LLMs in identifying complex textual relationships that traditional metrics often fail to identify (
Our findings suggest important implications for regulators by showing that lengthy or more complex executive compensation disclosures do not automatically improve transparency. They also matter for investors, who can leverage generative AI to analyze the key elements of the compensation disclosure sections faster and more easily. Finally, corporate boards should emphasize the significance of clear and less complex communication in financial documents.
We acknowledge some limitations that offer opportunities for future research. First, the output of the LLM varies depending on the model and prompt used. As a result, the length of the summarized documents reflects the LLM’s judgment of contextual relevance and can change depending on the assigned task to a specific LLM, which may also affect the measure of the bloat variable. Future research can utilize different and more advanced models to assess whether the findings are generalizable.
Moreover, the analysis is restricted to S&P 1500 firms from 2011–2018, limiting generalizability to smaller or international firms with different disclosure practices. Expanding the sample and improving CD&A data extraction would strengthen external validity. Future research can also examine non-financial reports, such as sustainability reports, to investigate whether management obscures relevant information when the content is less numerical.
Although our empirical analysis is based on U.S. firms, the main message that quality matters more than length is highly relevant to the European context. Future research could apply the bloat measure to European remuneration reports to see whether the results hold under the Shareholder Rights Directive.
Finally, as with most empirical corporate governance research, absent an exogenous shock endogeneity remains a concern. We therefore do not claim that our associations imply causality. Nevertheless, the findings provide an important starting point for deepening the understanding of how the complexity of text in executive pay disclosures may relate to governance outcomes and remain valuable for theory development and policy discussions.
L. Burduli – Lizi Burduli holds a MScBA in Accounting & Financial Management from RSM, Erasmus University and is currently an auditor at Deloitte.
Dr. S. Kramer – Stephan Kramer is a Professor of Financial Decision Making and Control at RSM, Erasmus University.
This article is based on Lizi Burduli’s master’s thesis. This makes her one of the winners of the MAB Thesis Award 2025.