文章基本信息

标题：Hand in the cookie jar: an experimental investigation of equity-based compensation and managerial fraud.
作者：Bruner, David ; McKee, Michael ; Santore, Rudy 等
期刊名称：Southern Economic Journal
印刷版ISSN：0038-4038
出版年度：2008
期号：July
语种：English
出版社：Southern Economic Association
摘要：It has long been recognized that the managers and owners of a firm have different incentives. The owner seeks to maximize the value of the firm, while the manager may derive utility from additional factors, such as the firm's market share, total output, and total employment. Equity-based compensation has been an increasingly popular means by which to align the incentives of top management with those of the shareholders. By providing a share in the ownership of the firm, equity provides the manager with a greater incentive to maximize the value of the firm. However, recent theoretical and empirical work indicates that the use of equity-based compensation has the (unintended, presumably) consequence of creating an incentive to commit fraud. Management's ability to manipulate information regarding the firm's actual performance raises the possibility that reported output (profits) will be overstated. This paper provides behavioral evidence (gleaned from a laboratory experiment) that increasing the level of equity causes both the level of effort and the amount of fraud to increase.
关键词：Equity (Finance);Fraud;Fraud investigation;Managers;Pay structure;Wage payment systems

Hand in the cookie jar: an experimental investigation of equity-based compensation and managerial fraud.

Bruner, David ; McKee, Michael ; Santore, Rudy 等

1. Introduction

It has long been recognized that the managers and owners of a firm have different incentives. The owner seeks to maximize the value of the firm, while the manager may derive utility from additional factors, such as the firm's market share, total output, and total employment. Equity-based compensation has been an increasingly popular means by which to align the incentives of top management with those of the shareholders. By providing a share in the ownership of the firm, equity provides the manager with a greater incentive to maximize the value of the firm. However, recent theoretical and empirical work indicates that the use of equity-based compensation has the (unintended, presumably) consequence of creating an incentive to commit fraud. Management's ability to manipulate information regarding the firm's actual performance raises the possibility that reported output (profits) will be overstated. This paper provides behavioral evidence (gleaned from a laboratory experiment) that increasing the level of equity causes both the level of effort and the amount of fraud to increase.

Several recent empirical studies have cited the increasing use of equity-based compensation for top-level executives (Anderson, Banker, and Ravindran 2000; Hall and Murphy 2002; Hall 2003; Itner, Lambert, and Larcker 2003; Murphy 2003). In 1984, for example, stocks and options comprised less than 1% of total CEO compensation for the median firm for U.S. publicly traded corporations. By 2001 stocks and options accounted for nearly two thirds of total executive compensation for the median firm. This phenomenon is even more pronounced in "new economy" firms, defined as companies in the computer, software, Internet, telecommunications, and networking industries. Fama (1980) alludes to this phenomenon by arguing that it is the market for executive labor that demands the use of performance-based compensation. It is ironic that the very market creating the incentive to use equity-based pay may be a victim of the incentive equity-based pay creates to improve accounting and financial statements, fraudulently if necessary. Denis, Hanouna, and Sarin (2006); Erickson, Hanlon, and Maydew (2006); and Johnson, Ryan, and Tian (2006) find that executives in firms accused of corporate malfeasance relied significantly more on equity-based compensation than did those in firms that had not been accused of fraud. Furthermore, Chert et al. (2006) find evidence that weaker corporate governance, as measured by board characteristics, is associated with a higher incidence of fraud. (1)

Recent theoretical models emphasize management's ability to manipulate the reported earnings of the firm. Goldman and Slezak (2006) and Robison and Santore (2006) derive agency models in which a key element is the agent's ability to provide false information to the principal concerning the outcome of the agent's effort. While these models differ in the details, equity compensation provides the incentive for all agents to overstate the value of the firms they manage. Thus, increasing equity compensation is predicted to increase managerial effort as well as fraudulent reporting; although, this latter effect will be dampened by increased auditing (enforcement) and sanctions for fraudulent activity. The purpose of this paper is to empirically test these theoretical predictions using behavioral evidence from a laboratory experiment.

There are obvious limitations to the use of laboratory investigations of managerial malfeasance. (2) Despite the many insights of the empirical literature utilizing field data, there are several issues that are difficult to address with such data. While the component of executive compensation that is based on the equity value of the firm is public knowledge, the effective enforcement (audit) effort is not. Further, absent an explicit policy intervention, the field data typically do not contain specific changes in the enforcement levels. The theoretical predictions of management behavior are often predicated on the fact that the managers are fully aware of the probability of an audit by a regulatory agency and of the effectiveness of such audits, but, as we have noted, there is often considerable uncertainty, and the analyst working with field data must make inferences regarding management's perceptions of the regulatory processes. Of course, with field data, by necessity one can only measure detected fraud. Finally, the reactions to changes in equity compensation levels and to the probability of fraud being detected depend on individual risk attitudes, which are not easily observed in the field. (3)

The laboratory offers the researchers considerable control via the construction of the institution and the use of induced values (Smith 1982). This control affords us an opportunity to test the predictions of the recent theoretical models of managerial malfeasance through varying parameters predicted to affect the level of such malfeasance. In the controlled environment of the lab we are able to collect data on the actual effort and fraud choices of human subjects and to observe how these choices are affected by a change in the level of equity-based compensation and in the likelihood of fraud detection. While not the case with the field data, the laboratory allows us to observe the amount of fraud committed when it goes undetected. Also, by manipulating a single variable and holding all other factors constant, we are able to observe causation rather than simply correlation. (4) Our design introduces orthogonal variation to equity-based compensation and the probability that fraud will be detected. Further, our design allows us to control for risk attitudes, since we are able to elicit individual risk attitudes over the domain of the payoffs provided in the effort and fraud decision setting.

2. A Model of Managerial Behavior

Goldman and Slezak (2006) and Robison and Santore (2006) provide the basis for the following theoretical discussion of the effects of equity-based pay and monitoring on the amount of fraud that is committed. In these models, the manager is compensated via a two-part contract, (w, [alpha]), where w denotes the manager's salary income and [alpha] denotes the percentage of total firm equity given to the manager. For her part, the manager must make two decisions: the level of effort and the value of the firm to report to the market.

More precisely, the manager first must choose an effort level, L, which adds value to the firm. There are diminishing marginal returns to effort such that the value of the firm, g(L), is a strictly concave function of the amount of effort the manager contributes [i.e., g'(L) > 0, g"(L) < 0]. Providing value-adding effort is costly to the manager and therefore reduces the manager's (certain) salary-based income. In the following discussion, the cost of effort is normalized: Each unit of effort costs the manager one unit of salary income. The manager's choice of effort determines the true value of the firm, but this value is observed by the manager only; the owners do not observe the true value of the firm.

After the manager chooses effort, she must choose the value of the firm to report to the shareholders. Any value in excess of the true value reported by the manager is considered fraud. Thus, the reported value of the firm is the true value of the firm plus any additional value the manager chooses to report, g(L) + F, where F is the amount of fraud committed by the manager. (5) The potential to commit fraud is sufficient to generate a reaction from the market. The market rationally expects some level of fraud, [F.sup.e]. (6) It is not costless for the manager to defraud the shareholders. There is a known probability, p, that the manager will be caught committing fraud and that sanctions, s(F), will be imposed on the manager. The sanctions function is increasing and strictly convex in the amount of fraud [i.e., s'(F) > 0 and s"(F) > 0].

The manager's preferences over potentially random distributions of wealth are given by the mean-variance utility function shown in Equation 1:

[EU.sub.M] = E([W.sub.M]) - [r.sub.M][[sigma].sup.2.sub.wm], (1)

where [W.sub.M] is the manager's wealth; [[sigma].sup.2.sub.wm] is the variance of the manager's wealth; and [r.sub.M] [greater than or equal to] 0 is a risk aversion parameter. It is straightforward to calculate the variance of s(F), which equals the variance in the manager's wealth: (7)

[[sigma].sup.2.sub.s](F) = p[[s - ps].sup.2] + (1 - p)[[0 - ps].sup.2] = p(1 - p)[s.sup.2].

Recalling that s"(F) > 0, it follows that

d[[sigma].sup.2.sub.s](F) / dF = 2p(1 -p)s(F)s'(F) > 0,

[d.sup.2][[sigma].sup.2.sub.s](F) / [dF.sup.2] = 2p(1 -p)(s(F)s"(F) + [[s'(F)].sup.2]) > 0.

Choosing a greater value of fraud increases the variance of the manager's wealth. So the cost of choosing greater fraud has two costs for the risk-averse manager: the increase in the expected sanction and the increase in risk.

We solve for the manager's optimal choices backwards since a rational manager will anticipate her future choice of fraud when she chooses effort. Once effort has been chosen, the manager must choose a level of fraud. The manager's objective function is given by the following:

Max [alpha](g(L) + F - [F.sup.e]) - L - ps(F) - [r.sub.M][[sigma].sup.2.sub.s](F). (2)

At an interior solution to Equation 2, the first-order condition with respect to F is

[alpha] - ps'(F) - [r.sub.M] d[[sigma].sup.2.sub.s](F) / dF = 0. (3)

Given that s(F) is convex, the sufficient second-order condition is satisfied thus:

-ps"(F) - [r.sub.M] [d.sup.2][[sigma].sup.2.sub.s](F;[t.sub.F] / [dF.sup.2] < 0.

Equation 3 implicitly defines the optimal level of fraud, [F.sup.*] = F([alpha], p), which is independent of the level of effort. In choosing the optimal level of effort, the manager anticipates her future choice of fraud, thus:

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII.]

At an interior solution, the first-order condition with respect to L is

[alpha]g'(L) - 1 = 0. (4)

Equation 4 implicitly defines the optimal level of effort, [L.sup.*] = L([alpha]). The only parameter that enters the manager's choice of effort is [alpha], the percentage of equity.

As Equation 5 shows, effort is increasing in [alpha]:

[dL.sup.*] / d[alpha] = -g'([L.sup.*]) / [alpha]g"([L.sup.*]) > 0. (5)

The optimal level of fraud is increasing in the share of the firm and decreasing in the probability of detection and degree of risk aversion:

[partial derivative][F.sup.*] / [partial derivative][alpha] = 1 / ps" + [r.sub.M] [d.sup.2][[sigma].sup.2.sub.s](F;[t.sub.F) / [dF.sup.2] > 0, (6)

[partial derivative][F.sup.*] / [partial derivative]p = -s' / ps" + [r.sub.M] [d.sup.2][[sigma].sup.2.sub.s](F;[t.sub.F) / [dF.sup.2] > 0, (7)

[partial derivative][F.sup.*] / [partial derivative][r.sub.M] = -d[[sigma].sup.2.sub.s](F) / dF / ps" + [r.sub.M] [d.sup.2][[sigma].sup.2.sub.s](F) / [dF.sup.2] > 0, (8)

Equation 6 shows that as the share of the firm the manager owns increases, there is also an increased incentive to artificially inflate the firm's market value. On the other hand, Equations 7 and 8 show that the optimal amount of fraud is a decreasing function of the probability of the fraud being detected and the degree of the agent's risk aversion.

3. Experimental Design

The experimental setting assigns human subjects the role of manager while the shareholder role is computerized. We implement the basic elements of the theoretical model presented above to investigate the effects of equity-based pay and monitoring on the amount of fraud committed. In the first stage of a period, subjects choose the level of effort, and in the second stage subjects choose a level of fraud. Finally, the results of a random audit process are revealed to the subjects, and the outcomes and payoffs for the period are summarized.

The experiment was programmed and conducted with the software Z-Tree (Fischbacher 2007). In the experiment subjects interacted with a computer interface in which the instructions were presented, and their decisions were recorded. Subjects were volunteers recruited through announcements in undergraduate classes at the University of Tennessee. When they arrived at the lab, the subjects were seated in individual privacy carrels and entered all of their decisions via the computer mouse. They were not permitted to communicate with other subjects, and they proceeded through the experiment at their own pace. Sessions lasted approximately 60 minutes, and subjects earned an average of $15 U.S. dollars (actual range of earnings is $8 to $19) based on their decisions. After the subjects read the instructions and the experimenter answered clarifying questions, the experiment began. Subjects completed five practice periods and 20 periods for actual money.

[FIGURE 1 OMITTED]

Both the Goldman and Slezak (2006) and the Robison and Santore (2006) models are based on the assumption that the manager is risk averse. In order to control for the risk preferences of subjects, we employ a measure similar to that of Holt and Laury (2002). (8) For each of 10 individual gambles, subjects choose between a lottery paying either 1000 or 0 lab dollars or a guaranteed payoff of 500 lab dollars. The gambles vary in their respective probabilities of winning the large prize. (9) Figure 1 presents the screen image for the gamble choice exercise. A subject's pattern of responses results in an independent measure of risk preference. If preferences are as described in Equation 1, the subject's risk aversion parameter determines the option at which they choose to switch from the guaranteed amount (Choice B) to the gamble (Choice A). Thus, the subject will choose Choice B if

[r.sub.i] > 10002[p.sub.j] - 500 / [1000.sup.2][p.sub.j](1 - [p.sub.j]).

Since subjects make 10 decisions, we are able to construct bounds on the implied risk aversion parameter. The ranges of the implied risk aversion parameter are given in column 2 of Table 1. The observed frequencies are given in column 4.

As can be seen from Table 1, subjects are fairly symmetrically distributed around risk neutrality. (10) Employing a random utility framework first introduced by McFadden (1974) allows us to estimate the average risk aversion parameter from Equation 1. In order to account for the panel structure of our data we estimate a random effects probit model. (11) The estimated population average risk aversion parameter is -0.0002 but is not statistically different from zero. Thus, our subjects appear to be risk neutral on average but exhibit some heterogeneity.

After entering their decisions for the gamble exercise, subjects receive written instructions, which they retain throughout the experiment, and also proceed through more thorough instructions on the computer. Here they are presented with examples of the relevant information screens, definitions and descriptions of the information being provided on those screens, and the calculations that will determine their payoffs.

In the first stage of a period the computer displays the two-part contract to the subject: the endowment and the share. Subjects are told that the first part of their payoff for the period is simply their endowment (salary) less their contribution (effort). (12) The second part of their payoff is their share (equity) multiplied by their reported level of output. After observing the contract the subject must choose the level of contribution (effort) by selecting a radio button on the screen. Choice of contribution ranges from 0 to 200 in increments of eight units. (13) A payoff table containing the first part of their payoff and the amount of output that will result from their choice is displayed on the screen. Figure 2 shows the subject's screen image for the first decision.

In the second stage of the experiment the computer displays the true level of output that results from the subjects' choice of contribution, their share (equity) of reported output, and the probability that they are checked (audited). If they are checked and they have reported additional output, a penalty is subtracted from the second part of their payoff. Subjects are informed that the penalty is an increasing function of additional output reported. (14) Another payoff table is displayed reporting the second part of their payoff and the possible penalty associated with each amount of additional output. The amount of additional output that can be reported ranges from 0 to 500 in 20-unit increments. Subjects are then asked to choose how much additional output to report. Additional output is reported by selecting a radio button on the screen. Figure 3 shows the subject's screen image for this second decision.

[FIGURE 2 OMITTED]

After the two decisions have been completed, the subjects are informed of the audit outcome and shown a summary page with the parameter values, their decisions, and their payoffs for the round. The subjects are informed whether they were checked or not and are provided the penalties that have been incurred as a result. The period summary screen is shown in Figure 4.

The experiment treatment structure yields a 2 x 2 factorial design resulting in four experiment treatments. The treatment parameters in each session are the value of [[alpha].sub.i], the manager's percentage equity, and the value of p, the probability of an audit. The equity parameter [[alpha].sub.i] takes two levels, 30% and 50%, and the probability of an audit p also takes two values: 15% and 25%. Table 2 summarizes the design.

[FIGURE 3 OMITTED]

[FIGURE 4 OMITTED]

The theoretical model directly lends itself to some testable hypotheses, and the following hypotheses will be tested using the data from our experiments:

H1: Effort is strictly increasing in the level of equity, ceteris paribus. According to Equation 5, the optimal choice of effort is an increasing function of the level of equity. Thus, the level of effort is predicted to increase from T1 to T3 and from T2 to T4.

H2: Effort is independent of the probability of detection, ceteris paribus. According to Equation 4, the optimal choice of effort is independent of the probability of detection. Thus, the level of effort chosen is predicted to be the same for T1 and T2 and for T3 and T4.

H3: Fraud is increasing in the level of equity, ceteris paribus. According to Equation 6, the optimal choice of fraud is an increasing function of the level of equity. Thus, the level of fraud is predicted to increase from T1 to T3 and from T2 to T4.

H4: Fraud is decreasing in the probability of detection, ceteris paribus. According to Equation 7, the optimal choice of fraud is a decreasing function of the probability of detection. Thus, the level of fraud is predicted to decrease from T1 to T2 and from T3 to T4.

H5: There is an interactive effect of equity and the probability of detection on the level of fraud chosen, ceteris paribus. According to Equation 6, the response of the optimal choice of fraud to a change in the level of equity is dependent on the probability of detection. Thus, the change in the level of fraud chosen from T1 to T3 is predicted to be greater than the change in the level of fraud chosen from T2 to T4. (15)

H6: Fraud is decreasing in the degree of risk aversion, ceteris paribus. According to Equation 8, the optimal choice of fraud is a decreasing function of the agent's degree of risk aversion. Thus, across all treatments, individuals who are more risk averse are predicted to choose lower levels of fraud.

Twenty subjects participated in each treatment, and they faced the decision setting for 20 periods. This is a between-subject design; subjects face a single set of parameters in a given session and the subjects are only permitted to participate in one session (set of parameters). (16)

4. Data Analysis and Results

Our data set constitutes a panel of 1600 pooled observations. Observations for any given subject are correlated across periods; thus, any analysis must account for such correlation. To account for the structure of the data we utilize a panel estimation approach, which accounts for the repeated observations on each individual and allows us to incorporate the potential for learning and feedback during the experiment session. The variables used in our analysis are defined in Table 3 with descriptive statistics included. The first two variables are the dependent variables in the subsequent estimations. There are four treatment dummy variables. Since we use a between-subjects design, all treatment parameters are time invariant. (17) We construct a dummy variable that indicates whether or not a subject exhibited risk-averse preferences in the gamble exercise. (18) There are lagged feedback variables that include a dummy indicator with regard to whether a subject was audited in the previous period, the amount of the penalty the subject incurred in the previous period, and the amount of accumulated earnings at the beginning of a decision period. (19) We also use the inverse of the decision period as a proxy for learning. (20)

In Table 4 we report the results from generalized least-squares estimation of the level of effort chosen. Three alternate specifications are reported here. In order to account for the panel structure of the data, all three specifications are estimated with panel-specific heteroskedastic error terms. The first model is the most parsimonious, containing only treatment effects. The second model incorporates learning by including the inverse of the period (inverse period). The static model being tested does not account for learning effects; thus, they must be controlled for in the analysis. The third model adds to the learning effects the possible feedback effects, such as whether the subject was audited in the previous period (lagged audit), the amount of any penalties in the previous period (lagged penalty), and the amount of accumulated earnings in the experiment (lagged wealth). Again, since we are testing a static model, there are no theoretical predictions about the sign or magnitudes of these effects. However, in order to appropriately test the theoretical predictions, these effects must be controlled for in the analysis. The third model is used for hypotheses tests. Thus, the estimated effort equation can be written as follows:

effort = [[beta].sub.1][T.sub.1] + [[beta].sub.2][T.sub.2] + [[beta].sub.3][T.sub.3] + [[beta].sub.4][T.sub.4] + [[beta].sub.5]averse + [[beta].sub.6]inverse_period + [[beta].sub.7]lagged_audit + [[beta].sub.8]lagged_penalty + [[beta].sub.9]lagged_wealth + [e.sub.i].

Recall that the first two hypotheses presented in the previous section are related to the response of effort to changes in the level of equity compensation and the probability of detection, respectively. We formally test these two hypotheses using a Wald test, thus:

H1: Effort is strictly increasing in the level of equity.

[B.sub.3] - [B.sub.1] > 0, [X.sup.2] = 342.40 p-value of 0.000, [B.sub.4] - [B.sub.2] > 0, [X.sup.2] = 332.14 p-value of 0.000.

For either level of the probability of detection, we reject the null hypothesis that effort is independent of the level of equity. Thus, we find support for our first hypothesis.

H2: Effort is independent of the probability of detection, ceteris paribus.

[B.sub.1] - [B.sub.2] = 0, [X.sup.2] = 16.01 p-value of 0.000, [B.sub.3] - [B.sub.4] = 0, [X.sup.2] = 9.76 p-value of 0.002.

We reject the null hypothesis that the level of effort is the same for both levels of the probability of detection, holding equity constant. Thus, we do not find support for our second hypothesis.

We do find weak evidence of some feedback effect given the negative and significant coefficient on lagged audit. (21) We also find evidence of learning on the part of subjects given the positive and significant coefficient on inverse period. A dynamic model is required to incorporate the possibility of such effects and is beyond the scope of this paper.

Table 5 reports the results of our analysis of the fraud decision. The estimations utilize the random-effects Tobit specification to account for subject heterogeneity and the censoring of the level of fraud. (22) Our approach is similar to that of the estimation of the effort equation. We start with the most parsimonious model first, which includes the treatment indicator variables along with the subject-specific dummy variable indicating whether the subject exhibited risk-averse preferences in the gamble choice exercise. (23) We then control for learning effects by including the inverse of the period (inverse period) in the second model. The third model adds to the second feedback effects, such as whether a subject was audited in the previous period (lagged audit), the amount of penalties in the previous period (lagged penalties), and the amount of accumulated earnings (lagged wealth). The third model is used for hypotheses tests. Thus, the estimated effort equation can be written as follows:

fraud = [[beta].sub.1][T.sub.1] + [[beta].sub.2][T.sub.2] + [[beta].sub.3][T.sub.3] + [[beta].sub.4][T.sub.4] + [[beta].sub.5]averse + [[beta].sub.6]inverse_period + [[beta].sub.7]lagged_audit + [[beta].sub.8]lagged_penalty + [[beta].sub.9]lagged_wealth + [e.sub.i].

Recall that the last four hypotheses in the previous section were concerned with the response of fraud to changes in the level of equity compensation and the probability of detection, respectively. We formally test these four hypotheses using a Wald test, thus:

[H.sub.3]: Fraud is strictly increasing #1 the level of equity.

[B.sub.3] - [B.sub.1] > 0, [chi square] = 14.77 p-value of 0.000, [B.sub.4] - [B.sub.2] > 0, [chi square] = 6.51 p-value of 0.011.

For either level of the probability of detection, we reject the null hypothesis that fraud is independent of the level of equity. Thus, we find support for our third hypothesis that fraud is strictly increasing in the level of equity.

[H.sub.4]: Fraud is strictly decreasing in the probability of detection, ceteris paribus.

[B.sub.1] - [B.sub.2] > 0, [chi square] = 1.69p-value of 0.194, [B.sub.4] - [B.sub.3] > 0, [chi square] = 7.50p-value of 0.006.

For the low level of equity we accept and for the high level of equity we reject the null hypothesis that the level of fraud is decreasing in the probability of detection. Thus, we only find weak support for our fourth hypothesis. Furthermore, our results are indicative of an interactive effect between the level of equity and the probability of detection. This will be explored further in our next hypothesis test.

[H.sub.5]: There is an interactive effect between equity and the probability of detection on the level of fraud.

[B.sub.3] - [B.sub.1] > [B.sub.4] - [B.sub.2], [chi square] = 0.74 p-value of 0.390.

We accept the null hypothesis that the response of fraud to a change in the level of equity is independent of the probability of detection. Thus, we do not find support for our fifth hypothesis of an interaction effect between equity and the probability of detection.

[H.sub.6]: Fraud is decreasing in risk aversion.

[B.sub.5] < 0, Z = - 2.70 p-value of 0.007.

We reject the null hypothesis that fraud is independent of risk aversion. Thus, we find support for our sixth hypothesis that fraud is decreasing in risk aversion.

There is no evidence that the subjects "learned" the optimal amount of fraud as the experiment progressed given that the estimated coefficient on the inverse of the period (inverse period) is not statistically different from zero. (24) However, there is evidence of feedback effects. The estimated coefficient on lagged penalty is 2.34 and is significant at the 1% significance level. This is consistent with the often-observed gambler's fallacy. That is, after being caught and assessed penalties, subjects increased the amount of fraud they chose in the subsequent period. (25) But, the estimated coefficient of lagged audit is not statistically different from zero. The estimated coefficient on lagged wealth is consistent with decreasing absolute risk aversion.

Table 6 presents the point predictions for both effort and fraud from the theoretical model assuming risk neutrality. Conditional means are obtained from the estimated coefficients on the treatment dummy variables from the effort and fraud equations, which allow us to partial out any subject-specific effects, as well as feedback and learning effects.

The mean level of effort for each treatment is close to the theoretical prediction. The theoretical level of effort for treatments 1 and 2 was 32, while the mean level of effort was 31.51 and 40.70, respectively. The theoretical level of effort for treatments 3 and 4 was 80, while the mean level of effort was 89.39 and 80.53, respectively. The mean level of fraud is close to the theoretical prediction for treatments 1 and 4. The theoretical level of fraud for both treatments 1 and 4 is between 200 and 220, and the mean levels of fraud are 188.55 and 239.35, respectively. (26) Treatments 2 and 3 are predicted to generate corner solutions (in fraud) of 0 and 500. However, the design permits errors from the predicted outcomes in only one direction, respectively, for each treatment. (27) It is apparent (Table 6) that mean levels of fraud for treatments 2 and 3 reflect an inward bias as a result of the censoring involved with our experimental design. That is, the mean predicted level of fraud for treatment 2 (136.17) is greater than the theoretical prediction, and, conversely, for treatment 3 (338.08) it is below the predicted level.

Before concluding our analysis a discussion of the relative effects of our treatment variables (equity share and probability of detection) would be useful. Given that we observe two distinct levels for each of our treatment variables, we can evaluate the average unconditional discrete change in fraud due to each of the treatment variables (model 3, Table 6). For the level of equity the average unconditional discrete change was 75.59. (28) For the probability of detection the average unconditional discrete change was -44.15. Thus, we find for the levels of the treatment variables we observe that the increase in fraud caused by a 66% increase in the level of equity is almost twice as large as the decrease in fraud caused by a 66% increase in the probability of detection.

5. Conclusions

The recent publicity of high-profile cases of corporate fraud has drawn the attention of not only the media but also of political agents. In response, the U.S. Congress passed the Sarbanes-Oxley Act (SOX) in an effort to restore investor confidence by appearing to attack corporate fraud. Central to the debate is the question of the root causes of fraudulent behavior. One well-received hypothesis, provided by academics as well as policy makers, is that the use of firm equity to compensate managers has had the unintended consequence of making fraud more attractive to these managers. The results of the experimental investigation reported here indicate that, while it does increase productive effort, equity compensation has the effect of increasing managerial fraud.

The multivariate analysis confirms the predictions of the theoretical framework: Both the level of effort chosen and the level of fraud committed are increasing in the share of output (equity) the subject managers received. Increasing enforcement, the probability that fraud is detected, is shown to reduce the amount of fraud committed. The conditional means from the estimated equations indicate support for the theoretical model. Higher levels of equity-based compensation are correlated with higher levels of effort, as well as higher levels of fraud. Our subject managers clearly understand the incentives established by the laboratory setting.

Finally, in our experiments the level of equity compensation was a treatment variable and, therefore, was exogenous. However, in the naturally occurring world the level of equity compensation is determined by owners, who are influenced by external factors such as the regulatory environment and the market's potential reaction to the revelation of fraud. Given that equity compensation is expected to result in more fraud, it is natural to ask whether owners will provide greater equity compensation in response to legislation (such as SOX) that imposes greater monitoring on the managers. If so, the net effect on the actual amount of fraud may be dampened. One important avenue for future research is to examine the response of the optimal compensation contract to regulation designed to decrease corporate fraud.

The usual caveat is in order regarding the use of laboratory experiments to inform our understanding of behavior in the naturally occurring world. We address the following central question: Does the laboratory setting provide for the necessary degree of "parallelism" to the naturally occurring world that is crucial to generalizing our experimental results beyond the setting of the lab (Smith 1982; Plott 1987)? The experimental setting need not attempt to capture all of the variation in the naturally occurring environment, but it should sufficiently recreate the fundamental elements of the naturally occurring world, if the results are to be relevant in policy debates. While our payoffs are relatively small, our experimental setting provides the computations necessary for the decision and a clear link between decisions and rewards, thus reducing the decision costs. List and Levitt (2007) describe several factors that they assert limit the experimenter's ability to generalize behavior beyond the lab. Most, if not all, of the issues raised by List and Levitt are minimized by incorporating the precepts of experimental design articulated by Smith (1982) and by ensuring that payoffs are salient and that the financial rewards dominate such factors as the subjects' desire to please (or punish) the experimenter. (29)

Received September 2005; accepted September 2007.

An earlier version of this paper was presented at the Economic Science Association meeting in Montreal, Canada, in June 2005. A later version was presented at Appalachian State University. We thank several participants, Tim Perri, and two referees for helpful comments. Of course, all errors are our responsibility. The J. Fred Holly Endowment to the Department of Economics at the University of Tennessee provided funds for this research.

References

Anderson, M., R. Banker, and S. Ravindran. 2000. Executive compensation in the information technology industry. Management Science 46:530-47.

Andreoni, J. 1995. Cooperation in public goods experiments: Kindness versus confusion? The American Economic Review 85:891-904.

Brunet, D. 2006. Risk preference elicitation: An empirical analysis of mechanism consistency. Unpublished paper, University of Calgary.

Chen, G., M. Firth, D. N. Gao, and O. M. Rui. 2006. Ownership structure, corporate governance, and fraud: Evidence from China. Journal of Corporate Finance 12:424-48.

Croson, R., and J. Sundali. 2005. The gambler's fallacy and the hot hand: Empirical data from casinos. The Journal of Risk and Uncertainty 30:195-209.

Davis, D. D., and C. A. Holt. 1993. Experimental economies. Princeton, NJ: Princeton University Press.

Denis, D. J., P. Hanouna, and A. Sarin. 2006. Is there a dark side to incentive compensation? Journal of Corporate Finance 12:467-88.

Erickson, M., M. Hanlon, and E. L. Maydew. 2006. Is there a link between executive equity incentives and accounting fraud? Journal of Accounting Research 44:113-43.

Fama, E. 1980. Agency problems and the theory of the firm. Journal of Political Economy 88:288-307.

Fischbacher, U. 2007. Z-tree: Zurich toolbox for readymade economics experiments. Experimental Economics 10:171-8.

Friedman, D., and S. Sunder. 1994. Experimental methods: A primer for economists. New York: Cambridge University Press.

Goeree, J., C. A. Holt, and T. R. Palfrey. 2003. Risk averse behavior in generalized matching pennies games. Games and Economic Behavior 45:97-113.

Goldman, E., and S. L. Slezak. 2006. The economics of fraudulent misreporting. Journal of Financial Economics 80:603-26.

Hall, B. J. 2003. Six challenges of equity-based pay design. Journal of Applied Corporate Finance 15:21-33.

Hall, B. J., and K. Murphy. 2002. Stock options for undiversified executives. Journal of Accounting and Economics 33:3-42.

Harrison, G. W., E. Johnson, M. M. McInnes, and E. E. Rustrom. 2005. Risk aversion and incentive effects: Comment. The American Economic Review 95:897-901.

Holt, C. A., and S. K. Laury. 2002. Risk aversion and incentive effects. The American Economic Review 92:1644-55.

Holt, C. A., and S. K. Laury. 2005. Risk aversion and incentive effects: New data without order effects. The American Economic Review 95:902-4.

Itner, C. D., R. A. Lambert, and D. F. Larcker. 2003. The structure and performance consequences of equity grants to employees of new economy firms. Journal of Accounting and Economics 34:89-127.

Johnson, S. A., H. E. Ryan, and Y. S. Tian. 2006. Managerial incentives and corporate fraud: The sources of incentives matter. Presented at 2006 EFA Zurich Meetings. Accessed 26 July, 2006. Available http://ssrn.com/abstract = 395960.

Khaneman, D., and A. Tversky. 1979. Prospect theory: An analysis of decision under risk. Econometrica 47:263-92.

Lange, A., J. List, and M. K. Price. 2007. Using lotteries to finance public goods: Theory and experimental evidence. International Economic Review 48:901-27.

List, J., and S. Levitt. 2007. What do laboratory experiments tell us about the real world? Journal of Economic Perspectives 21 : 153-74.

McFadden, D. 1974. Conditional logit analysis of qualitative choice behavior. In Frontiers in econometrics, edited by R. Zarembka. New York: Academic Press, pp. 105-42.

Murphy, K. J. 2003. Stock-based pay in new economy firms. Journal of Accounting and Economics 34:129-47.

Plott, C. R. 1987. Dimensions of parallelism: Some policy applications of experimental methods. In Laboratory experimentation in economies: Six points of view, edited by Alvin E. Roth. New York: Cambridge University Press, pp. 193-219.

Poirier, D. J. 1980. Partial observability in bivariate probit models. Journal of Econometrics 12:209-17.

Robison, D., and R. Santore. 2006. Managerial incentives, fraud, and firm value. SSRN Working Paper No. 770924.

Smith, V. L. 1982. Microeconomic systems as an experimental science. The American Economic Review 72:923-55.

Smith, V. L., and J. Walker. 1993. Monetary rewards and decision cost in experimental economics. Economic Inquiry 31:245-61.

Yu, Hung-Chao. "The Effects of Incentive Contracts and Monitoring Mechanism on Honesty in Managerial Reporting." Accessed 6 January 2004. Available http://ssrn.com/abstract=367540.

(1) The use of field data is problematic, since fraud is only observed if it is detected, creating an identification problem. We employ a bivariate probit model with partial observability to address this concern. See Poirier (1980) for a discussion.

(2) List and Levitt (2007) discuss some limitations as they affect our ability to generalize beyond the lab. Essentially, these address the external validity of the behavior observed in the laboratory conditions and the extent to which we can achieve parallelism in the lab. We return to these issues in our concluding section.

(3) We are not the first to investigate corporate malfeasance in the lab. Yu (2004), for example, reports on a set of lab experiments in which the fraud takes the form of embezzling the company assets. Yu's method differs from ours in three important ways: Fraud is a binary choice, not a level; subject risk attitudes are not measured; and subjects are not exposed to an audit each period. Rather, subjects are selected for an audit that covers all periods of the experiment, with the audit being announced only at the end of the session.

(4) Analysis based on field data is only able to detect correlation between variables due to the potential for confounding unobservable effects.

(5) Contrary to the concept of earnings management in the accounting literature, there is no incentive to understate the value of the firm.

(6) The expected level of fraud will be normalized to zero for purposes of the experiment.

(7) Since the sanction is the only random component of wealth, the variance in wealth is equivalent to the variance in the sanction.

(8) Employing such a mechanism to obtain a measure of subject's risk preferences is not without critics. Both Holt and Laury (2002, 2005) and Harrison et al. (2005) demonstrate that the measure of risk aversion obtained by such a mechanism is sensitive to the magnitude of payment. That is, scaling up real payments results in an increase in risk aversion. Furthermore, the gamble choice exercise only allows for gains, while in the fraud experiment both gains and losses were possible. In the context of alternative theories for decision making under uncertainty, such as the Prospect Theory (see Khaneman and Tversky 1979), any measure for risk preference in the first setting cannot be generalized to the second. However, in our experimental setting the payment from the gamble choice exercise is comparable to that of a decision period in the actual experiment. Also, if the subjects view the guaranteed payoff as a reference point, then the gamble choice exercise involves both gains and losses. Thus, any inferred risk preference from the exercise should remain stable across the actual fraud experiment. Goeree, Holt, and Palfrey (2003) report findings that risk attitudes obtained from generalized matching pennies games are consistent with those from the Holt-Laury procedure for the same subject pool. Lange, List. and Price (2007) demonstrate the requirement to incorporate risk attitudes into the econometric specification.

(9) The realization of the gamble phase is not revealed to subjects until alter completion of the managerial decision experiment. This prevents interaction between the decision tasks in the experiment session.

(10) Our Table 1 is derived in the same fashion as table 5 in Goeree, Holt, and Palfrey (2003). We had 20 subjects exhibit inconsistent preferences by switching from the certain amount to the gamble and then back to the certain amount. We also had four subjects exhibit irrational preferences by opting for the certain payoff in option 10, in which they could have chosen a "gamble" with a 100% chance of winning the large payoff. These subjects are coded as missing observations.

(11) See Bruner (2006) for details.

(12) Neutral language was used in order to minimize possible framing effects.

(13) The use of radio buttons invoked a tradeoff between approaching continuous decision space and the cognitive burden associated with evaluating each choice. Thus, we divided the decision space for both effort and fraud into 25 equally spaced discrete choices.

(14) Again, subjects are informed via text in the instructions and can observe such in the payoff table on the decision screen.

(15) This is true for the values of the probability of detection, p, which we investigate since variance of sanctions is a concave function of p. A strength of employing a 2 x 2 experimental design is the ability to detect interaction effects between the treatment variables since the variables are perfectly orthogonal.

(16) The experiment required parameterization of both the G(L) and the S(F) functions. Both functions assumed the form [[theta].sub.i][x.sup.[lambda]i], where i is either G or S and x represents the decision variable. Specifically, [[theta].sub.G] = 50 and [[theta].sub.S] = 1 and [[lambda].sub.G] = 0.45 and [[lambda].sub.s] = 1.11. The choice of parameters was made in order to make the objective function as concave as possible to maximize the likelihood that the gains from discovering the unique maximum surpassed the cognitive cost of searching. Risk neutrality was assumed during the choice of parameters since risk aversion requires some knowledge of individual characteristics.

(17) This also prevents us from running a pure treatment effects model since treatment indicator variables will be perfectly correlated with either the subject fixed or random effects.

(18) Twenty-four subjects provided inconsistent or irrational responses (see footnote 10). These people are coded as missing observations for the risk aversion dummy variable. Thus, they are excluded from the estimated fraud equations because they include that dummy variable. Our results are robust to coding these subjects as zero for the risk aversion dummy variable.

(19) Note that the minimum level of the wealth variable is negative. If subjects are detected committing fraud in the early rounds of the session, the fine is sufficient to result in negative wealth.

(20) The reason the inverse of the period is used is to allow for diminishing marginal effects of learning. That is, subjects are most likely to learn in the early decision periods.

(21) The correlation between lagged audit and lagged penalty is 0.699. Thus, multicollinearity is of concern and may be responsible for the insignificance of the estimated coefficient on lagged audit.

(22) Subjects could choose levels of fraud between 0 and 500. Thus, the observations are censored at 0 and at 500. Given our experimental design we expect 400 observations to occur at both of these levels of fraud. We actually observe 180 observations that occur at 0 and 211 observations that occur at 500. Ordinary least-squares estimates are inconsistent in the presence of censoring, and this prohibits the use of a Hausman test to verify the appropriateness of the random-effects specification.

(23) Since we employ a between-subjects design our treatment dummies are perfectly correlated with any subject-specific fixed or random effects. Thus, the most parsimonious model we can estimate must include at least one subject-specific variable so that the random effects are not perfectly correlated with the explanatory variables.

(24) There is an inconsistency in the sense that estimated "'learning" effects are significant in the effort equation and insignificant in the fraud equation. The fraud decision was more difficult than the effort decision in that it required subjects to calculate expected values. Thus. it could be there were too few periods for any "learning" effects to be estimated for the fraud decision, while there were enough for such effects to be estimated for the effort decision.

(25) This could be considered irrational behavior. As Davis and Holt (1993) note, individuals are observed to make decisions in lottery settings, which may be seen to be irrational as a result of errors in probability estimation (i.e., subjective probability assessment [Davis and Holt]). Of course, this is a plausible explanation, but we cannot be certain of the estimated coefficients, given the correlation between lagged audit and lagged penalty (see footnote 22). See Croson and Sundali (2005) for a detailed discussion concerning the prevalence of the gambler's fallacy in both laboratory and field experiments.

(26) The objective function yields payoffs that are very similar at the 200 and 220 choices for fraud, and we have elected to represent the prediction as a range.

(27) Andreoni (1995) cites this issue in reference to the often higher-than-predicted levels of contributions in public goods experiments. In our setting, subjects can only make errors in their fraud decision that are greater (less) than the theoretical prediction for treatment 2 (4).

(28) Calculated as [([[delta].sub.3] - [[delta].sub.1]) + ([[delta].sub.4] - [[delta].sub.2])]/2.

(29) As Friedman and Sunder (1994) note, one cannot "prove" parallelism through deductive reasoning. In the end, it is inductive reasoning that allows us to assert that because we have observed regularities, these will continue. Smith and Walker (1993) argue for ensuring that the decision rewards are commensurate with the decision costs and task complexity.

David Bruner, * Michael McKee, ([dagger]) and Rudy Santore ([double dagger])

* Department of Economics, Walker College of Business, Raley Hall, 416 Howard Street, Appalachian State University, Boone, NC 28608, USA; E-mail brunerdm@appstate.edu.

([dagger]) Department of Economics, Walker College of Business, Raley Hall, 416 Howard Street, Appalachian State University, Boone, NC 28608, USA; E-mail mckeemj@appstate.edu; corresponding author.

([double dagger]) Department of Economics, College of Business Administration, Stokely Management Center, 1000 Volunteer Boulevard, University of Tennessee, Knoxville, TN 37996, USA; E-mail rsantore@utk.edu.

Table 1. Risk Preference Classification Based on Lottery Choices

Number of Number of
Safe Choices Risk Parameter Range Classification Choices

2 r < -0.002 Very risk loving 3
3 -0.002 < r < -0.001 Risk loving 5
4 -0.001 < r < -0.0004 Slightly risk loving 14
5 r = 0.000 Risk neutral 14
6 0.0004 < r < 0.001 Slightly risk averse 13
7 0.001 < r < 0.002 Risk averse 4
8 or more r > 0.002 Very risk averse 3

Table 2. Experimental Design

 Equity Share

Probability of Detection (%) 30% 50%

15 T1 T3
25 T2 T4

Table 3. Variable Description and Descriptive Statistics (a)

Variable Description Obs

Fraud Amount of additional (false) output 1600
 reported
Effort Level of contribution made to 1600
 produce output
T1-T4 = 1 if treatment is 1-4 1600
Risk averse = 1 if subject displays risk 1120
 aversion in the gamble exercise
Lagged audit = 1 if subject audited in the 1520
 previous period
Lagged penalty Penalty (scaled by 10) subject 1520
 ($10) assessed in the previous period
Lagged wealth Cumulative laboratory earnings 1520
 to date (beginning of the period)
Inverse period 1/period in the experiment 1600

Variable Mean SD Min Max

Fraud 217.58 188.89 0 500
Effort 74.11 63.31 0 200
T1-T4 0.25 0.433 0 1
Risk averse 0.61 0.488 0 1
Lagged audit 0.20 0.40 0 1
Lagged penalty 8.46 24.33 0 99.05
 ($10)
Lagged wealth 24.79 17.10 -12.17 86.00
Inverse period 0.18 0.22 0.05 1

(a) Obs = observations; SD = standard deviation; Min = minimum;
Max = maximum.

Table 4. Managerial Effort Estimations (a)

 Generalized Least-Squares Estimation of Effort

Coefficients Model 1 Model 2
T1 36.60 *** (1.48) 35.87 *** (1.59)
T2 46.41 *** (1.73) 45.33 *** (1.85)
T3 93.74 *** (2.50) 92.95 *** (2.55)
T4 86.96 *** (0.78) 85.64 *** (1.09)
Lagged audit
Lagged penalty ($10)
Lagged wealth ($100)
Inverse period 4.08 (3.17)
Wald chi-square statistic 15,195.04 *** 11,345.17 ***

Generalized Least-Squares Estimation of Effort

Coefficients Model 3

T1 31.51 *** (3.26)
T2 40.70 *** (3.34)
T3 89.39 *** (4.50)
T4 80.53 *** (3.78)
Lagged audit -4.78 * (2.79)
Lagged penalty ($10) 0.07
Lagged wealth ($100) 0.07
Inverse period 22.72 ** (10.41)
Wald chi-square statistic 8158.68 ***

All models were estimated assuming panel-specific heteroskedastic
errors. Standard errors of the coefficients are reported in
parentheses.

* Significant at the 10% level.

** Significant at the 5% level.

*** Significant at the 1% level.

Table 5. Fraud Estimations

 Random-Effects Tobit Estimation of Fraud (a)

Coefficients Model 1 Model 2

T1 198.27 *** (46.53) 200.67 *** (46.81)
T2 168.68 *** (38.75) 171.04 *** (39.03)
T3 387.50 *** (40.86) 389.81 *** (41.10)
T4 264.77 *** (55.97) 267.05 *** (56.46)
Risk averse -72.28 * (38.14) -72.26 (38.08)
Inverse period -13.36 (28.98)
Lagged audit
Lagged penalty ($10)
Lagged wealth ($100)
Wald chi-square statistic 116.78 *** 117.36 ***
Log likelihood -5123.82 -5123.71

Coefficients Model 3 Marginal Effects

T1 188.55 *** (41.72) 121.57 *** (24.82)
T2 136.17 *** (48.74) 88.82 *** (30.41)
T3 338.08 *** (47.72) 208.55 *** (22.37)
T4 239.35 *** (48.33) 153.01 *** (26.82)
Risk averse -80.19 *** (29.72) -52.60 *** (19.14)
Inverse period -103.16 (82.50) -67.75 (54.06)
Lagged audit 25.82 (22.50) 16.97 (14.77)
Lagged penalty ($10) 2.34 *** (0.41) 1.53 *** (0.28)
Lagged wealth ($100) 1.35 ** (0.64) 0.89 ** (0.42)
Wald chi-square statistic 319.90 ***
Log likelihood -4821.96

Standard errors of the coefficients are reported in parentheses.

* Significant at the 10% level.

** Significant at the 5% level.

*** Significant at the 1% level.

Table 6. Theoretical Point Predictions (a)

Treatment Predicted Effort Conditional Mean Effort Predicted Fraud

1 32 31.51 (0.02) 200-220
2 32 40.70 (6.81) 0
3 80 89.39 (4.34) 500
4 80 80.53 (0.02) 200-220

Treatment Predicted Fraud Conditional Mean Fraud

1 200-220 188.55 (0.26)
2 0 136.17 (7.81)
3 500 338.08 (11.51)
4 200-220 239.35 (0.37)

(a) Chi-square statistics are reported in parentheses. Values reported
in italics are statistically different from predicted levels at the
5% level of significance.