首页    期刊浏览 2025年02月24日 星期一
登录注册

文章基本信息

  • 标题:An analysis of alternative methodologies and interpretations of mortgage discrimination research using simulated data.
  • 作者:Brown, Christopher L. ; Simpson, W. Gary
  • 期刊名称:Academy of Banking Studies Journal
  • 印刷版ISSN:1939-2230
  • 出版年度:2010
  • 期号:January
  • 语种:English
  • 出版社:The DreamCatchers Group, LLC
  • 摘要:For the past several years, researchers have focused on investigating discrimination in the residential mortgage market. New data and methodologies have been employed and new theoretical lending models have been developed to explain why loan approval rates are higher for non-minority applicants than for minority applicants. There has been intense debate on the significance of default rates in identifying or ruling out discriminatory lending and on the appropriate methodology to use in testing for discrimination. This paper reviews the debate and uses simulated data to provide conclusive evidence on the merits of the alternative theories and methodologies. Understanding the relationship of default rates as potential indicators of discrimination and assuring that the methodology used in these studies is appropriate is very important because the findings of discrimination studies may be used to influence public policy.
  • 关键词:Default (Finance);Mortgage banks;Mortgage discrimination;Mortgages;Simulation;Simulation methods

An analysis of alternative methodologies and interpretations of mortgage discrimination research using simulated data.


Brown, Christopher L. ; Simpson, W. Gary


INTRODUCTION

For the past several years, researchers have focused on investigating discrimination in the residential mortgage market. New data and methodologies have been employed and new theoretical lending models have been developed to explain why loan approval rates are higher for non-minority applicants than for minority applicants. There has been intense debate on the significance of default rates in identifying or ruling out discriminatory lending and on the appropriate methodology to use in testing for discrimination. This paper reviews the debate and uses simulated data to provide conclusive evidence on the merits of the alternative theories and methodologies. Understanding the relationship of default rates as potential indicators of discrimination and assuring that the methodology used in these studies is appropriate is very important because the findings of discrimination studies may be used to influence public policy.

The mortgage discrimination debate intensified with the release of a study conducted by researchers at the Federal Reserve Bank of Boston (Munnell, Tootell, Browne and McEneaney (1996)). The study employs the most comprehensive loan application information of any of the recent discrimination studies. The authors use a logit regression equation that includes all of the variables that should be relevant to the loan decision. The race of the applicant is included as an additional explanatory variable. If the coefficient on race is significant, it is interpreted as evidence of discrimination.

Munnell et. al. (1996) find minority applicants, on average, have greater debt burdens, higher loan-to-value ratios, and weaker credit histories than non-minorities. Furthermore, denied minorities have lower income and wealth, higher obligation and loan-to-value ratios, and worse credit histories than denied non-minorities. Despite these facts, the authors find evidence of discrimination against minorities. They find that minority applicants are rejected 60 percent more often than non-minority applicants when financial, employment, and neighborhood characteristics are held constant.

The findings of Munnell et. al. (1996) have received a great deal of attention from policymakers and academic researchers. Most of the debate focuses on perceived shortcomings in the study. Criticisms of the Munnell et. al. (1996) study include problems with the integrity of the data (Horne (1994), Liebowitz (1993), and Carr and Megbolugbe (1993)), the authors' failure to consider default rates (Becker (1993), England (1993), and Brimelow and Spencer (1993)), and problems with the use of a single-equation logit regression model of the probability of loan approval to detect discrimination.

More recently, Blank et al (2005) investigated racial discrimination in mortgage lending in Washington, DC. Using three different methodologies, a dissimilarity index approach, a three-way crosstabulation approach and a logistic regression. The adjusted dissimilarity index approach is based on the theory that, after considering for differences in neighborhood factors and using variables on the loan applicants that are available through HMDA, approval rates should be approximately the same across census tracts. Blank et al (2005) find there is a disparity between census tracts. They conclude that 10.64 percent of loans that should have gone to underserved census tracts were denied. That amounts to 1,315 loans. The crosstabulation approach simply evaluates whether there is a disparity in lending based on only income and race. After considering income, they find a significant difference in the proportion of loans denied between minorities and non-minorities across all income levels. The third approach is a logistic regression model. The model includes the race of the applicant along with neighborhood characteristics. They again find that minorities are less likely to receive loans, after accounting for the factors in the model.

The findings by Blank et al (2005) do not necessarily indicate discrimination. None of the methodologies employed by Blank et al (2005) include the credit history of the applicants. As explained in more detail in Section II, the distribution of credit quality may explain differences in loan approval rates even after considering all of the variables used in the Blank et al (2005) study.

Section II discusses the role of default rates in interpreting the results of mortgage discrimination research. A model of the relationship between loan denial and default rates and discrimination developed by Ferguson and Peters (1995) is presented. Section III discusses the criticisms of the use of a single-equation logit regression equation to measure mortgage discrimination. Section III also presents the reverse regression methodology LaCour-Little (1996) applies to the Munnell et. al. (1996) data to test for mortgage discrimination. Weaknesses in the reverse regression methodology are also discussed in Section III. Section IV presents the simulation analysis used to show the relationship between default rates and discrimination and to compare the performance of a single-equation logit regression model and the reverse regression model in identifying discriminatory lending. The simulation results are presented in Section V. The conclusion and recommendations for future research are presented in Section VI.

THE ROLE OF DEFAULT RATES IN MORTGAGE DISCRIMINATION RESEARCH

Becker (1993) and England (1993) argue that, if discrimination exists, minorities should have lower default rates than non-minorities. They contend that failing to observe lower default rates for minority borrowers is evidence against racial discrimination in mortgage lending. Brimelow and Spencer (1993) also use this reasoning to challenge the findings of Munnell et. al. (1996). They cite the Boston Fed's finding that the average default rate for minority neighborhoods in Boston is the same as the rate for non-minority neighborhoods. Brimelow and Spencer (1993) argue that equal default rates for minority and non-minority neighborhoods contradicts the Munnell et. al. (1996) conclusion that Boston area lenders discriminate against minority applicants.

Munnell et. al. (1996), Tootell (1996), Browne and Tootell (1995), Galster (1993), and Ferguson and Peters (1995) argue that racial discrimination in the mortgage market will result in lower default rates for minority borrowers only if certain conditions hold. Tootell (1993) and Browne and Tootell (1995) argue equal minority and non-minority default rates can only be used as evidence of discrimination if the distribution of the quality of accepted minority applicants is identical to the distribution of accepted non-minority applicants.

Ferguson and Peters (1995) explain the relationship between denial rates, default rates, and the distribution of credit quality. They present a model where the distribution of credit quality is higher for non-minority applicants than for minority applicants. This is referred to as heterogeneous credit quality. Recent empirical evidence tends to support the hypothesis that the distribution of credit quality is heterogeneous.

Let q represent the probability of loan repayment. The distributions of credit quality for minority applicants, h(q), and for non-minority applicants, g(q), are shown in Table 1. For simplicity, assume q is measured without error. All applicants with q above some arbitrary cutoff point, [q.sup.*] are approved and applicants with q below [q.sup.*] are denied. All applicants face the same cutoff point, but the average credit score for approved minority applicants, [q.sub.h], is lower than the average credit score for approved non-minority applicants, [q.sub.g].

[TABLE 1 OMITTED]

The Ferguson and Peters (1995) model predicts that approved non-minority applicants will, on average, have higher credit quality than approved minority applicants. The Ferguson and Peters (1995) model indicates that if minorities have a lower distribution of credit quality than nonminorities, they may have higher default rates even if discrimination is present.

MODEL SPECIFICATION ISSUES IN MORTGAGE DISCRIMINATION RESEARCH

Rachlis and Yezer (1993) and Yezer, Phillips, and Trost (1994) argue that single-equation models cannot be used to test for discrimination because of the complexity of the mortgage lending process. The single-equation models do not take into consideration the borrower's choice of loan terms or the borrower's default decision. Yezer, Phillips, and Trost (1994) show the effects of simultaneity and self-selection bias that result from using a single-equation model of the loan approval decision to detect mortgage discrimination. They show that the coefficient on the discrimination variable will be biased upwards.

LaCour-Little (1996) also criticizes the methodology used by Munnell et. al. (1996). He also contends that the direct logit regression model produces a biased estimate of the discrimination coefficient. However, his line of reasoning is different than Yezer, Phillips, and Trost (1994). LaCour-Little (1996) implies that the logit regression methodology is inappropriate when one group of applicants has a lower distribution of credit quality than the other group. He states that differences in average credit quality require the use of a different methodology. He proposes reverse regression as an alternative methodology to detect discrimination in mortgage lending.

LaCour-Little (1996) uses reverse regression on the data from Munnell et. al. (1996) to test for discrimination. First, he estimates a direct logit regression equation with eleven independent variables used in Munnell et. al. (1996). The dependent variable, ACTION, equals 1 if the loan was denied. There is no race coefficient in this model. The coefficients generated from the regression are used to estimate the probability of loan denial for each observation. The predicted probabilities are considered the inverse qualifications index, Q-INDEX. The Q-INDEX variable is a measurement of the probability of loan denial, therefore higher values of Q-INDEX are bad.

The Q-INDEX values are used as the dependent variable in the following ordinary least squares regression:

Q-INDEX = [b.sub.0] + [b.sub.1] ACTION + [b.sub.2] RACE = e

where ACTION equals 1 if the loan was denied and 0 if the loan was approved, and RACE equals 1 if the applicant is a minority and 0 if the applicant is a non-minority. LaCour-Little (1996) contends the coefficient on RACE measures "the excess probability of default required to turn down a minority applicant." LaCour-Little (1996) also calculates a value, a*, that is a measure of the average qualifications of accepted minority applicants relative to accepted non-minority applicants. This measure is calculated as shown:

[a.sup.*] = -[b.sub.2]/[b.sub.1].

The results of the LaCour-Little (1996) reverse regression on the loan application data are shown in Table 2. LaCour-Little (1996) concludes the [a.sup.*] value of -.193 indicates that accepted minority applicants had average qualifications 19 percent lower than accepted white applicants. He interprets the RACE coefficient of .057 as the excess probability of default required to reject a minority applicant. He concludes that lenders appear to apply less stringent underwriting standards to minority loan applications, and that there is evidence of reverse discrimination.

The statistical analysis conducted by LaCour-Little (1996) may be accurate, but the conclusions derived from the analysis are questionable. LaCour-Little (1996) uses the finding that accepted minority applicants have average qualifications 19 percent lower than accepted non-minority applicants to conclude that there is evidence of reverse discrimination. However, the finding that accepted minorities have lower average qualifications than accepted non-minority applicants is not evidence of discrimination. This is the expected outcome based on the Ferguson and Peters (1995) model.

The correct interpretation of the reverse regression methodology employed by LaCour-Little (1996) is relatively straightforward. Since the ACTION and RACE variables are both binary, there are only four possible outcomes calculated by the model. Those four outcomes represent the average Q-INDEX values for (1) approved non-minority applicants (ACTION=0, RACE=0), (2) approved minority applicants (ACTION=0, RACE=1), (3) denied non-minority applicants (ACTION=1, RACE=0), and (4) denied minority applicants (ACTION=1, RACE=1).

LaCour-Little (1996) finds that approved white applicants have an average Q-INDEX of .0892 while approved minority applicants have a Q-INDEX of .1462. These results are consistent with the Ferguson and Peters (1995) model and the findings by Munnell et. al. (1996) that minority applicants, on average, have lower income levels, higher debt ratios, and worse credit histories than non-minority applicants. The findings do not indicate reverse discrimination or the absence of discrimination. The only way to prove discrimination (or reverse discrimination) is to use a model that will determine if the marginal cutoff point is the same for minority and non-minority applicants. When the distribution of credit quality is heterogeneous, calculations of differences in average credit quality do not provide useful information about the presence or absence of discrimination.

SIMULATION ANALYSIS

Simulation analysis is conducted to identify the role of default rates in identifying discrimination and to compare the performance of the reverse regression model and the logit regression model in detecting discrimination when non-minority and minority applicants have heterogeneous credit quality. Assume (following Ferguson and Peters (1995)) the screening process leads to a single credit score. The credit score reflects the probability of repayment, q. A uniform, nondiscriminatory credit policy requires that all applicants who meet some minimum required credit score, [q.sup.*], be approved.

Assume the lender collects all relevant information and inputs it into a logit regression equation to predict the probability the loan will be repaid if originated. The predicted value of the dependent variable from the logit regression is the credit score, q. To be consistent with LaCourLittle (1996), the credit score is converted from the probability of repayment to the probability of default (by subtracting the credit score from 100 percent). This variable, NSCORE, is used as the dependent variable in the reverse regression. It is equivalent to the inverse of the qualifications index used in LaCour-Little (1996). The independent variables in the reverse regression are the same as in LaCour-Little (1996); ACTION, which equals 1 if the loan was denied and 0 otherwise, and RACE, which equals 1 if the applicant is a minority and 0 otherwise. The logit regression model uses ACTION as the dependent variable, with NSCORE and RACE as the independent variables.

We use simulated data that contains 1000 observations on non-minority applicants and 500 observations on minority applicants. The average NSCORE for non-minority applicants is 26.4 percent with a standard deviation of 11.2 percent. The average NSCORE for minority applicants is 35.2 percent with a standard deviation of 13.4 percent. The first simulation contains no discrimination. All loans that have NSCORE's above 40 percent are denied and all other loans are approved. The rejection rate for non-minority applicants is 11.3 percent and the rejection rate for minority applicants is 36.2 percent.

The second simulation assumes discrimination exists such that non-minority applicants are approved with NSCORE's of 41 or below while minority applicants must have NSCORE's of 40 or below to be approved. The rejection rate for non-minority applicants in this simulation is 9.9 percent while the minority rejection rate remains 36.2 percent.

The third simulation assumes a wider range of discrimination exists, where non-minority applicants are approved with NSCORE's of 45 or below and minority applicants are approved with NSCORE's of 40 or below. For the third simulation, the rejection rate for non-minority applicants decreases to 5.0 percent, while the minority rejection rate remains at 36.2 percent.

If the two methodologies are reliable, they should not detect any discrimination in the first set of simulation data, but should detect discrimination in the second and third sets of simulation data. The findings should not reflect reverse discrimination for any of the three simulations.

The validity of default rate analysis is examined by comparing the average NSCORE of accepted minority applicants to the average NSCORE of accepted non-minority applicants for each simulation. Becker (1993) and others argue that, if discrimination is occurring, minorities should have lower default rates than non-minorities. Since the NSCORE variable measures the probability of default, according to Becker's (1993) theory accepted minority applicants should have lower average NSCORE's than accepted non-minority applicants if discrimination is occurring.

Since the required credit score is the same for all applicants in the first simulation, Becker's (1993) theory implies that the average NSCORE's for minorities and non-minorities should be the same for the first simulation. The second and third simulations result in lower marginal requirements for non-minorities, so Becker's (1993) theory indicates that non-minorities should have higher average NSCORE's than minorities for the second and third simulations.

RESULTS OF THE SIMULATION ANALYSIS

Default Analysis

The average NSCORE's for accepted minority and accepted non-minority applicants are shown in Table 3. The average NSCORE for accepted minorities remains 27.28 percent for all three simulations because the cutoff point does not change for minority applicants. The average NSCORE of accepted non-minority applicants is 24 percent when there is no discrimination (first simulation), 24.25 percent when there is slight discrimination (second simulation), and 25.21 percent when there is significant discrimination (third simulation).

Since the NSCORE is the probability of default, it is apparent that default rates for accepted minority applicants can be higher that default rates for accepted non-minority applicants even if discrimination is occurring. This finding is consistent with the Ferguson and Peters (1995) model. If the distribution of credit quality is higher for non-minority applicants than for minority applicants, default analysis provides no useful information on whether or not discrimination is occurring.

The Performance of the Logit and Reverse Regression Models in Identifying Discriminatory Lending

The results of the reverse regression, calculations of [a.sup.*], and the results of the logit regression are shown in Table 4. Using the reverse regression methodology, the RACE coefficient is positive and significant in all three simulations, and [a.sup.*] ranges in value from -.078 in the third simulation to -.156 in the first simulation. These findings are consistent with the results obtained by LaCour-Little (1996), which led him to conclude that there was evidence of reverse discrimination.

The logit regression model performs well in detecting discrimination. The coefficient on RACE is insignificant in the first simulation, where there is no discrimination. The RACE coefficient is significant at the .001 level for the second simulation, where there is a slight degree of discrimination, and the RACE coefficient is significant at the .0001 level for the third simulation, where there is significant discrimination.

From the simulation analysis, it is apparent that the reverse regression methodology adapted from labor discrimination research is inappropriate for use in mortgage research. The logit regression methodology clearly outperforms the reverse regression methodology in detecting discrimination.

CONCLUSIONS AND RECOMMENDATIONS FOR FURTHER RESEARCH

The findings indicate that we cannot focus on default analysis to answer the question of whether or not discrimination is occurring. The findings also indicate that the reverse regression methodology is not appropriate for mortgage discrimination research and that a well-specified logit regression model may be used to identify one aspect of discriminatory lending. It is important to recognize that the logit model can only identify discrimination that occurs after the loan application has been completed. Discrimination in pre-screening applicants, or differences in the assistance provided to applicants in completing the application process cannot be measured using the single-equation logit methodology.

One recommendation for future research is to study the distributional characteristics of credit quality and attempt to measure the differences in the distribution of credit quality between minority and non-minority applicants. This will be a daunting task, but if the distributional properties are better understood, we can devise a better method for detecting mortgage discrimination.

REFERENCES

Becker, Gary S. (1993). The Evidence Against Banks Doesn't Prove Bias. Business Week, April 13.

Blank, Emily C., P. Venkatachalam,, L. McNeil, and R.D. Green (2005). Racial Discrimination in Mortgage Lending in Washington, D.C.: A Mixed Methods Approach. The Review of Black Political Economy, 33(2), 1-30.

Brimelow, P., and L. Spencer (1993). The Hidden Clue. Forbes, January 4.

Browne, L.E., and G. M.B. Tootell (1995). Mortgage Lending in Boston - A Response to the Critics. New England Economic Review, Sept./Oct., 53-78.

Carr, J. H., and I.F. Megbolugbe (1993). The Federal Reserve Bank of Boston Study on Mortgage Lending Revisited. Journal of Housing Research, 4(2), 277-313.

England R. S. (1993). Washington's New Numbers Game. Mortgage Banking, September, 38-54.

Ferguson, M. F., and S. R. Peters (1995). What Constitutes Evidence of Discrimination in Lending? Journal of Finance, 50, 739-748.

Galster, G. C. (1993) The Facts of Lending Discrimination Cannot Be Argued Away by Examining Default Rates. Housing Policy Debate, 4(1), 141-146.

Horne, D.K. (1994). Evaluating the Role of Race in Mortgage Lending. FDIC Banking Review, 7(1), 1-15.

LaCour-Little, M. (1996). Application of Reverse Regression to Boston Federal Reserve Data Refutes Claims of Discrimination, Journal of Real Estate Research, 11, 1-12.

Liebowitz, S. (1993). A Study That Deserves No Credit. Wall Street Journal, September 1, A14.

Munnell, A.H., G.M.B. Tootell, L.E. Browne, and J.E. McEneaney (1996). Mortgage Lending In Boston: Interpreting HMDA Data," American Economic Review, 86, 25-53.

Rachlis, M.B., and A.M.J. Yezer (1993). Serious Flaws in Statistical Tests for Discrimination in Mortgage Markets. Journal of Housing Research, 4(2), 315-336.

Tootell, G.M.B. (1996). Redlining in Boston: Do Mortgage Lenders Discriminate Against Minority Neighborhoods? Quarterly Journal of Economics, November, 1049-1079.

Yezer, A., R. Phillips, and R. Trost (1994). Bias in Estimates of Discrimination And Default in Mortgage Lending: The Effects of Simultaneity and Self-Selection. Journal of Real Estate Finance and Housing, 9, 197-215.

Christopher L. Brown, Western Kentucky University

W. Gary Simpson, Oklahoma State University
Table 2: LaCour-Little (1996) Reverse Regression Results

      Variable          Parameter Estimate

Intercept                     .0892
ACTION                         .296
RACE                           .057

[a.sup.*] = -.057/.296 = -.196

Table 3: Average NSCORE for Accepted Applicants

                 Simulation 1    Simulation 2    Simulation 3

Minority             27.28           27.28           27.28
Non-Minority         24.00           24.25           25.21

Table 4: Results of the Simulation Analysis

Reverse Regression Results

Simulation 1: No Discrimination

  Variable     Parameter Estimate   T-Statistic        P-Value

Intercept            .2397             84.7            <.0001
ACTION               .2167             36.6            <.0001
RACE                 .0339              6.8            <.0001

[a.sup.*] = -.0339/.2167 = -.156 Adjusted [R.sup.2] = .528

Simulation 2: Slight Discrimination

  Variable     Parameter Estimate   T-Statistic        P-Value

Intercept            .2425             84.9            <.0001
ACTION               .2187             35.5            <.0001
RACE                 .0303              5.9            <.0001

[a.sup.*] = -.0303/.2187 = -.143 Adjusted [R.sup.2] = .514

Simulation 3: Significant Discrimination

  Variable     Parameter Estimate   T-Statistic        P-Value

Intercept            .2529             84.7            <.0001
ACTION               .2255             30.7            <.0001
RACE                 .0175              3.1             .0019

[a.sup.*] = -.0175/.2255 = -.078
Adjusted [R.sup.2] = .451

Logit Regression Results

Simulation 1: No Discrimination -2logL = 1461 w/2 df p<.0001

  Variable     Parameter Estimate   Chi-Square         P-Value

Intercept            167.8             23.85           <.0001
NSCORE              -418.8             23.81           <.0001
RACE                  -0.8              0.01            .9133

Simulation 2: Slight Discrimination -2logL = 1410 w/2 df p<.0001

  Variable     Parameter Estimate   Chi-Square         P-Value

Intercept            111.8             35.46           <.0001
NSCORE              -272.9             35.42           <.0001
RACE                  -2.5             11.36            .0007

Simulation 3: Significant Discrimination -2logL = 1257 w/2 df p<.0001

  Variable     Parameter Estimate   Chi-Square         P-Value

Intercept            107.9             31.16           <.0001
NSCORE              -239.9             31.11           <.0001
RACE                 -11.9             28.79           <.0001
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有