Skip to main content
Log in

Human capital versus signaling is empirically unresolvable

  • Published:
Empirical Economics Aims and scope Submit manuscript

Abstract

Economists offer two main explanations for the causal labor market returns to education. The first is human capital accumulation: Education improves ability. The second is signaling: Education allows high-ability students to distinguish themselves. A major point of interest is the relative contributions of these effects. I demonstrate the theoretical and empirical conditions necessary to identify the relative contribution of the two models. Then, I review the existing literature to evaluate whether the feasible set of empirical estimates is capable of meeting those conditions and so informing theory. Empirical evidence is capable of rejecting pure human capital and signaling models and usually does so. I argue that, for the general question of relative contribution, necessary identification conditions are not met, and partial identification bounds are wide. Two models with different nonzero contributions of human capital and signaling cannot be empirically distinguished, limiting the usefulness of human capital versus signaling as a framing for understanding the return to education and for policy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from $39.99 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
The alternative text for this image may have been generated using AI.
Fig. 2
The alternative text for this image may have been generated using AI.
Fig. 3
The alternative text for this image may have been generated using AI.
Fig. 4
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Notes

  1. Throughout the paper, I use the term signaling to also refer to the screening hypothesis (Arrow 1973; Stiglitz 1975; Wolpin 1977), which is similar but differs in timing and some implications (Stiglitz and Weiss 1994).

  2. This is contrary to the quote from Lang and Kropp (1986), “In fact, many members of the profession maintain (at least privately) that these hypotheses cannot be tested against each other and that the debate must therefore be relegated to the realm of ideology.” Their claim is stronger still, in that it implies that human capital and signaling effects cannot be identified at all, as opposed to being unable to identify the relative importance of either model.

  3. The model is general but is still by necessity a simplification, and there are several obvious variations. Depending on what is considered as an outcome, some outcomes may be considered mediating variables sometimes: for example, education may affect the occupation held, which affects earnings, but also individual productivity and thus returns through job match (Van Der Velden and Bijlsma 2016) and aggregate productivity through production complementarities (Kremer 1993). Mediating variables may also affect each other in some way, such as how having a degree can impact a potential employer’s beliefs about a employee’s skills. While not pictured, these complexities are generally understood and incorporated into the discussion of identification.

  4. Standard analysis of causal diagrams would instead specify the effect of education via a mediator as the product of the effect of education on the mediator and the effect of the mediator on the outcome. My approach is equivalent to the standard if we assume that only one of \(\kappa _j\), \(\sigma _j\), or \(\omega _j\) is nonzero \(\forall \ j\), i.e., that all measured mediators are specific enough to have clear theoretical interpretation. This variation on notation is for the purpose of allowing for unclear theoretical interpretation of observed mediators, which Sect. 4.1 will show to be necessary.

  5. Each of these steps is done either by controlling fully for endogenous selection pressures or by utilizing exogenous selection pressures such as instrumental variables (Pearl 2009; Morgan and Winship 2014).

  6. This is, in effect, the “front-door method” in the case of multiple mediators, as in Bellemare and Bloem (2019).

  7. These calculations assume that \(\omega _j = 0 \ \forall \ j\) and must be modified otherwise, depending on whether the researcher wants to estimate the importance of signaling relative to the total return, or relative to human capital.

  8. Throughout this section, I refer to the identification of individual \(\kappa _j\) or \(\sigma _j\) parameters; it is not necessary in all cases that these individual parameters are identified as long as they are included in some aggregate estimate. For example, if \(\chi ^C\) is empty, then by blocking \(\chi _\kappa \) while estimating the overall effect of education, the aggregate \(\varSigma \) can be identified without identifying the individual \(\sigma _j\)s. This does not change the argument, and so I ignore the possibility of identifying these parameters in aggregate for simplicity of explanation.

  9. This assumes that the human capital and signaling effects are both nonnegative.

  10. I use this same paper as an example many times in this paper. I pick Arteaga (2018) specifically for this because it applies to many arguments, and because the work itself is of high quality and its empirical results are believable, so any issues I point out can be attributed to the flexibility of the human capital and signaling models rather than flaws in the paper.

  11. To demonstrate the precise argument being made here, consider a student who learns Shakespeare in college, and then makes a Shakespeare reference during a job interview, impressing the interviewer and getting the job. This knowledge of Shakespeare is a skill acquired in education, and improved their earnings, even though it may have no effect on productivity.

  12. The Arcidiacono et al. (2010) empirical result that there is no employer learning for college graduates is disputed (Light and McGee 2015). However, the use of the Arcidiacono study here does not rely on their empirical result.

  13. I refer here specifically to estimates that compare the returns to education between degree-granting and non-degree-granting years. Several of the arguments presented here that interpret sheepskin effects in human capital terms do not apply to natural experiments that estimate the return to holding a degree in other ways, like Tyler et al. (2000). These will be addressed in Sect. 4.3.

  14. These figures come from a simulation using 1,000,000 students, with log ability regressed on schooling level indicators to estimate returns.

  15. \(\beta _3 = 0\) and \(\beta _3 > 0\) here can be substituted with \(\beta _3 = \epsilon _3\) and \(\beta _3 > \epsilon _3\).

  16. There is reason to doubt that the effects are actually zero—there is no shortage of studies that find effects of various educational interventions on test scores. The well-established ability to affect test scores at the margin implies a general effect of education on measurable ability, although an argument could be made that the effect is small with some definition of small.

  17. For any theoretical framing, there is always a deeper level on which mechanisms are not explained. Under a pure human capital model, for example, why does education improve skills? This is an unaddressed, deeper mechanism.

References

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nick Huntington-Klein.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

I thank Oded Gurantz, Tyler Ransom, Jacob Vigdor, anonymous reviewers and the associate editor, and seminar attendees at the University of California Santa Barbara for comments and suggestions.

The effect of education on the labor market in potential outcomes notation

The effect of education on the labor market in potential outcomes notation

This appendix draws from Imai et al. (2010), albeit with the use of superscript counterfactual notation rather than function notation.

We are interested in the causal effect of education \(Ed_i\) on some outcome \(Y_i\) for individual i. For simplicity, assume that the margin of interest compares \(Ed_i = 1\) against \(Ed_i = 0\). Let \(Y_i^{C}\) be the possibly counterfactual outcome for individual i under the condition C.

The average causal effect of education is

$$\begin{aligned} E(Y_i^{Ed_i=1} - Y_i^{Ed_i=0}) \equiv B \equiv \varSigma + K + \varOmega \end{aligned}$$
(3)

where \(\varSigma \), K, and \(\varOmega \) are the parts of the education return attributable to signaling, human capital, and other explanations, respectively.

In observed data, either \(Y_i^1\) or \(Y_i^0\) is missing, such that the raw relationship between \(Y_i\) and \(Ed_i\) does not identify the effect of education:

$$\begin{aligned} E(Y_i|Ed_i =1) - E(Y_i|Ed_i = 0) \ne E(Y_i^{Ed_i=1} - Y_i^{Ed_i=0}) \end{aligned}$$
(4)

However, for the purposes of this paper, we will assume that the causal effect of education can be identified. For simplicity, assume that we do this by conditioning on a set of controls \(W_i\) such that

$$\begin{aligned} E(Y_i|Ed_i =1, W_i) - E(Y_i|Ed_i = 0, W_i) = E(Y_i^{Ed_i=1} - Y_i^{Ed_i=0}) \end{aligned}$$
(5)

The effect of \(Ed_i\) on \(Y_i\) is fully mediated by a set of mediating variables \({x_1,x_2,\ldots ,x_J}\). For simplicity of notation, assume that each of these mediators is binary. We have

$$\begin{aligned}&\displaystyle E(Y_i^{Ed_i=1} - Y_i^{Ed_i=0} | x_1,x_2,\ldots ,x_J) = 0 \end{aligned}$$
(6)
$$\begin{aligned}&\displaystyle E(Y_i^{x_{ij}=1}-Y_i^{x_{ij}=0}) \equiv \sigma _j + \kappa _j + \omega _j + \epsilon _j \ \forall \ j \in \{1, \ldots , J\} \end{aligned}$$
(7)

where \(\sigma _j\), \(\kappa _j\), \(\omega _j\), and \(\epsilon _j\) are the parts of the effect of \(x_j\) on Y that are attributable to signaling, human capital, other educational explanations, and other non-educational explanations. \(\varSigma = \sum _j \sigma _j, K = \sum _j \kappa _j\), and \(\varOmega = \sum _j \omega _j\). Isolating the part of each mediator that is driven by education excludes non-educational explanations such that

$$\begin{aligned} E\left( Y_i^{Ed_i = 1, x_{ij}^{Ed_i = 1}}-Y_i^{Ed_i = 0, x_{ij}^{Ed_i = 0}}\right) = \sigma _j + \kappa _j + \omega _j \ \forall \ j \in \{1, \ldots , J\} \end{aligned}$$
(8)

and for most points of discussion assume also that \(\omega _j = 0 \ \forall \ j\).

Divide the set \({x_1,x_2,\ldots ,x_J}\) into the subset \(\chi _\sigma \) for which \(\kappa _j = 0 \ \forall \ x_j \in \chi _\sigma \), \(\chi _\kappa \) for which \(\sigma _j = 0 \ \forall \ x_j \in \chi _\kappa \), and \(\chi ^C\) for which \(\kappa _j \ne 0\) and \(\sigma _j \ne 0 \ \forall \ x_j \in \chi ^C\). For an element of \(\chi ^C\), we could identify \(\sigma _j\) or \(\kappa _j\) separately if there were a way to vary the mediator while holding the other explanation constant, or by varying only the part of the mediator associated with one explanation, expressed as

$$\begin{aligned} E\left( Y_i^{Ed_i = 1, x_{ij}^{Ed_i = 1}|\kappa _j}-Y_i^{Ed_i = 0, x_{ij}^{Ed_i = 0}|\kappa _j}\right) = \sigma _j \end{aligned}$$
(9)

The share of the educational return that is due to, for example, signaling, is defined as \(\varSigma /B = (B-K)/B = \varSigma /(\varSigma +K)\) under the assumption that \(\varOmega =0\). Identifying this share requires following the steps given in Sect. 2, which includes the tasks of identifying at least some of:

  • The total effect of education B, which is \(E\left( Y_i^{Ed_i=1} - Y_i^{Ed_i=0}\right) \)

  • The part of the effect of education that goes through \(\chi _\sigma \), which is

    \(\sum _{j|x_j\in \chi _\sigma }E\left( Y_i^{Ed_i = 1, x_{ij}^{Ed_i = 1}}-Y_i^{Ed_i = 0, x_{ij}^{Ed_i = 0}}\right) \)

  • The part of the effect of education that goes through \(\chi _\kappa \), which is

    \(\sum _{j|x_j\in \chi _\kappa }E\left( Y_i^{Ed_i = 1, x_{ij}^{Ed_i = 1}}-Y_i^{Ed_i = 0, x_{ij}^{Ed_i = 0}}\right) \)

  • The parts of the effect of education that go through \(\chi ^C\) that are signaling-related, which is

    \(\sum _{j|x_j\in \chi ^C}E\left( Y_i^{Ed_i = 1, x_{ij}^{Ed_i = 1}|\kappa _j}-Y_i^{Ed_i = 0, x_{ij}^{Ed_i = 0}|\kappa _j}\right) \)

  • The parts of the effect of education that go through \(\chi ^C\) that are human capital-related, which is

    \(\sum _{j|x_j\in \chi ^C}E\left( Y_i^{Ed_i = 1, x_{ij}^{Ed_i = 1}|\sigma _j}-Y_i^{Ed_i = 0, x_{ij}^{Ed_i = 0}|\sigma _j}\right) \)

with the particular elements that must be identified varying depending on which explanation is being identified and what strategy is being taken for following the steps in Sect. 2.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huntington-Klein, N. Human capital versus signaling is empirically unresolvable. Empir Econ 60, 2499–2531 (2021). https://doi.org/10.1007/s00181-020-01837-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1007/s00181-020-01837-z

Keywords

JEL