Abstract Relative age effects (RAEs) occur when those who are relatively older for their age group are more likely to succeed. RAEs occur reliably in some educational and athletic contexts, yet the causal mechanisms remain unclear. Here we provide the first direct test of one mechanism, selection bias, which can be defined as evaluators granting fewer opportunities to relatively younger individuals than is warranted by their latent ability. Because RAEs are well-established in hockey, we analyzed National Hockey League (NHL) drafts from 1980 to 2006. Compared to those born in the first quarter (i.e., January–March), those born in the third and fourth quarters were drafted more than 40 slots later than their productivity warranted, and they were roughly twice as likely to reach career benchmarks, such as 400 games played or 200 points scored. This selection bias in drafting did not decrease over time, apparently continues to occur, and reduces the playing opportunities of relatively younger players. This bias is remarkable because it is exhibited by professional decision makers evaluating adults in a context where RAEs have been widely publicized. Thus, selection bias based on relative age may be pervasive.

Citation: Deaner RO, Lowen A, Cobley S (2013) Born at the Wrong Time: Selection Bias in the NHL Draft. PLoS ONE 8(2): e57753. https://doi.org/10.1371/journal.pone.0057753 Editor: Aldo Rustichini, University of Minnesota, United States of America Received: September 20, 2012; Accepted: January 29, 2013; Published: February 27, 2013 Copyright: © 2013 Deaner et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: Grand Valley State University supported this research. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing interests: The authors have declared that no competing interests exist.

Introduction Relative age effects (RAEs) occur when those who are relatively older for their age group are more likely to succeed. RAEs are perhaps most often associated with Canadian ice hockey, where nearly 40% of players on elite junior teams are born in the first quarter of the year, meaning that, because of a January 1 cut-off date, they would have been consistently older than their age group peers [1]–[5]. RAEs have been described in numerous athletic and educational contexts, and there is consensus that RAEs are unfair to relatively younger individuals and that the implied loss of talent may negatively impact societies [6]–[14] (but see [15], [16]). Eliminating RAEs has proven challenging, however, in part because of difficulties in identifying which of the plausible mechanisms are causal [7], [10], [11], [13], [17]. For example, all else being equal, the youngest boy on a youth hockey team will be smaller, weaker, less emotionally mature, and less skillful; these disadvantages may lead to other ones, including diminished confidence, less instruction from coaches, reduced likelihood of being selected for elite teams, and, ultimately, a greater likelihood of dropping out of the sport. Thus, there are many potentially causal mechanisms, and they are likely to interact. Here, we attempt to isolate and demonstrate the occurrence of one mechanism: selection bias. By selection bias, we mean that evaluators (e.g., teachers, coaches) mistakenly grant fewer opportunities (e.g., instruction, access to elite group or team) to relatively younger individuals than is warranted by their latent ability or talent. Although selection bias is widely accepted as a contributor to RAEs [7], [10], [11], [13], [17], there is little direct evidence for it (but see [10], [11]), apparently owing to the difficulty of obtaining measures of both perceived and latent talent. Furthermore, in most studies consistent with selection bias, the selector's preference for relatively older individuals might be rational, given the selector's aims and the greater maturity of relatively older individuals. For example, a coach selecting a youth team may favor relatively older players because the coach's performance may be evaluated based on her team's success; similarly, a teacher may select mostly relatively older pupils for a gifted academic program because they are most likely to benefit from and contribute to it. We tested the hypothesis of selection bias by evaluating entry drafts of the National Hockey League (NHL), the world's premier professional ice hockey league. NHL teams generally draft players once they have reached 18 years of age [18], and teams select players in order of their perceived talent [19], [20]. Nevertheless, there is uncertainty regarding the career trajectory of players, and nearly half never develop sufficiently to play a single game in the NHL. The logic of our study is that a player's draft slot serves as a measure of their perceived talent whereas career productivity indicates their realized talent. If selection bias occurs, then, for any given draft slot, relatively younger players will enjoy more productive careers. The possibility of selection bias in NHL drafting was suggested by a recent study [3]. It reported that 40% of Canadian-born first round NHL draft selections from 2007–2010 (n = 62) were born in the first quarter, but only 28% of Canadian-born NHL players were born in the first quarter. Moreover, in a large sample (n = 1,003), first quarter born players comprised fewer than 25% of Canadian All-Star and Olympic team members and had substantially shorter NHL careers than players born later in the year. We expanded on this study in several ways. First, our analyses of productivity included all selections from all draft rounds for a period of 27 years, meaning that our sample size of draftees was 44 times larger (n = 2,736 Canadian-born non-goaltenders). Second, we used analytic methods to directly test for selection bias while controlling for potentially confounding variables. Most crucially, we tested whether birth quarter was associated with productivity once draft slot was controlled; this is vital because relatively younger players may be drafted earlier [21]. Third, we investigated a potential mediator of selection bias, the decision to become draft eligible. Fourth, we tested for changes in selection bias over time. Finally, we examined whether selection bias reduces relatively younger individuals' playing opportunities.

Discussion Here, we have demonstrated that, for 27 years, relatively younger NHL draftees have enjoyed substantially more productive careers than would be predicted by their draft slots. Moreover, the pattern of drafting far fewer relatively younger individuals has not waned and has occurred again in recent years, including in the 2012 draft (Fig. 1). Before we explore the implications of this selection bias, we must address three questions. First, are our findings novel, given earlier studies? It was previously known that the unequal distribution of players by birth quarter is greater in junior hockey than in the NHL [2], [3], [5]. This pattern is consistent with selection bias but does not require it. This is because relatively younger players in junior hockey might, on average, be more talented than relatively older players, and NHL teams might recognize this and draft accordingly. In fact, a previous study reported that relatively younger players were drafted earlier [21], and we confirmed the effect with a larger sample. Thus, if NHL teams simply drafted based on (unbiased) perceptions of talent, there would be no relationship between relative age and productivity. We showed, however, that there is a strong relationship, even when draft slot is controlled. Our findings are also novel in that they apparently constitute the first demonstration that, in sports, selection based on relative age can be irrational (see below). As noted in the Introduction, although many studies show that relatively older players are more likely to be selected for youth elite teams, none of these studies address whether these selection patterns are disadvantageous for the teams and selectors given that relatively older players generally have greater ability at the time of selection. Because we had good measures of both perceived talent (i.e., draft slot) and realized talent (i.e., career productivity), we were able to address this question directly. A second question is whether there could there be some rational basis for teams drafting relatively younger individuals later than is warranted by their career productivity. A study of the Australian Football League draft illustrates how this might occur: indigenous players significantly outperformed their draft slots; however, the authors could not rule out that this selection bias might be rational because the recruitment and retention of indigenous players might require additional resources, such as assistance with housing and cultural acclimatization [25]. Similarly, a study found that South American players in the Premier division of the English professional soccer league tended to be overpaid in terms of their effects on team performance; however, this preference for South American players might be rational because the presence of these players, perhaps owing to their stylish play, was associated with increased attendance [30]. Although an explanation analogous to one of these examples cannot be logically excluded, it is difficult to conceive of a reasonable one. For example, most fans are apparently unaware of players' birthdays and do not have stereotypical beliefs about the playing style or abilities of players based on relative age. Furthermore, as we showed, the selection bias was robust to different measures of productivity and was undiminished when possible confounds were incorporated. Perhaps the most plausible idea is that relatively older individuals are rationally preferred, despite their worse long-term productivity, because they are more likely to make an early contribution to a team. In particular, a relatively older player would be absolutely older than a relatively younger one, meaning that they would be further along in their development. Contrary to such reasoning, however, draft eligibility rules dictate that players born from January 1 to September 15 of year X first potentially become draft-eligible at the same time as players born September 16 to December 31 of year X-1 [18], [19]. Paradoxically, then, relatively older individuals (e.g., born in first quarter) are absolutely younger than those in their draft-eligible cohort who were born in the fourth quarter. Furthermore, our results show that relatively older individuals are not only less likely to reach long-term career benchmarks (i.e., 400 career games); they are also less likely to even play a single game (Fig. 3; Table 1). Thus, it is reasonable to conclude that the selection bias we have shown is not in the drafting teams' best interest (i.e., it is irrational). A third question is whether there is in fact an RAE in the NHL. Representation and productivity in the NHL differed substantially by birth quarter (Fig. 2; Table 1). Nonetheless, one might argue that because the RAE is smaller in the NHL than in junior leagues [2], [3], [5] there is no mechanism actively contributing to the RAE in the NHL [3] (see also [15], [31]–[33]). In other words, the differential representation by quarter in the NHL is a legacy of past RAE mechanisms. Our demonstration that relatively younger draftees typically achieve greater careers certainly supports this view. Indeed, one might suggest that ‘the cream rises to the top,’ and, despite where they are drafted, truly talented players, whether relatively older or younger, will manage to develop their abilities and achieve long and productive careers [3], [34], [35]. This argument has some validity, yet it overlooks that being drafted ‘later than one should,’ or not being drafted at all, can be costly. Our analysis of escalation effects shows that, once in the league, players who are drafted earlier receive greater playing opportunities than is warranted by their productivity. These opportunities should allow them to more easily establish their reputations, and these opportunities can translate directly into compensation because recently drafted players can earn most of their income through playing-dependent performance bonuses [18]. Moreover, prior to the collective bargaining agreement of 1995, rookie salaries were far higher for players drafted earlier [36]. Indeed, the likely benefits of early selection are revealed in the behavior of potential draftees: they voluntarily endure days of draft evaluation sessions (e.g., interviews, physiological testing) to improve their draft prospects [19]. Thus, if there was no selection bias against relatively younger individuals, there would be more of them in the NHL, and they would produce and earn more. Our demonstration of selection bias against relatively younger draftees is important because it suggests that selection bias may occur broadly. Obviously, selecting professional hockey players differs considerably from selection in other contexts, yet we believe that, all else being equal, selection bias could be at least as great elsewhere. Consider NHL teams' draftee evaluations compared to German teachers' recommendations about their 10 year-old pupils; these recommendations concerning the suitability of secondary school tracks (e.g., basic or academic) are substantially biased against relatively younger children and have life-long effects [11], [12]. NHL teams are evaluating 18 year-olds, not 10 year-olds, meaning that any age gap is proportionally almost twice as large [13]. In fact, most discussions of RAEs state or imply that, although the consequences of RAE mechanisms may persist into adulthood, most mechanisms will be active in children or early adolescents [3] (see also [15], [31]–[33]).

NHL drafting is done by scouts and general managers who are highly motivated to make accurate evaluations due to the high-stakes involved and the scrutiny given to their decisions [19], [20], [24], [26]. By contrast, teachers making recommendations may have less information, less time for deliberation, and may face few or no consequences for poor decisions.

NHL teams are probably aware of RAEs. RAEs in hockey have been documented in the academic literature since the mid-1980s [2], [4], recognized as potentially important by national federations since the mid-1990s [5], and popularized in a bestselling book [37]. In fact, hockey scouts' evaluations frequently consider the relative age of the scouted player compared to their teammates and opponents [19]. Teachers, on the other hand, may have little or no awareness of RAEs [8], [11]. These points collectively indicate that selection bias may be a pervasive contributor to RAEs and that merely instructing selectors to account for it may be ineffective (see [8], [11]). A final question is what factor(s) causes NHL teams to underestimate the future productivity of relatively younger players. The obvious candidate is maturity-related differences in physiological (e.g., power) and technical characteristics [38]. This possibility might seem unlikely because, as noted above, individuals born in the fourth quarter are, in most cases, absolutely older than others in their draft-eligible cohort [18], [19]. However, player evaluations are based on several years of consideration and, once a player is initially evaluated by professional scouts, perhaps in their early teens, this can, in various ways, affect future evaluations. For instance, relatively younger players may be less likely to be selected for the most elite junior teams [19], [20] (see also [1]–[5]); level of competition, has, in turn, been shown to systematically bias drafting in professional sports [39]. Similarly, prior to 2005, players' decisions regarding what year they ‘opted in’ to the draft might have mediated the selection bias; however, our analyses found no evidence for this. Another possibility is that selection bias in NHL drafting is based on some sort of ‘underdog’ effect whereby relatively younger individuals, because they have faced greater social challenges [9], [14] (see also [34]), compensate by developing greater adaptability to different roles or better work habits; these traits then lead to greater long-term achievement [3], [32] (see also [40]). An underdog effect would have interesting implications, but there is little direct evidence for it yet. Furthermore, even if an underdog effect occurs, its importance is generally countervailed by other mechanisms that typically make being relatively younger disadvantageous [6]–[14].

Materials and Methods We obtained data from the official National Hockey League web site, www.nhl.com. In cases where data were unavailable, we attempted to obtain them from another site, www.hockeydb.com/. We recorded information on draft year, draft slot, player name, birth country, birth month, birth year, and whether the player was a goalie, defenseman, or forward (i.e., center, wing). We initially gathered data on all 8,186 draft selections made from 1980–2012. We did not consider data prior to 1980 because, before this, RAEs did not occur consistently [41], [42], and birth dates for draftees who did not reach the NHL were sparse. For analyses involving productivity, we excluded players drafted from 2007–2012; we did this because these players have had little opportunity to accrue production. The number of selections across our sample varied across years from 210 to 293. This variation was due to an increase in the number of NHL teams (from 21 in 1980 to 30 since 2000) and a decrease in the number of draft rounds (from 12 during the most drafts in the 1980s to 7 since 2005). In less than 1% of cases, a player was drafted, did not sign a contract, and then was selected a second time in a later draft; we excluded players' data from their second drafting. For several reasons, we excluded those who were not Canadian-born (55% of draftees). First, draft eligibility rules differ for individuals playing outside of North America [18]. Second, European based players often play in their domestic leagues or have national service obligations; these can raise obstacles to their playing in the NHL and can affect the willingness of teams to draft them [19]. Third, because there are far more Canadians than any other nationality in the NHL and the Canadian developmental system is comparatively homogenous, most previous RAE studies focused on Canadian-born players [2]–[5], [41]. We also excluded goalies. We did this because they constitute a small percentage of draftees (10%), and the development and evaluation of goalies differs substantially from that of forwards and defensemen [19], [20]. Our preliminary analyses yielded no indication of selection bias for goalies, using games as the productivity measure (for each later birth quarter, p>.70; combined F-test, p = .95). If we had included goalies, the selection bias shown in our Results would have been highly similar (for each later quarter, ps<.01). Because players have unequal opportunities to participate in the playoffs, we considered only regular season production. Teams played 80–84 games per annual regular season during the sample, with the exception of one abbreviated (1994–1995) and one cancelled season (2004–2005) [19], [36]. Our productivity measures were based on career totals at the conclusion of the 2011–2012 season. Because many players have not yet completed their careers, they will continue to accrue productivity. Although this means that draft year will correlate with productivity, there is no known reason to expect that this would spuriously produce evidence of selection bias. Because roughly 45% of draftees never play in a single NHL game, productivity measures have a non-normal distribution. Thus, when predicting productivity, we used censored regression models (i.e., Tobits). When conducting multiple regressions to address escalation effects, we reported regression coefficients that were not standardized, so that the coefficients represent conditional marginal effects. For example, the first set of regression results indicate that, when draft slot is fixed, games played increases by 7.24 for each additional point per game scored. All analyses were conducted using two-tailed statistical tests with Stata v.12.1 (College Station, TX).

Acknowledgments For comments on previous versions of the manuscript, we thank Charlie McGrath, Brad Morris, Paul Sicilian, Ben Winegard, and an anonymous reviewer. For assistance with data entry, we thank Candice Schoenborn and Renah Farhan. For providing background on the NHL draft, we thank Dan Falbo and Paul Kukla. For assistance with figures, we thank Richard Bassett.

Author Contributions Conceived and designed the experiments: ROD. Analyzed the data: ROD AL. Wrote the paper: ROD AL SC.