Rogers’ paradox: Why cheap social learning doesn’t raise mean fitness

February 7, 2015 by Marcel Montrey

It’s Friday night, you’re lonely, you’re desperate and you’ve decided to do the obvious—browse Amazon for a good book to read—when, suddenly, you’re told that you’ve won one for free. Companionship at last! But, as you look at the terms and conditions, you realize that you’re only given a few options to choose from. You have no idea what to pick, but luckily you have some help: Amazon lets you read through the first chapter of each book before choosing and, now that you think about it, your friend has read most of the books on the list as well. So, how do you choose your free book?

If you answered “read the first chapter of each one,” then you’re a fan of asocial/individual learning. If you decided to ask your friend for a recommendation, then you’re in favor of social learning. Individual learning would probably have taken far more time here than social learning, which is thought to be a common scenario: Social learning’s prevalence is often explained in terms of its ability to reduce costs—such as metabolic, opportunity or predation costs—below those incurred by individual learning (Aoki et al., 2005; Kendal et al., 2005; Laland, 2004). However, a model by Rogers (1988) famously showed that this is not the whole story behind social learning’s evolution.





Rogers (1988) considered a population of individual learners tracking a temporally varying environment. Because social learners acquire information cheaper than individual learners, they are selected for when introduced. However, this eventually results in there being too few individual learners tracking the environment for up-to-date information to spread. Consequently, social learners’ fitness declines until an evolutionary stable state (ESS) is reached, with the population becoming a mix of both types of learners. Rogers’ key observation was that, by the definition of an ESS, social learners’ fitness at this stage must equal that of individual learners. In other words, while lower costs gave social learners an initial fitness advantage that allowed them to invade, social learning does not necessarily increase the population’s mean fitness in the long run. These results contradict both the assumption that social learning improves population fitness by reducing costs and, more broadly, that adaptiveness is an inevitable consequence of evolution. This finding was considered so striking that it came to be known as Rogers’ paradox.

The key to understanding Rogers’ paradox is that, unlike individual learning, which results in a constant fitness payoff, social learning is frequency dependent. Specifically, social learners’ fitness inversely depends on the number of other social learners in the population; higher frequencies of social learners result in lower fitness. This is because social learners act as as “free-riders” or “information scroungers” who produce no new knowledge of their own, but rather parasitize the knowledge generated by individual learners (Kameda & Nakanishi, 2002). Artem has previously described this as the social learning dilemma, which can be readily understood by noticing that, if everyone is looking for a book recommendation but no one is reading the candidates, then it quickly becomes difficult to find out which books are worth your time. (In fact, the cost of finding out becomes equal to the time investment of reading them yourself—individual learning.)

Another way to interpret this result is to realize that individual learning is akin to cooperation (generating new information at some cost to yourself) while social learning is akin to defection (benefiting from that information, but generating none of your own and incurring no cost). We can conceptualize this as a pairwise interaction between Alice and Bob, where b is the benefit of having information and c is the cost of obtaining it. This makes the payoff matrix:

Where the payoff for Alice is b – c if Alice learns individually (regardless of what Bob does), b if Alice learns socially while Bob learns individually, and 0 if Alice and Bob both learn socially. This places the game on the border between the Hawk-Dove and Leadership games. From this formulation, we can see that the optimal scenario is to learn socially while your partner learns individually. The risk, of course, is that you both try this strategy, ending up with nothing. This means that, so long as b > 2c, individual learning is risk dominant over social learning, despite social learning offering a higher potential payoff.

So, does social learning actually fail to raise population fitness? On a visit to McGill, Jeff Galef jokingly told me that he had never paid much attention to Rogers’ paradox simply because it was obviously wrong. Of course, all models are wrong and that is precisely what makes them useful. Rogers himself stressed that his model was a heuristic one: Its goal was not to argue that social learning fails to raise mean fitness, but to show that factors not included in his model must make up the difference. Consequently, numerous attempts have been made to resolve Rogers’ paradox by extending his model to consider factors other than cost.

Rogers’ paradox has proven surprisingly robust against many of these extensions. For instance, using an island migration model, Boyd and Richerson (1995) showed that allowing the environment to vary spatially, rather than just temporally, fails to resolve the paradox. Rendell et al. (2010) also introduced spatial factors, but did so by constraining learning and dispersal to Moore neighborhoods. They found that Rogers’ paradox not only continues to apply, but is exaggerated: Mean fitness actually drops below the baseline of individual learning, because social learners reliably outcompete individual learners locally, suppressing their own most valuable source of data.

Another extension proposed by Rogers (1988) was to consider the role of favorable learning biases, such as a preference for copying high fitness individuals or making the type of learning used contingent on the quality of environmental information. These are examples of what Laland (2004) has termed social learning strategies (“who” and “when” strategies, respectively). Boyd and Richerson (1995) found that adopting a “who” strategy of preferentially copying individual learners fails to resolve the paradox, because it does not dodge the problem of social learning being frequency dependent. Similarly, Kameda and Nakanishi (2002) showed in both modeling and experimental work that Rogers’ paradox severely restricts the utility of conformity bias (a “who” strategy of preferentially copying the majority).

Nevertheless, “when” strategies can successfully resolve Rogers’ paradox. These are contextual rules that determine when to switch between individual and social learning, allowing individual learning to apply more selectively. For instance, if an individual learns on their own when individual learning is more accurate or less costly, but switches to social learning when this ceases to be the case, then the resulting population’s mean fitness can rise above baseline (Boyd & Richerson, 1995). Both modeling and experimental work has shown that this solution applies across a broad range of parameters (Kameda & Nakanishi, 2003).

Enquist et al. (2007) proposed two other “when” strategies that also resolve Rogers’ paradox: the “critical social learner” and “conditional social learner.” The critical social learner attempts to learn socially, but switches to individual learning when unsuccessful. Conversely, the conditional social learner attempts to learn individually, but switches to social learning when this fails. They showed that critical social learning generally raises mean fitness and is selected for by evolution, except when social learning is highly unfaithful, the environment is highly variable or social learning is much more costly than individual learning. They further showed that conditional social learning is only selected for if individual learning is quite cheap, whereas pure individual learning is only selected for if the environment is extremely unstable or transmission fidelity approaches zero. In short, the critical social learning strategy resolves Rogers’ paradox under the broadest range of conditions.

Rendell et al. (2010) further extended these results, showing that, because spatially varying environments make social learning less effective, they broadly favor conditional over critical social learning. In fact, if the cost of individual learning is sufficiently low, then this effect may be potent enough for conditional social learning to be selected for over critical social learning. In such cases, the conditional strategy may also be selected for over pure individual learning, but only if individual learning is unreliable, making social information a useful backup source of data. Finally, by considering environmental harshness (the degree to which incorrect solutions are punished), Rendell et al. showed that greater harshness generally favors the critical social learner, while lesser harshness favors the conditional one. This is broadly consistent with the view that austerity often drives reliance on others.

In addition to “when” strategies, another factor that can resolve Rogers’ paradox is what happens when knowledge can accumulate over time. Boyd and Richerson (1995) showed that, if incremental learning in small steps is less costly than learning in large steps, then Rogers’ paradox ceases to apply. That is to say, the mere presence of social learners who can accumulate knowledge over time can result in higher mean fitness, so long as small steps are less costly. Ehn and Laland (2012) considered this issue in greater detail, showing that if this assumption about costs does not hold, then Rogers’ paradox can be replicated in a cumulative setting. However, they also showed that both the critical and conditional social learning strategies can resolve the paradox in this context. Moreover, they showed that an “individual refiner” strategy that learns socially, but attempts to refine its solution through individual learning regardless of success or failure, can also raise population fitness.

To sum up, theoretical work has shown that the evolution of social learning is much more subtle question than a mere advantage in costs. On the contrary, Rogers’ paradox has proven surprisingly difficult to resolve. For instance, simply complicating a model with spatial factors not only fails to resolve the paradox, but can in fact exacerbate it. Furthermore, some social learning strategies (“who” strategies) that even Rogers anticipated might resolve the paradox generally fail to do so. Nevertheless, strategies that govern when an individual switches between social and individual learning (“when” strategies) have met with greater success. In particular, Enquist et al. (2007)’s critical and conditional social learning strategies have been well explored, and have even been found to raise mean fitness in settings where learning is cumulative (Ehn & Laland, 2012). Though Rogers’ initial model is sometimes seen as a mere curiosity, it has nevertheless contributed substantially to our understanding of social learning.

References

Aoki, K., Wakano, J. Y., & Feldman, M. W. (2005). The emergence of social learning in a temporally changing environment: A theoretical model. Current Anthropology, 46(2), 334-340.

Boyd, R., & Richerson, P. J. (1995). Why does culture increase human adaptability? Ethology and Sociobiology, 16(2), 125-143. doi: 10.1016/0162-3095(94)00073-g

Ehn, M., & Laland, K. N. (2012). Adaptive strategies for cumulative cultural learning. Journal of Theoretical Biology, 301, 103–11. doi:10.1016/j.jtbi.2012.02.004

Enquist, M., Eriksson, K., & Ghirlanda, S. (2007). Critical Social Learning: A Solution to Rogers’s Paradox of Nonadaptive Culture. American Anthropologist, 109(4), 727-734. doi: 10.1525/aa.2007.109.4.727

Kameda, T., & Nakanishi, D. (2002). Cost-benefit analysis of social/cultural learning in a nonstationary uncertain environment: an evolutionary simulation and an experiment with human subjects. Evolution and Human Behavior, 23, 373-393.

Kameda, T., & Nakanishi, D. (2003). Does social/cultural learning increase human adaptability?: Rogers’s question revisited. Evolution and Human Behavior, 24(4), 242-260. doi: 10.1016/s1090-5138(03)00015-1

Kendal, R. L., Coolen, I., van Bergen, Y., & Laland, K. N. (2005). Trade-offs in the adaptive use of social and asocial learning. Advances in the Study of Behavior, 35, 333-379. doi: 10.1016/S0065-3454(05)35008-X

Laland, K. N. (2004). Social learning strategies. Learn Behav, 32(1), 4-14.

Rendell, L., Fogarty, L., & Laland, K. N. (2010). Rogers’ paradox recast and resolved: population structure and the evolution of social learning strategies. Evolution, 64(2), 534–48. doi: 10.1111/j.1558-5646.2009.00817.x

Rogers, A. (1988). Does biology constrain culture? American Anthropologist, 90, 819-831 doi: 10.1525/aa.1988.90.4.02a00030