Theoretical studies suggest that mating and pair formation is not likely to be random. Computer simulations suggested that sex among genetically complex organisms requires mate choice strategies for its evolutionary maintenance, to reduce excessive genetic variance produced by out-crossing. One strategy achieving this aim efficiently in computer simulations is assortative mating modeled as “self seeking like”. Another one is selection of “good genes”. Assortative mating increases the probability of finding a genetically similar mate, without fomenting inbreeding, achieving assortative mating without hindering the working of other mate selection strategies which aim to maximize the search for “good genes”, optimizing the working of sex in evolutionary terms. Here we present indirect evidence that in a significant proportion of human reproductive couples, the partners show much higher facial resemblances than can be expected by random pair formation, or as the outcome of “matching for attractiveness” or the outcome of competition for the most attractive partner accessible, as had been previously assumed. The data presented is compatible with the hypothesis derived from computer simulations, that human mate selection strategies achieve various aims: “self seeking like” (including matching for attractiveness) and mating with the best available genes.

Methods We photographed 36 randomly selected couples, from a list of addresses provided by a local doctor in the city of Mérida, Venezuela, which either had children and/or were living for at least 3 years together, and which reported to have no known family relationship between them. A digital black and white photograph of each of the partners was taken in or around their homes (Figure 1). The subjects of these 36 couples were not used for any of the behavioral tests performed and we will call them the target subjects. Download Open in new tab Download in PowerPoint To assess the existence of resemblance between the faces of couples among the target subjects, the photographs of the males were placed on a table and those of the females were randomly shuffled. The test subjects (over 100 volunteers at the universities in Caracas and Mérida) had to assign each of the photographs of female target subjects to one of the males. Test subjects did not know any of the target subjects photographed. The test was performed double blind, as neither the experimenter nor the test subject knew who the correspondence of the photos to the real couples. The amount of correct guesses, i.e., joining photographs of male and female partners of the same couple, did not differ between test subjects asked to “Choose the female to which the male is most likely to be married” and test subjects asked to “Choose the female that is most likely to be a sibling to the male” (p > 0.55, Chi squared test, df = 19. The questions given in quotations are rough translations from Spanish). Thus, for further tests, we asked the test subjects to “Join the photographs of the putative couples with the closest resemblance and or the more likely to be married”. In order to simplify the tests and reduce the rejection of test subjects to participate, we built 6 pools of 6 couples (i.e. 12 target subjects each), so that in each pool, the target males had the same age in a range of ± 2 years if less than 35 years old and ± 5 years if 35 years of age or more. Test subjects were then presented with the 6 photos of the faces (or parts of faces) of the target males placed on a table, and they had to assign the randomly shuffled photos of the 6 target females to their partners. To test for the effect of cues in the background of the photo in identifying correctly couples from photographs of target subjects, we prepared 3 experiments. We manipulated the digital photos so as to provide: 1- the same background for each pair in a couple but which differed between couples; 2- a different background for each of the partners in a couple; 3- no background for all couples which made all photos to have the same background (as in Figure 1). As an additional test to reject a possible effect of background cues in the photographs, we also presented photos of only the mouth, eyes or nose of the subjects (see Figure 1). The test subjects were then asked to pair the photos of the facial features of target subjects according to their similarity. As face recognition abilities are dependent on early visual experience (Le Grand et al. 2001) and are better between individuals of the same race (Golby et al. 2001), we correlated the number of correct guesses made by test subjects with the skin colour difference between the test subject and the average of the faces of the correctly guessed couples. The colour was assessed by the experimenter for both test and target subjects in 3 distinct categories (dark, slightly dark, white), which were assigned values of 1 to 3 in order to perform statistical correlations on the results. Not all test subjects were used for this test as only skin colour data assessed by one of the experimenters was used In order to estimate the effect of attractiveness of faces as a possible confounding factor, we assessed the attractiveness of a face, through test subjects from the opposite sex. The attractiveness was registered from less attractive to very attractive in a scale from 1 to 4. Twenty one subjects of each sex were presented with the photographs of 35 members of couples of the opposite sex of the test subject for this assessment. That is, test males were given the photos of target females and test females those of target males to assess the attractiveness. Data was ordered based on increased attractiveness of either males or females. The statistical analyses performed on the data were applied on the scores of test subjects. That is on the number of correct couples guessed by the test subjects. The analyses were: Pearson correlation coefficient to assess correlations between age and scoring, and between attractiveness and scoring. Chi square test to compare the total number of scores obtained for a given experimental setting with those expected for random guessing. The tests involved that each test subject had to match all photos for all coupes. Random guessing under this scenario for either 36 pairs or 6 pairs gives in average one correct guess per test subject. Deviation from one was assumed to be non-random guessing and its significance was assessed by the Chi-squared test. The degrees of freedom were calculated as the number of test subjects used for that given experimental setting, minus one. Another more sensitive way to look at the results was to assess the number of times a given couple was correctly identified as such by test subjects. This distribution of guesses (see Figure 2) was then compared with an expected distribution obtained by random guessing. The outcome of random pair formation plus random guessing was estimated using a simple Monte Carlo simulation model written in basic. Download Open in new tab Download in PowerPoint

Results The number of correct guesses, i.e. guessed pairs of photographs corresponding to actual couples, made by tests subjects was far larger than expected by random guessing in most experiments. When females were provided with the photos of the target faces of 36 couples, they guessed correctly an average of 2.5 couples (Significantly different from random, n = 25 test females, p < 0.0001, Chi-square = 132). Male test subjects placed in front of the same task managed to identify correctly only an average of 0.94 couples (Not different from random, n = 18 test males, p=0.6, Chi-square = 15). The amount of correct guesses made by female test subjects was significantly higher than those made by male test subjects (p < 0.003, chi-square = 38). When the test was simplified, so that only the photos of faces of 6 couples were presented at the same time, this difference between the number of correctly guessed couples achieved by female and male test subjects disappeared (p = 0.11, Chi-square = 21). The average number of correctly guessed couples was 1.71 and 1.91 for female and male test subjects respectively (Significantly different from random in both cases, n = 35 and 21, p < 0.0001 in both cases). No significant correlation (α = 0.1) between correct guesses and age of the test subjects was found (neither for females r = 0.009, mean age = 27, range 19–52, nor for males r = −0.2, mean age = 25, range = 18–70). We correlated the number of correct guesses made by test subjects with the skin colour difference (dark, slightly dark, white) between the test subject and the average of the faces of the correctly guessed couples, and obtained a Pearson's correlation coefficient r = 0.5 (p < 0.01, n = 65). Thus, if test subjects were given same race faces to evaluate, the number of correct guesses increased. When examining the distribution on the number of correct guesses received by each target couple we found that many test subjects never guessed correctly any couple (Table 1). That is, 39% of males and 16% of females never scored a single correct guess, and 28% of males and 64% of females scored above one (random). Of the 36 target couples, 64% were guessed more than once (chance) and 44% more than two times (Fourth column of Table 4). We compared the number of times each couple was guessed correctly with that predicted by a Monte Carlo simulation based on random pair formation and random guessing. These comparisons, for all the experiments performed, are given in Table 2, where we compare the distribution of correct guesses with that produced by Monte Carlo simulations, using a chi-squared test. The table reads as follows. In the first line, for example, we present the results of tests using photographs of faces having all the same background. The number of photos provided to the test subjects was 72 (36 photos of females and 36 of males, belonging to 36 couples). The test subjects for this experiment were both male and females and 43 test subjects were tested. The chi-square value obtained when comparing the actual data with that from a Monte Carlo simulation with totally random guessing, as explained in Figure 2, was 352. This value indicates that the actual data differs from totally random guessing significantly, as the odds of obtaining the actual data by chance are <<0.0001 for a distribution function of guesses with 13 degrees of freedom (see Figure 2). Table 1: Scores, assessed as number of correctly guesses target couples, produced by test subjects when confronted with photographs of faces from partners from 36 couples. View larger version Table 2: Comparison between a distribution of guesses achieved randomly (Monte Carlo simulations) and the distribution of the number of correct assignments made by test subjects, guessing the partners of couples, based on photographs of faces or parts of faces. View larger version Table 3: Mean attractiveness of the partner of opposite sex, in couples in a given quartile if ordered by attractiveness of: View larger version Table 4: Scores obtained by each couple (number of correct guesses), the average attractiveness index obtained by each individual, and the difference in attractiveness between the partners of the couple (absolute difference) as assessed by neutral judges. View larger version The results presented in Table 2 show that in all cases studied test subjects were able to correctly identify a significant proportion of couples with an accuracy that far exceeded that expected by random guessing or by random pair formation. With this more sensitive test, even male test subject which were much less accurate compared to females in guessing the correct pairs for the couples if the sample of photographs to choose from was large, showed a score that statistically differed from random guessing. The results in Table 2 also show that the background of the photographs did not affect significantly the correct guessing of pairs. Independently of how the background of the photos was presented, test subjects could pick out the real couples based on facial similarities. Test subjects were even able to correctly guess the real couples if photos of only parts of the faces of target subjects (nose, eyes or mouth) were presented. This recognition was less accurate than if the complete face was presented, but the large amount of tests performed provide for a high statistical significance of this effect. When ordering couples based on the attractiveness of the faces of one of the sexes, the average value of attractiveness of the opposite sex correlated strongly with the attractiveness of the sex used to order the couples. That is, more attractive females tended to pair with more attractive males, or vice-versa. This tendency can be visualized in Table 3. This table shows that the average attractiveness, as assessed by 20 test subjects of the opposite sex, was lowest for female mates of the most unattractive males and highest for the female mates of the most attractive males, when the couples were ordered according to the attractiveness of the male partner. When the data was order according to the attractiveness of the female partner in the couple, an even stronger result was obtained. The average attractiveness of male mates of the least attractive females was much lower than that of male mates of the most attractive females.

Discussion Our results confirm that human couples resemble each other significantly more than expected for random pair formation, and that this resemblance can be detected by neutral judges (test subjects). In the sample used here, a significant proportion of the couples studied showed to have conspicuous facial similarities, detectable by judges. This result confirms that our test was sufficiently sensitive and the sample examined was adequate, regarding the number of couples with conspicuous facial similarities between them, for the assessment of assortative mating. We wanted to know if this assortment was the product of “self seeking like” or the outcome of competition for the most attractive partner available. Assortment based on attractiveness was certainly at work in our sample as shown with our experiments assessing the attractiveness of the partners of the couples, presented in Table 3. That is, there was a general tendency for more attractive males to pair with more attractive females and vice versa. Yet, attractiveness by itself could not explain our results for various reasons: 1- Test subjects correctly guessed the real couples if photos of only parts of the faces of target subjects (nose, eyes or mouth) were presented. Even though the criteria for attractiveness are not applicable or are much more difficult to apply on only parts of faces, these parts revealed sufficient information on similarities for test subjects to re-assemble photographs of real life couples. 2- The lack of correlation between the scores of correct guesses and the difference in attractiveness between partners in the couples (Table 4) is not compatible with the competition hypothesis or the matching hypothesis. If assortative mating was the outcome of competition for attractiveness, “similarity” and matching for attractiveness should correlate positively. In addition, if matching attractiveness is at work, the assessment of similarity by test subjects should also be based on levels of attractiveness. This was certainly not the case in our study as the scores for similarity and the differences in levels of attractiveness between the partners of a couple diverged conspicuously. Thus, the competition hypothesis by itself can not explain our results. 3- Criteria for attractiveness vary between races and thus, an absolute scale of attractiveness for humans is not likely to exist. This fact has been known for quite a wile as discussed in the introduction. Our data confirmed that the ability to recognize similarities in faces was related to race, making it unlikely that assortative mating can be explained solely on the basis of matching universal attractiveness criteria or as the outcome of competition for the most attractive mate available. The cumulative evidence presented here favors the hypothesis that humans search for couples based on “self seeks like”, using a narcissistic psychological algorithm in assessing the appropriate mate. Yet passive assortative mating could also explain our results. Passive assortative mating occurs when, due to population viscosity, reproduction occurs among spatially proximate individuals that are probably close or distant relatives. In the active sort, individuals choose their mates based on similar phenotypic traits, which reflect similar genes. In both cases the result is assortative mating or breeding among mates that possess similar genes. Our results showed that females are better than males in assessing facial resemblance between individuals when a large number of choices are presented to test subjects. This phenomenon is congruent with the fact that females have a much finer discriminatory ability than males (Briceño and Jaffe 1997). The results of this study are compatible with the notion that humans develop a sense of beauty through imprinting-like mechanisms. This sense of beauty must have a strong narcissistic component, as it is formed through the images of the parents, as discussed in the introduction. When this sense of beauty is applied to mate selection, the outcome is assortative mating in a multidimensional scale, as no universal scale of beauty can be formed through this mechanism. Our results can not discard that assortative mating in humans is at least partially achieved through competition for the most attractive potential partner, or by matching attractiveness. Yet, our results strongly suggest that a multitude of other visual criteria are involved in mate selection. The hypothesis of “self seeking like” explains the experimental results rather well. The fact that these narcissistic criteria seem to be applied also in situation were no pairs for reproductive purposes are involved, such as in the choice of partners for business purposes (DeBruine 2002), strongly support the narcissist hypothesis. A testable prediction to possibly falsify the ‘self seeking like” hypothesis is that narcissistic criteria should be applied to many other situations in human every day life involving aesthetic or affective assessments. For example the choice of pets should also follow narcissistic criteria. This has been shown to be the case after obtaining our results (Roy and Christenfeld 2004, Payne and Jaffe 2004). In the present study we seem to have dealt just with one envelop of a rather complex set of mate selection mechanisms. Thus, further research should refine the mechanisms at work that seem to balance assortative mating with inbreeding avoidance and selection for ‘good genes’ (Jaffe 2002). The results might direct future research in focusing on the still unknown intricacies of mate selection mechanisms and the origin of love.

Acknowledgements We thank Omar Arenas for advice in the statistical treatment of the data, Peter Todd, John Hutchinson and Diana Ajami for helpful comments, Oswaldo Henriquez and Rodolfo Jaffe for help in recruiting test subjects. L. A. received financial support from CONICIT.