When I worked at Google, I once did an employee search on "Vivian" and 26 of the 30 results were Chinese. This post examines this phenomenon a bit more scientifically, with two goals:





Find the most common English first names for Chinese last names (P(Name n | Chinese)).



Find the English first names that are differentially expressed -- that is, which are much more popular among Chinese Americans than among the general American public (i.e., P(Chinese | Name n)).



The ideal approach would be to download a phone book and tally the first names for Chinese last names. While there's nowhere to download a phone book, there are several searchable people databases online. The largest and most famous free option is ZabaSearch.

The first step: get a list of Chinese last names to search for. I used the 100 most common last names in both the natural Pinyin and Wade-Giles variants, for a total of 128 unique last names. With a script, I searched for each on ZabaSearch. Sadly, Zaba won't show you the results if there are over 1000 -- it just says "1000's of CHIN's found!" If you search across the entire U.S., then this happens for too many of the names, so I limited my searches to Boston. The 128 last names culled 22,483 unique people (after also de-duping by address).

Among Chinese in Boston, the most common three first names are Wei (1.34%), Hong (0.916%) and Hui (0.836%). Only about 25% -- 5,949 of 22,483 -- of the first names are English.

Of the Chinese population with English first names, the most popular three male and female names are shown below. For the American public, these are downloaded from the latest census report. (Note: To standardize the population sizes, I limited both populations to those with first names in the top 500.)

Name Rank across America Rank among Chinese Frequency across America Frequency among Chinese DAVID 6 1 0.0286 0.0511 JOHN 2 2 0.0396 0.0378 JAMES 1 3 0.0402 0.0369

Name Rank across America Rank among Chinese Frequency across America Frequency among Chinese JENNIFER 6 1 0.01300 0.0311 AMY 32 2 0.00631 0.0303 ANGELA 29 3 0.00655 0.0178

The three most popular Chinese male first names are also very popular in America as a whole. A more interesting question is the one about P(Chinese | Name n) -- which English first names are much more common among Chinese Americans than among all Americans? To answer that, I conducted a binomial proportion test and sorted the results by p-value, identifying the most extreme differences. The top 10 male and female differences are given below.

Some of the top results are nicknames -- Chinese are much more likely to pick "Andy" or "Jenny" as a legal name, while general Americans are formally named by the longer versions.

The other names on the list are more interesting. For males, "Andrew," "Eric," "Peter" and "Albert" are much more common among Chinese than among Americans. For females, it's "Amy," "Grace," "May" and, yes, "Vivian." By comparing the frequencies, you can see that these names are all over five times more popular among Chinese Americans!

I'll leave interpretation to the sociologists.

Name Frequency across America Frequency among Chinese p-value ANDREW 0.006510 0.02810 2.2e-30 ANDY 0.000594 0.00937 2.0e-26 DAN 0.001220 0.01120 3.6e-23 PETER 0.004620 0.01990 5.3e-22 ALBERT 0.003810 0.01570 7.0e-17 ERIC 0.006590 0.02120 1.5e-16 ALAN 0.002470 0.01150 2.9e-14 SAM 0.001110 0.00786 3.7e-14 ALEX 0.001390 0.00846 1.4e-13 DAVID 0.028600 0.05110 1.7e-12