Abstract The risk of acquiring a chronic disease is influenced by a person’s genetics (G) and exposures received during life (the ‘exposome’, E) plus their interactions (G×E). Yet, investigators use genome-wide association studies (GWAS) to characterize G while relying on self-reported information to classify E. If E and G×E dominate disease risks, this imbalance obscures important causal factors. To estimate proportions of disease risk attributable to G (plus shared exposures), published data from Western European monozygotic (MZ) twins were used to estimate population attributable fractions (PAFs) for 28 chronic diseases. Genetic PAFs ranged from 3.4% for leukemia to 48.6% for asthma with a median value of 18.5%. Cancers had the lowest PAFs (median = 8.26%) while neurological (median = 26.1%) and lung (median = 33.6%) diseases had the highest PAFs. These PAFs were then linked with Western European mortality statistics to estimate deaths attributable to G for heart disease and nine cancer types. Of 1.53 million Western European deaths in 2000, 0.25 million (16.4%) could be attributed to genetics plus shared exposures. Given the modest influences of G-related factors on the risks of chronic diseases in MZ twins, the disparity in coverage of G and E in etiological research is problematic. To discover causes of disease, GWAS should be complemented with exposome-wide association studies (EWAS) that profile chemicals in biospecimens from incident disease cases and matched controls.

Citation: Rappaport SM (2016) Genetic Factors Are Not the Major Causes of Chronic Diseases. PLoS ONE 11(4): e0154387. https://doi.org/10.1371/journal.pone.0154387 Editor: Rodney John Scott, University of Newcastle, AUSTRALIA Received: February 26, 2016; Accepted: April 12, 2016; Published: April 22, 2016 Copyright: © 2016 Stephen M. Rappaport. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Data Availability: Data were obtained from published sources and are fully tabulated in the manuscript. Funding: Partial funding was received from grants P42ES05948 and P42ES0470518 from the National Institute for Environmental Health Sciences. Competing interests: The author has declared that no competing interests exist.

Introduction As the world’s population ages, mortality increasingly reflects the ravages of complex chronic diseases, particularly cancer and heart disease [1]. A person’s risk of succumbing to a chronic disease is linked to his or her genetics (G) and exposome (E, representing all exposures during life) plus G×E interactions. Although geneticists and epidemiologists have debated the importance of G and E as causes of chronic diseases, it is clear that both factors affect disease risks [2, 3]. However, most etiologic research has focused on genetic causes and has relegated exposures to secondary roles. For example, when queried on Feb. 6, 2016, there were 566,685 PubMed citations for the keywords “disease causes AND genetics” compared to 71,922 citations for “disease causes AND exposure”, a ratio of about eight to one. This genome-centric view of causation is motivated by the technologic ability to detect and manipulate genes, and fosters the notion that genetic factors are necessary determinants of disease that operate in a causal background of diverse exposures [4, 5]. Certainly, technologies spawned by the human genome project led to stunningly comprehensive genome-wide association studies (GWAS) that investigated genomic variability across thousands of diseased and healthy subjects. Yet, because more than 2,000 GWAS rarely reported relative risks greater than 1.2 [6, 7], geneticists are turning to whole-genome sequencing in searches for ‘missing heritability’ [8, 9]. This motivation stems, at least in part, from calculations of heritability that do not differentiate disease variation arising from genetic factors and shared exposures [10]. In contrast to GWAS, the epidemiology of causal exposures still relies on self-reported and geographic information plus a few targeted measurements [11, 12], much as it did a century ago. Nonetheless, data from the World Health Organization (WHO) has attributed nearly half of global mortality to a handful of exposures (Table 1), mainly particulate air pollution (including indoor smoke and occupational exposure) (14% of all deaths), tobacco smoking and second-hand smoke (13%), high plasma levels of sodium (6%), and alcohol use (which is generally protective but can be harmful with high consumption) (5%) [13]. There is also strong epidemiologic evidence that genetically-stable populations experience profound alterations in cancer incidence across generations and with migration that logically reflect changing exposures [3, 14, 15]. Thus, the empirical evidence promotes the notion that exposures are necessary determinants of disease that operate in a causal background of genetic diversity. However, compared to GWAS, the universe of exposures that has been investigated for associations with chronic diseases essentially consists of airborne particulate matter plus a set of about 300 environmental chemicals and nutrients [16]. PPT PowerPoint slide

PowerPoint slide PNG larger image

larger image TIFF original image Download: Table 1. Global deaths attributed to exposure-risk factors for chronic diseases. https://doi.org/10.1371/journal.pone.0154387.t001 To investigate the global influence of genetic factors on chronic-disease risks, data from cohorts of monozygotic (MZ) twins in Western Europe were compiled to estimate population attributable fractions (PAFs) for 28 chronic diseases, including prominent cancers, cardiovascular diseases, neurologic diseases, lung diseases, and autoimmune diseases. Because pairs of MZ twins have essentially identical genomes and also share many exposures [10], especially in early life, these PAFs estimate proportions of cases that would theoretically be prevented if interventions were able to remove particular combinations of genotypes and shared exposures [17, 18]. To further evaluate the impacts of G and E on the risks of chronic diseases, PAFs from MZ twins were linked with mortality statistics from Western Europe to estimate the numbers of deaths attributable to genetic factors and shared exposures for ischemic heart disease and prominent cancers.

Materials and Methods Data for estimation of PAFs were obtained from publications of disease phenotypes in large MZ-twin cohorts, i.e. [19–35], some of which had been curated by Roberts et al. [36]. Virtually all of the data were from Western European twins, primarily Swedish, Danish, and Finnish. The 28 diseases included nine types of cancer, cardiovascular diseases (heart disease and stroke), neurological diseases (Parkinson’s disease, Alzheimer’s disease, dementia, and migraine), lung diseases (chronic obstructive pulmonary disease and asthma), obesity-associated diseases (Type-2 diabetes and gallstone disease), autoimmune diseases (rheumatoid arthritis, Type-1 diabetes, and thyroid autoimmunity), genitourinary diseases (general dystocia, stress urinary incontinence, and pelvic organ prolapse), and three other syndromes (chronic fatigue, irritable bowel syndrome, and gastroesophageal reflux). The following data were extracted from each study: gender, number of MZ twin pairs (N T ), number of concordant MZ pairs (N C ), and number of discordant MZ pairs (N D ). Each PAF (%) was estimated as P*(RR-1)/RR, where P = 2N C /(2N C +N D )*100 represents the proportion (%) of case twins with an affected co-twin and RR = [2N C /(2N C +N D )]/[N D /(N D +2N T -2(N C +N D )] is the associated relative risk. If statistics were reported for both male and female twin pairs, then PAFs were estimated for the combined datasets. Since all of the studies were reported around the year 2000 for twins from Western Europe, mortality statistics for Western Europeans in that year were obtained for ischemic heart disease and relevant cancers from the WHO Global Burden of Disease Database [37, 38]. To estimate deaths attributable to genetics and shared exposures, the number of deaths for each disease type were multiplied by the corresponding PAF from MZ twins.

Conclusions Because the human genome project planted the seeds for genome sequencing and large-scale omics technologies [5], it was inevitable that these methods would be used to search for causes of major diseases, and almost 2,000 GWAS have been reported [6]. Yet, the matrix of disease-associated genetic variants does not explain much heritability [7, 9]. Indeed, Yang et al. predicted that between 20 and 50 causal genetic variants would be required to explain half the burden of a common disease, depending on the frequency of each variant and risk ratio of the genotype [17]. The small genetic PAFs estimated here from studies of MZ twins (Table 2 and Fig 1) cast further doubt on the notion that our inherited genomes are the primary causes of chronic diseases. Nonetheless, the genome can influence disease outcomes through G×E interactions, and may also contribute through epistasis and heritable epigenetic effects that are as yet unknown. Thus, investigations of causes of chronic diseases should continue to consider genetic factors as part of a balanced strategy that characterizes both E and G with high resolution. One avenue for discovering E-related risks would be to extend the data-driven approach embodied in GWAS and conduct exposome-wide association studies (EWAS) [40] via untargeted analyses of chemicals in blood (the ‘blood exposome’) [16]. Since disease processes can alter the blood exposome through dysregulation of systems biology, it is important that EWAS be conducted with archived biospecimens collected prior to diagnosis from incident cases and matched controls in prospective cohort studies. This makes it possible to distinguish chemical signatures of potentially causal exposures from those generated by progression of the disease (reverse causality) [40, 41]. A good example of this data-driven approach for EWAS is given by Wang et al. [42] who found 18 chemical features (out of more than 2000 detected) that were associated with cardiovascular disease in samples totaling only 75 incident cases and 75 matched controls. Three of the features were identified as choline and its metabolites, betaine, and trimethylamine-N-oxide (TMAO), with TMAO exhibiting the strongest disease risk in follow-up studies [43, 44]. Since TMAO is a product of joint microbial and human metabolism of choline, the positive association between plasma TMAO and disease risk points to possible involvement of the gut microbiota in the etiology of cardiovascular disease. It is interesting that a study of colorectal cancer by Bae et al. [45] also found a positive association between plasma TMAO and disease risk, again suggesting involvement of the gut microbiota.

Author Contributions Conceived and designed the experiments: SMR. Performed the experiments: SMR. Analyzed the data: SMR. Contributed reagents/materials/analysis tools: SMR. Wrote the paper: SMR.