Abstract High activity levels of a transgene can be very useful, making a transgene easier to evaluate for safety and efficacy. High activity levels can also increase the economic benefit of the production of high value proteins in transgenic plants. The goal of this research is to determine if recurrent selection for activity of a transgene will result in higher activity, and if selection for activity of a transgene controlled by a native promoter will also increase protein levels of the native gene with the same promoter. To accomplish this goal we used transgenic maize containing a construct encoding green fluorescent protein controlled by the promoter for the maize endosperm-specific 27kDa gamma zein seed storage protein. We carried out recurrent selection for fluorescence intensity in two breeding populations. After three generations of selection, both selected populations were significantly more fluorescent and had significantly higher levels of 27kDa gamma zein than the unselected control populations. These higher levels of the 27kDa gamma zein occurred independently of the presence of the transgene. The results show that recurrent selection can be used to increase activity of a transgene and that selection for a transgene controlled by a native promoter can increase protein levels of the native gene with the same promoter via proxy selection. Moreover, the increase in native gene protein level is maintained in the absence of the transgene, demonstrating that proxy selection can be used to produce non-transgenic plants with desired changes in gene expression.

Citation: Bodnar AL, Schroder MN, Scott MP (2016) Recurrent Selection for Transgene Activity Levels in Maize Results in Proxy Selection for a Native Gene with the Same Promoter. PLoS ONE 11(2): e0148587. https://doi.org/10.1371/journal.pone.0148587
Editor: Jin-Song Zhang, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, CHINA
Received: October 16, 2015; Accepted: January 19, 2016; Published: February 19, 2016
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was funded by the United States Department of Agriculture Agricultural Research Service (http://www.ars.usda.gov/main/main.htm) CRIS project number 3625-21000-055D, MPS and the United States Department of Agriculture Cooperative State Research, Education and Extension Service (no URL available) special appropriation Biotechnology Test Production, IA: Recovery and Purification of Recombinant Proteins from Plants for Therapeutics and Industrial Enzymes, ALB, MNS.
Competing interests: The authors have declared that no competing interests exist.

Introduction The Illinois Long Term Selection Experiment is a well-known example of how recurrent selection can result in dramatic changes in phenotype, in this case protein or oil content. Starting with a single population in 1896, recurrent selection for and against grain protein content over 100 generations resulted in 32 and 4% protein respectively, compared to 8–12% protein in the starting population [1]. Recurrent selection for and against grain oil content resulted in 20 and 1% oil respectively compared to 4–6% oil in the starting population [1]. While the Illinois Long Term Selection Experiment is impressive in its longevity and in the large amount of divergence seen in the high and low lines while retaining genetic diversity, recurrent selection for a trait can also result in significantly different lines in just a few generations. For example, Scott et al. [2] used recurrent selection to change methionine content in maize: after 4 generations of selection, there was 17.6% difference between the high and low methionine lines. Other examples of successful recurrent selection programs in maize include increased prolificacy (number of ears) in the Golden Glow population [3], long and short ear length [4], and pseudostarchy endosperm or extreme sugary endosperm in a sugary1 background [5]. It would be useful if the power of recurrent selection could be harnessed to increase transgenic protein production. High activity levels of a transgene can make a transgene easier to evaluate for safety and efficacy. High activity levels can also be important for production of transgenic protein, such as pharmaceutical or industrial proteins. Despite success with recurrent selection in improving non-transgenic traits, this method has only been reported as a method to increase transgene activity in one study. Hood et al. [6] used breeding to increase protein levels produced by a transgene with an embryo preferred promoter. 
They crossed transgenic lines to high-oil lines, followed by selection for high levels of the transgene-encoded protein. Here, we subject fluorescence of green fluorescent protein (GFP) controlled by the 27kDa gamma zein promoter to selection pressure with recurrent selection. GFP is a convenient marker that produces fluorescence that is directly proportional to the amount of protein [7] and can be screened visually or quantified with fluorometry in whole seeds, requiring no processing to induce fluorescence. In maize, seed storage proteins called zeins make up about 50% of total protein in the endosperm [8] and are not expressed elsewhere in the plant [9]. Due to their localized activity at high levels, zein promoters are useful for many biotechnology applications [10], including production of proteins for extraction and for biofortification. The 27kDa gamma zein promoter was previously shown to drive high activity of GFP in maize endosperm [11]. Recurrent selection is only possible if the trait of interest can be easily quantified. Traits that are expensive or difficult to measure are unlikely to be subjected to recurrent selection, even if an increase in those traits would be useful. Thus it is not practical to alter the activity of most genes by recurrent selection. We propose that it may be possible to apply selection pressure to an easily quantifiable transgene with the same promoter as a native gene of interest in order to increase activity of that native gene. This research investigates recurrent selection as a method to increase transgene activity and the effects of that selection on native genes. The primary hypothesis is that selection for high fluorescence of a 27kDa gamma zein GFP transgene fusion will result in higher activity of GFP in subsequent generations. 
The secondary hypothesis is that levels of the native 27kDa gamma zein will also increase due to selection pressure on the 27kDa gamma zein promoter, in a phenomenon we propose to call proxy selection.

Materials and Methods Transgenic seed development Maize seeds expressing GFP were developed and backcrossed to B73 for 3 generations by Shepherd [11]. Event P230-71-1 was selected because it expressed GFP well and appeared to be a single-copy integration event. The construct contained the Zea mays 27kDa gamma zein endosperm-specific promoter cloned from inbred Va26 (Genbank accession EF061093), the modified green fluorescent protein (GFP) gene sGFPs65T (Genbank accession ABB59985) [12], and the nos terminator sequence (modified from Genbank accession V00087). GFP and the nos terminator were from the pAct1IsGFP-1 plasmid [13]. Development of segregating populations and seed production The breeding plan is in Fig 1. All plants were grown at the Iowa State University Transgenic Farm in Ames, IA as follows: Year 1 in 2006, Year 2 in 2007, Year 3 in 2009, and Year 4 in 2010. There was no planting in 2008 due to field flooding. In year 1, the transgene was bred into two broad-based synthetic breeding populations: BS11 (derived from the Pioneer two-ear composite [14]) and BS31 (derived from FS8B [15]). This was done by crossing the homozygous transgenic inbred line with about 50 individuals of each breeding population. We tested seeds from the resulting full-sib ears for fluorescence as described below. These breeding populations provided the genetic variability needed for selection. Using two different breeding populations allowed us to determine if the two populations reacted similarly to selection or if any observed effects were specific to a single population. We harvested approximately 50 ears from each population each year and evaluated those ears for GFP fluorescence. To avoid selecting for homozygosity at the transgene locus in the selected populations, we used only ears that were visually segregating for visible GFP fluorescence to advance all populations. We compared selected populations to control populations that were random mated without selection. 

Fig 1. Breeding strategy used to develop selected and unselected populations over three generations. For the selected populations, the most fluorescent ears were the parents for the next generation. For the unselected populations, random ears were the parents for the next generation. The abbreviation fl. is for fluorescence. https://doi.org/10.1371/journal.pone.0148587.g001

In year 1, 50 randomly chosen seeds from each of the five ears with the highest mean fluorescence level were bulked and planted and the resulting plants intermated to create two selected populations (S1), one derived from BS11 and the second derived from BS31. Methods for fluorescence measurement are described below. In addition, we bulked and planted 50 randomly chosen seeds from each of five randomly chosen ears and intermated the resulting plants to create two unselected (control) populations (U1), one derived from BS11 and the second derived from BS31. In year 3, we tested ears from each S1 population for fluorescence, and selected ears were intermated as in year 1 to create S2. The unselected populations were advanced as described for year 2 to create U2. In year 4, the selected and unselected populations were advanced as in year 3 to create S3 and U3. Evaluation of selected populations Twelve populations (two starting populations x selected or unselected x three generations) were evaluated in single plots in one experiment in 2010. Selected and unselected populations were randomly assigned positions in adjacent plots. Plots consisted of four rows of 50 seeds each. Plants within each population were intermated by hand using chain sib pollinations to avoid pollen flow from neighboring populations. We harvested a total of 448 ears, and randomly chose ears from each harvested population for the experiments. For determination of fluorescence, 30 ears from each population were evaluated as described below, for a total of 14,400 measurements.
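The per-generation choice of parent ears described above can be sketched as follows. This is an illustrative sketch rather than the authors' procedure as implemented: the function names `choose_parent_ears` and `bulk_seed`, the data layout (a mapping from ear ID to mean fluorescence), the made-up fluorescence values, and the fixed random seed are all assumptions. Only the counts (five ears, 50 seeds per ear) and the restriction to visibly segregating ears come from the text.

```python
import random

def choose_parent_ears(ear_means, n_ears=5, selected=True, rng=None):
    """Pick parent ears for the next generation.

    ear_means: dict mapping ear ID -> mean fluorescence of a segregating ear
               (hypothetical layout; non-segregating ears are assumed to have
               been excluded before this step).
    selected:  True  -> take the n_ears most fluorescent ears (S populations)
               False -> take n_ears at random (U populations)
    """
    rng = rng or random.Random(0)  # fixed seed only to make the demo repeatable
    ears = sorted(ear_means)       # deterministic order before any sampling
    if selected:
        return sorted(ears, key=lambda e: ear_means[e], reverse=True)[:n_ears]
    return rng.sample(ears, n_ears)

def bulk_seed(parent_ears, seeds_per_ear=50):
    """Number of seeds bulked for planting: seeds_per_ear from each parent ear."""
    return len(parent_ears) * seeds_per_ear

# Made-up mean fluorescence values for ten segregating ears
demo = {f"ear{i:02d}": v for i, v in enumerate(
    [1200, 950, 1810, 1430, 700, 1660, 1010, 1590, 880, 1720])}
top = choose_parent_ears(demo, selected=True)    # selected population parents
ctrl = choose_parent_ears(demo, selected=False)  # unselected control parents
```

With these illustrative values, `top` holds the five brightest ears and `bulk_seed(top)` gives 250 seeds to plant; the control draw ignores fluorescence entirely.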
GFP screening A Dark Reader hand lamp (Clare Chemical, Dolores, CO) was used to visually screen seeds for GFP fluorescence. Fluorescence was quantified with a spectrofluorometer (Tecan, Männedorf/Zurich, Switzerland) at 16 points within each well of a 6-well Costar plate (Corning, Lowell, MA), at 485nm excitation and 535nm emission wavelengths. The instrument gain was set to optimize differentiation of samples in the experiment. One well of each plate contained a standard consisting of the same set of kernels to control for instrument drift during the course of measurements. When the standard values were included as a covariate in the analysis, the effect of the standard was not significant, so the standard values were not used in the analyses presented here. Wells were filled with random visually positive seeds. Each plate was shaken and measured 5 times for a total of 80 individual fluorescence measurements per sample to ensure a representative measurement of the sample. Throughout the experiment, only ears that were visually determined to be segregating for GFP activity, as shown in Fig 2, were eligible for analysis.
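The measurement scheme above (16 points per well, five shake-and-read cycles) gives 80 readings per sample. A minimal sketch of reducing those readings to one value per sample is given below. The use of the arithmetic mean is an assumption, chosen to match the "mean fluorescence level" used for selection; the text does not state how the 80 readings were summarized. The names `sample_fluorescence`, `POINTS_PER_WELL` and `READS_PER_PLATE` and the demo readings are hypothetical.

```python
from statistics import mean

POINTS_PER_WELL = 16   # positions read within each well (from the text)
READS_PER_PLATE = 5    # shake-and-read cycles per plate (from the text)

def sample_fluorescence(reads):
    """Average all readings for one well (assumed summary statistic).

    reads: list of READS_PER_PLATE lists, each holding POINTS_PER_WELL values.
    Raises ValueError if the shape does not match the described protocol.
    """
    if len(reads) != READS_PER_PLATE or any(len(r) != POINTS_PER_WELL for r in reads):
        raise ValueError("expected 5 reads of 16 points each")
    flat = [value for read in reads for value in read]  # 80 readings in total
    return mean(flat)

# Made-up readings: read k returns the value 1000 + k at every point
demo_reads = [[1000 + k] * POINTS_PER_WELL for k in range(READS_PER_PLATE)]
demo_mean = sample_fluorescence(demo_reads)
```

Checking the shape up front makes a skipped shake-and-read cycle fail loudly instead of silently changing the number of readings behind each sample value.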


Fig 2. Maize segregating for various levels of GFP activity. The top panel is in white light and the bottom panel in blue light (485nm) with an orange filter (535nm). https://doi.org/10.1371/journal.pone.0148587.g002

Quantification of seed storage proteins To determine whether selection for GFP with the 27kDa gamma zein promoter resulted in changes in levels of the native 27kDa gamma zein, we used HPLC to quantify alcohol soluble seed storage proteins. Two samples, consisting of either random visually GFP positive or random visually GFP negative seeds, were taken from 20 random ears from each of the 4 populations in generation 3: BS11 S3, BS11 U3, BS31 S3, and BS31 U3. Each sample was ground to fine flour and alcohol-soluble proteins were extracted with an alcohol-based buffer, as described by Flint-Garcia [16]. The proteins were separated with HPLC on a C18 protein and peptide column in a Waters 2695 Separation Module with a gradient of water and acetonitrile, both with 0.01% trifluoroacetic acid. The flow rate was 0.5ml/min and the concentration of water changed as follows: 50% at 0min, 46% at 33min, 40% at 35min, to 20% at 37min which was held for 15min. Absorbance was measured at 200nm with a Waters 2487 Dual Absorbance Detector. Individual peak areas for each sample were integrated using Empower software (Waters, Milford, MA) with a minimum peak width of 30 and threshold of 800. We grouped peaks by retention time and identified the 27kDa gamma zein and the alpha zein region by comparison to known HPLC profiles [17, 18]. A standard sample was run with each analysis batch to control for variation among batches but no significant variation in the standard values was observed so the data from the standard was not used in the analysis presented here.
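The elution program above can be written as a table of set points and interpolated to get the solvent composition at any time. This sketch assumes linear ramps between set points, which the text does not state explicitly, and takes the final hold as running from 37 to 52 min (15 min at 20% water). The names `GRADIENT`, `percent_water` and `percent_acetonitrile` are illustrative.

```python
# (time in min, % water) set points from the text; the remainder is acetonitrile
GRADIENT = [(0.0, 50.0), (33.0, 46.0), (35.0, 40.0), (37.0, 20.0), (52.0, 20.0)]

def percent_water(t_min):
    """Return % water at time t_min, assuming linear ramps between set points."""
    if t_min < GRADIENT[0][0] or t_min > GRADIENT[-1][0]:
        raise ValueError("time outside the gradient program")
    # Walk consecutive pairs of set points and interpolate within the matching one
    for (t0, w0), (t1, w1) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= t_min <= t1:
            return w0 + (w1 - w0) * (t_min - t0) / (t1 - t0)

def percent_acetonitrile(t_min):
    """Return % acetonitrile, taken as the balance of the binary gradient."""
    return 100.0 - percent_water(t_min)
```

For example, `percent_water(34)` falls halfway along the 33 to 35 min ramp, and any time between 37 and 52 min returns the 20% hold value.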
Additional phenotypic evaluation To determine the extent of change of unselected traits in the course of the experiment, we evaluated three phenotypic traits unrelated to the transgene: germination rate, seed mass, and percent nitrogen. Germination rates for each of the 12 populations planted in year 4 were determined by counting the number of plants in each row, not including tillers. Seed mass of each of the 12 populations planted in year 4 was determined by weighing two samples of 50 randomly selected seeds from each ear that was segregating for GFP. Percent nitrogen of 0.5g of two flour samples, consisting of either visually GFP positive or visually GFP negative seeds, from 10 randomly selected ears from S3 and U3 of both breeding populations (40 samples total) was determined by combustion analysis by the Iowa State University Soil and Plant Analysis Laboratory. For all phenotypes, means for each treatment were used for statistical analysis described below. Selection was carried out based on ears rather than seeds, with random mixtures of GFP positive and GFP negative seeds from segregating ears used to advance both selected and unselected populations each year. Each population was expected to have approximately 50% of all ears segregating for GFP, 25% with all positive seeds, and 25% with all negative seeds. To determine whether selection was affecting zygosity of the population, we determined the percentage of total harvested ears per population that were segregating for GFP. Ears with all GFP-negative seeds cannot be visually distinguished from ears with uniform low activity, so only segregating ears were counted for the purpose of determining the percentage of segregating ears in each population. Statistical analysis For each of the three generations, there were two pairs of selected and unselected populations, in two breeding populations. To determine if selection was effective, we used least squares fitting of the data to a linear model. 
The model used for fluorescence, germination rate, and seed mass included the following terms, where Y is the observed value of the treatment:
μ = the overall mean of the observed values
Gen = the effect of generations of selection (1, 2 or 3)
SorU = the effect of selection for GFP levels versus the unselected control
Pop = the effect of breeding population, BS11 or BS31

All effects were considered fixed effects, limiting the inference space to the observations made in this study. The significance of the SorU term was the test used to determine if selection was effective. For zein peak area and total protein, we examined the variation within the most advanced generations of selection (S3 and U3). ANOVA was carried out with JMP [19], using a fixed effects model with the following terms, where Y is the observed value of the treatment:
μ = the overall mean of the observed values
GFP = visually positive or negative for GFP fluorescence
SorU = the effect of selected or unselected
Pop = the effect of breeding population, BS11 or BS31

A chi-square test was used to determine whether the percentage of total harvested ears per population that were segregating for GFP varied significantly from the expected 50%.
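The chi-square comparison against the expected 50% of segregating ears can be sketched as a goodness-of-fit test with one degree of freedom. The two-category setup (segregating versus not segregating) is an assumption about how the test was framed, the function name `chi_square_segregation` is hypothetical, and the counts in the example are made up. The p-value relies on the fact that a chi-square variable with one degree of freedom is a squared standard normal, so its upper tail is `erfc(sqrt(chi2 / 2))`.

```python
from math import erfc, sqrt

def chi_square_segregation(n_segregating, n_total, expected_fraction=0.5):
    """Goodness-of-fit test of segregating vs. non-segregating ear counts.

    Returns (chi2, p_value) for 1 degree of freedom.
    """
    observed = [n_segregating, n_total - n_segregating]
    expected = [n_total * expected_fraction, n_total * (1 - expected_fraction)]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    p_value = erfc(sqrt(chi2 / 2))  # upper tail of chi-square with 1 d.f.
    return chi2, p_value

# Made-up example: 30 of 50 harvested ears segregating for GFP
chi2, p = chi_square_segregation(30, 50)
```

In this illustrative case the statistic is 2.0 and the p-value is about 0.157, so 30 of 50 would not differ significantly from the expected 50% at the 0.05 level.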

Discussion The objective of this research was to determine if transgene activity could be changed by recurrent selection and we found that it was. These findings provide a new way to increase levels of transgene activity. We also determined the effects of selection for transgene activity on an endogenous gene with the same promoter as the transgene, finding that the protein produced by the endogenous gene was increased. Fluorescence of GFP controlled by the 27kDa gamma zein promoter was significantly increased with three generations of selection in two breeding populations. Even though the native 27kDa gamma zein was not the target of selection, its protein level was significantly increased in the selected populations of both breeding populations. The magnitudes of the increase in fluorescence and 27kDa gamma zein levels in generation 3 are similar. For fluorescence, the selected populations were 17.28 and 48.58% higher than the unselected populations for BS11 and BS31 respectively. For 27kDa gamma zein levels, the selected populations were 14.35 and 31.40% higher than the unselected populations for BS11 and BS31 respectively. We hypothesize that selection acted on regulatory sequences common to the transgene and the native 27kDa gamma zein gene. Since the common element between the transgene and the native gene is the promoter, it seems likely that selection had an impact on transcription, possibly through altered activity of one or more transcription factors. Additional studies, such as RNASeq for known 27kDa gamma zein transcription factors, are needed to determine if this is the case. It is important to note that the 3’ untranslated region of the transgene was not derived from a zein gene. Apparently, sufficient regulatory information resides in the 5’ untranslated region to allow selection for the transgene to impact levels of the zein. It would be interesting to repeat the experiment using the native 3’ untranslated region of the gene as well. 
In addition to significant increases in 27kDa gamma zein levels, multiple other zeins also had significant increases in the selected populations. These increases may be caused by transcription factors that are shared between the 27kDa gamma zein gene and those zein genes, such as PBF-1, which was shown by Wang et al. to bind to the 27, 22, and 19kDa zein promoters [20]. The 27kDa gamma zein plays a role in protein body formation, and stabilizes other zeins [21], so increased level of the 27kDa gamma zein in the selected populations may be contributing to higher stability of other zeins. This effect has been seen in quality protein maize, where higher level of 27kDa gamma zein is associated with seed vitreousness [22]. Alternatively, the significant differences in zeins could be due to genetic drift or to genotypic differences. For example, gamma zein level is highly variable across genotypes [16]. However, the lack of significant differences between selected and unselected populations for germination rate and seed mass indicates that genetic drift is not occurring for these traits, or that it is occurring at the same rate and in the same direction in the selected and unselected populations. Notably, there were no significant differences in 27kDa gamma zein level between GFP positive and negative seeds on ears segregating for the transgene. GFP negative seeds in the selected populations had elevated levels of 27kDa gamma zein that were just as high as levels in GFP positive seeds, indicating that genetic changes resulting from selection are not dependent on the presence of the transgene. This change in protein level of one gene through selection of another gene could be thought of as selection by proxy, or proxy selection. Proxy selection could be a way to use a reporter transgene as a breeding tool to alter the expression of a native gene that shares regulatory elements with the reporter transgene. 
The transgene can be segregated out after selection, leaving no transgene in the final product. In this study, the level of the native 27kDa gamma zein gene was increased by recurrent selection for activity of a GFP transgene with the 27kDa gamma zein promoter. Proxy selection has the potential to be a useful tool to alter expression of native genes whose products are difficult to quantify.

Acknowledgments We would like to express great appreciation to Adrienne Moran-Lauter for providing technical assistance. The BS11 and BS31 populations were developed by Dr. Kendall Lamkey of Iowa State University, who provided them for this study. Product names are necessary to report factually on the available data; however, the USDA neither guarantees nor warrants the standard of the product, and the use of the name by the USDA implies no approval of the product to the exclusion of others that may be suitable. USDA is an equal opportunity provider and employer.

Author Contributions Conceived and designed the experiments: MPS ALB. Performed the experiments: ALB MNS. Analyzed the data: ALB MPS. Contributed reagents/materials/analysis tools: MPS ALB MNS. Wrote the paper: ALB MPS MNS.