Pooled whole-genome sequencing of interspecific chestnut (Castanea) hybrids reveals loci associated with differences in caching behavior of fox squirrels (Sciurus niger L.).
Journal: 2018/December - Ecology and Evolution
ISSN: 2045-7758
Abstract:
Dispersal of seeds by scatter-hoarding rodents is common among tropical and temperate tree species, including chestnuts in the genus Castanea. Backcrossed (BC) interspecific hybrid chestnuts exhibit wide variation in seed traits: as the parent species (Castanea dentata and C. mollissima) have distinct seed phenotypes and tend to be handled differently by seed dispersers, phenotypic variation in BC trees is likely due to inheritance of genes that have undergone divergent evolution in the parent species. To identify candidate genomic regions for interspecific differences in seed dispersal, we used tagged seeds to measure average dispersal distance for seeds of third-generation BC chestnuts and sequenced pooled whole genomes of mother trees with contrasting seed dispersal: high caching rate/long distance; low caching rate/short distance; no caching. Candidate regions affecting seed dispersal were identified as loci with more C. mollissima alleles in the high caching rate/ long-distance pool than expected by chance and observed in the other two pools. Functional annotations of candidate regions included predicted lipid metabolism, dormancy regulation, seed development, and carbohydrate metabolism genes. The results support the hypothesis that perception of seed dormancy is a predominant factor in squirrel caching decisions, and also indicate profitable directions for future work on the evolutionary genomics of trees and coevolved seed dispersers.
Relations:
Content
Diseases
(1)
Organisms
(2)
Processes
(4)
Anatomy
(1)
Similar articles
Articles by the same authors
Discussion board
Ecology and Evolution. Oct/31/2018; 8(22): 10638-10654
Published online Oct/22/2018

Pooled whole‐genome sequencing of interspecific chestnut (Castanea) hybrids reveals loci associated with differences in caching behavior of fox squirrels (Sciurus niger L.)

Abstract

Dispersal of seeds by scatter‐hoarding rodents is common among tropical and temperate tree species, including chestnuts in the genus Castanea. Backcrossed (BC) interspecific hybrid chestnuts exhibit wide variation in seed traits: as the parent species (Castanea dentata and C. mollissima) have distinct seed phenotypes and tend to be handled differently by seed dispersers, phenotypic variation inBCtrees is likely due to inheritance of genes that have undergone divergent evolution in the parent species. To identify candidate genomic regions for interspecific differences in seed dispersal, we used tagged seeds to measure average dispersal distance for seeds of third‐generationBCchestnuts and sequenced pooled whole genomes of mother trees with contrasting seed dispersal: high caching rate/long distance; low caching rate/short distance; no caching. Candidate regions affecting seed dispersal were identified as loci with more C. mollissima alleles in the high caching rate/ long‐distance pool than expected by chance and observed in the other two pools. Functional annotations of candidate regions included predicted lipid metabolism, dormancy regulation, seed development, and carbohydrate metabolism genes. The results support the hypothesis that perception of seed dormancy is a predominant factor in squirrel caching decisions, and also indicate profitable directions for future work on the evolutionary genomics of trees and coevolved seed dispersers.

1INTRODUCTION

Successful dispersal of seeds is pivotal for a tree's evolutionary fitness because effective dispersal allows parent trees to propagate within an immediate area, may decrease density‐dependent mortality near parent trees, enables a tree's offspring to colonize distant suitable habitats (Howe & Smallwood, 1982; Janzen, 1971), expanding the species’ range, and reduces extinction risk (Larson‐Johnson, 2015; Schupp, Jordano, & Gomez, 2010; Vander Wall, 2001). Among angiosperm trees, diverse strategies for seed dispersal have evolved (Friis, Crane, & Pederson, 2011), including wind dispersal (e.g., Acer, Platanus, Populus), dispersal by frugivores (e.g., Prunus, Diospyros), and dispersal by granivores that cache seeds in the ground. Much of the forest canopy in eastern and central North America is, or was historically, dominated by granivore‐dispersed genera (e.g., Quercus, Juglans, Castanea) in the order Fagales (Leopold, McComb, & Muller, 1998). Across many tree taxa, including Fagales (Larson‐Johnson, 2015), dispersal by seed‐caching animals is actually associated with greater average dispersal distance than wind dispersal (Thomson, Moles, Auld, & Kingsford, 2011).

The coevolved relationship between trees and scatter‐hoarding seed dispersal agents is a “conditional mutualism” (Theimer, 2005): rodents and birds consume large numbers of seeds, but cache enough in the soil—sometimes at considerable distances from the mother tree—to allow a few to escape predation and germinate. This coevolution has resulted in a wide variety of seed‐packaging strategies employed by granivore‐dispersed trees—from large, hard‐shelled seeds only edible to rodents with specialized dentition (Stapanian & Smith, 1978; Tamura & Hayashi, 2008) to small nuts that can be carried and eaten by birds (Johnson & Webb, 1989; Richardson, Licthi, & Swihart, 2013). The vast majority of seeds cached by scatter‐hoarding rodents are consumed at some point (Brodin, 2010; Calhane, 1942; Tamura, Hashimoto, & Hayashi, 1999; Thompson & Thompson, 1980), so there must be a fitness advantage for cached versus noncached seeds in order for trees and squirrels to mutually benefit from scatter‐hoarding (Zwolak & Crone, 2012). Caching in the soil hides the seed from other potential predators at the surface and can reduce desiccation (Vander Wall, Kuhn, & Beck, 2005; Schupp et al., 2010; Zwolak and Crone 2012). In relatively thin‐shelled nuts such as oak and chestnut, death through desiccation is a danger for exposed seeds (Connor, Donahoo, & Schafer, 2006), so these species are likely to derive a net benefit from their relationship with scatter‐hoarding rodents.

American chestnut (Castanea dentata, abbreviated Cd) was formerly a keystone species in the forests of the Appalachian Mountains of the eastern United States and adjacent regions before its elimination by chestnut blight (Cryphonectria parasitica). It is currently the subject of a breeding program (led by the American Chestnut Foundation, abbreviated TACF) using several generations of backcrossing to introgress blight‐resistance genes from Chinese chestnut into the American chestnut genome. Dispersal by squirrels will be important for chestnut restoration, which is the goal of the TACF breeding program. Chestnut is likely to be planted largely in reclaimed minelands, old fields, and other open sites (Jacobs, 2007). The success of restoration based on hybrids will depend on whether hybrids are dispersed and regenerate naturally (Jacobs, Dalgleish, & Nelson, 2012). Scatter‐hoarding in the eastern North American deciduous forest is primarily practiced by two species of tree squirrels, the eastern gray squirrel (Sciurus carolinensis) and the fox squirrel (Sciurus niger), as well as the blue jay (Cyanocitta cristata) (Blythe, Lichti, Smyser, & Swihart, 2015; Moore, McEuen, Swihart, Contreras, & Steele, 2007; Richardson et al., 2013; Smith and Stapanian 2002; Steele, Smallwood, Spunar, & Nelsen, 2001). Although third‐generation backcross (BC3) chestnuts used for species restoration will inherit only a small fraction of the Cm genome, this genetic material may influence ecologically important traits (Worthen, Woeste, & Michler, 2010); for example, BC3 seeds were dispersed farther on average than seeds of Cd (Blythe et al., 2015). Hybrid background may affect this crucial, coevolved ecological relationship in other ways as well.

The large number of backcrossed hybrid chestnuts generated by the blight‐resistance breeding program of TACF and the publication of a Chinese chestnut draft genome (Carlson et al., 2014; Pereira‐Lorenzo et al., 2016) make backcrossed trees an interesting model for studying the genomic basis of interspecific differences in seed dispersal. Tree squirrels, including Callosciurus erythraeus and Sciurotamias davidianus, are important dispersal agents of Cm in its native range (Xiao, Gao, & Zhang, 2013), but Cm has evolved alongside a number of nut‐bearing trees (e.g., Lithocarpus spp, Camelia oleifera) that are not present in eastern North America and may have indirectly exerted selective pressure on Cm by competing for the attention of seed dispersers. Cm seeds are larger, on average, than American chestnuts, and squirrels are more likely to cache a large seed—and carry it farther—than a small one (Jansen et al., 2002; Xiao, Zhang, & Wang, 2005). In addition to nut size, squirrels are aware of physiological cues in seeds that signal the start of germination and take measures to maximize the nutritional utility of nondormant seeds (Fox, 1982; Smallwood, Steele, & Faeth, 2001; Steele et al., 2001). Dormant seeds are more likely to be cached and carried a longer distance than nondormant seeds, which are usually eaten without caching. All chestnuts go through true dormancy with a chilling period (Baskin & Baskin, 1998), so any difference in dormancy‐breaking among Cd, Cm, and hybrids would presumably be marginal. Physiological differences in the seeds of different chestnut species, however, could be interpreted by squirrels as signals pertaining to dormancy and thus influence caching (Steele et al., 2001; Sundaram et al. 2015, Sundaram, 2016).

Because phenotypic variance in seed traits can be caused by variable inheritance of alleles from Cm in the Cd genomic background of BC3s, we sought to identify loci that influence seed traits and dispersal by associating the presence or absence of Cm alleles in a mother tree with differences in dispersal of its seeds, using pool‐seq (Schlötterer, Tobler, Kofler, & Nolte, 2014) and a whole‐genome genotyping strategy. The goal of our experiment was to determine whether there are loci in the genomes of hybrid chestnut where a Cd/Cm, versus a Cd/Cd, genotype is associated with greater seed dispersal distance and likelihood of caching versus consumption of nuts. Our research questions were as follows:

  • Do differences in seed size or other heritable characteristics influence differences in the way dispersers handle, consume, and/or cache backcrossed hybrid chestnut seeds?
  • What is the genetic basis of seed traits that lead to differences in seed disperser (squirrel) behavior during interactions with hybrid chestnuts?
  • What models of granivore/seed interaction receive strongest support and what research directions are implied by the data?

2MATERIALS AND METHODS

2.1Seed collection

Seeds were collected in late September and early October from a planting of several hundred BC3 ([(Castanea mollissima × dentata) × dentata] × dentata) × dentata) chestnuts at Purdue University's Lugar Farm in Tippecanoe County, IN. The purpose of the planting is blight resistance screening. Most seed parents were 11 years old, but several were 4 years old in 2014 at the start of the study. “Clapper,” a BC1 tree, was the blight‐resistance donor and only source of Cm genetic material in this backcross population. Cm nuts were obtained from a pair of trees planted as blight‐resistant checks in the Lugar Farm orchards. Cd nuts were obtained from two adult trees growing at the Purdue Wildlife Area in Tippecanoe County, IN. BC3 seed parents were chosen based on seed size, with roughly equal numbers of large‐seeded, small‐seeded, and average‐seeded trees chosen, in order to capture a wide range of phenotypic variation. Seed parents were tagged with durable individual plastic nursery labels, but due to large annual variation in the size of seed crops and the loss of some seed parents due to chestnut blight, different seed parents were chosen each year of the study. Seeds were collected by knocking burrs off the parent tree using a ~2‐m wooden pole and manually removing seeds from the bur if necessary. Seeds were floated in water to determine viability; floating seeds were deemed nonviable and discarded.

2.2Seed measurements

Seeds were stored in a cooler (4.4°C) following de‐burring and floating and stratified in peat moss to maintain viability during cold storage. In October, at least 10 seeds from each seed parent were weighed on a digital scale to determine average seed mass. Length (from seed base to tip) and width (across the broadest part of the seed) were determined using digital calipers. In 2015, desiccation was also measured by weighing seeds immediately after collection and again 80 days following collection.

2.3Seed tagging

Tagging was carried out immediately before dispersal trials to avoid spoilage of seeds. A method similar to that employed by Xiao, Jansen, and Zhang (2006) and Hirsch, Kays, and Jansen (2012) was used. A hole was made in the proximal (wider) end of each seed using either a botanical dissecting needle or a small (~2 mm) drill bit. A piece of 24‐gauge green floral wire approximately 12 cm long was looped through the hole and twisted to secure it. A piece of brightly colored waterproof tape was attached to the end of the wire and labeled with a number designating the seed parent.

2.4Dispersal trials

Dispersal trials were conducted in late October, November, and December of each year at four feeding stations placed in and around the Lugar Farm chestnut plantings in Tippecanoe County, IN in a manner similar to the methods of Lichti, Steele, Zhang, and Swihart (2014). In 2016, a feeding station in Woodford County, IL adjacent to the campus of Eureka College was added. At both locations, fruiting chestnuts were present in addition to black walnut (Juglans nigra) and several oak species. Fox squirrels (Sciurus niger) were the only scatter‐hoarding squirrel species observed at either feeding station. Feeding stations were prebaited to acclimate local squirrels to the feeding locations in August‐September prior to dispersal trials. During dispersal trials, 10 (2016) or 25 (2014–15) seeds from 5 to 6 (2016) or 3 to 4 (2014‐15) parent trees were randomly distributed near a post at the center of each feeding site. Seeds were left out for 4‐5 days, and seed fates (cached, consumed, or left at feeding station) were recorded and dispersal distances measured with a forestry measuring tape attached to the post at the center of the feeding site. Intensive searches for seeds were conducted up to 20 m from the feeding site, although some seeds were found outside this distance due to the high visibility of the tags. Trials started in late October or early November and continued through December until the soil surface froze. Relationships between seed dimensions and dispersal parameters were statistically investigated using the lm and glm packages of R software version 3.2.3 (R Core Team 2015).

2.5DNA Isolation

DNA was isolated from BC3 seed parent trees following dispersal trials. Dormant twigs were collected for DNA extraction in early spring 2016 and 2017. Terminal sections (about 3–5 cm) of first‐year twigs were ground to a fine powder in liquid nitrogen using a mortar and pestle. The ground tissue was placed in 5 ml of heated (50°C) CTAB extraction buffer in a 15‐ml conical tube and incubated 4‐8 hr at 50°C. Following incubation, 1 ml of 20 mg/ml proteinase K solution was added and samples were incubated for an additional 15 min. Five milliliters of 25:24:1 phenol:chloroform solution was added, and samples were purified using a standard phenol:chloroform extraction (Doyle and Doyle 1987) followed by precipitation of DNA using 0.2 M sodium chloride and isopropanol. After pelletting and resuspending samples in TE buffer, contaminants were removed using Zymo Research OneStep PCR Inhibitor Removal kits (Zymo Research). Following purification, samples were quantified using a Nanodrop 8000 (ThermoFisher Scientific), and 2% agarose gel, pooled according to dispersal parameters, then submitted to the Purdue Genomics Core Facility for sequencing.

2.6DNA pooling and sequencing

Pools of samples were made for different phenotypic classes based on (a) mean dispersal distance for cached seeds and (b) frequency of caching. The strong‐dispersal pool (Pool A; eight samples) contained DNA from parents that produced seeds with a long dispersal distance (>5 m average dispersal distance for cached seeds) and high frequency of caching (5%–59%) of recovered seeds in caches), including one Chinese chestnut. A moderate‐dispersal pool (Pool B; seven samples) contained parents that produced seeds with a shorter dispersal distance (<5 m) and low frequency (4%–14%) of caching. The weak‐dispersal pool (Pool C; 10 samples) contained seed parents that produced seeds with a frequency of caching and dispersal distance near 0, including one American chestnut (Table 1). Twenty microliters of DNA at concentration 200 ng/μl from each individual sample was included in a pool, and the combined sample was submitted for library construction and sequencing; samples were sequenced as separate libraries on one Illumina HiSeq 2500 (Illumina Inc., San Diego, CA, USA) lane. Reads were paired‐end, 100 bp in length. The individual genomes of “Clapper,” several unrelated Chinese chestnuts, and two American chestnuts were sequenced separately with two samples per lane as part of a blight resistance study (LaBonte et al., in preparation).

Table 1
Summary of seed dispersal data for trees in three genotyping pools, showing the pool each individual parent tree was assigned to, its mean seed weight, the number of seeds cached, the average distance seeds were cached away from the feeding site, and the total number of seeds found for that individual (cached + eaten)
PoolSpeciesYearMean seed mass (g)N (cached)Mean distance (m)Total foundTotal offered
AaCC2014–158.53810.492775
ABC120153.7758.252025
ABC120156.82137.3412125
ABC120164.14110.981530
ABC120163.3527.81330
ABC120144.5179.081925
ABC120144.0329.642850
ABC120142.5566.403050
BbBC120143.5534.532350
BBC120163.1624.922040
BBC120164.5714.422550
BBC120162.6713.692340
BBC120163.5012.85930
BBC120163.8213.521330
BBC120163.4814.92930
CcBC120162.53001220
CBC120163.2200940
CBC120164.01002450
CBC120163.66001420
CBC120142.99002450
CBC120143.47002450
CBC120152.54001925
CBC120151.4700725
CBC120143.09002450
CAC2014‐151.5211.853050

aStrong‐dispersal pool; bModerate‐dispersal pool; cWeak‐dispersal pool.

John Wiley & Sons, Ltd

2.7Genome assembly and SNP calling

Short reads were assembled to the draft Chinese chestnut reference genome v1.1 (Carlson et al., 2014; Pereira‐Lorenzo et al., 2016) using the Burrows‐Wheeler aligner (bwa) (Li and Durbin 2009). Alignments were processed and polymorphisms called for each pool of samples using Picard Tools and the Genome Analysis ToolKit (GATK) best practices workflow (DePristo et al. 2011; Van der Auwera et al. 2013), minus the quality score recalibration step. When calling SNPs using the HaplotypeCaller tool in the GATK, ploidy was set to twice the number of individuals in the pool.

2.8Analysis of SNP data

Several custom Perl scripts were used for processing of polyploid SNP data files generated by the GATK pipeline. The goal of these scripts was to discriminate between predicted genes that had two genotypes well‐represented in a pool (Cm/Cd sites) and genes that had a single genotype fixed or nearly fixed within a pool (Cd/Cd sites) for SNPs within the predicted gene sequence, informed by the observation (LaBonte et al. 2018, in preparation) that heterozygosity values in coding sequences of chestnut hybrids are usually much higher than either parent species. There are no Cm/Cm sites in a BC3 genome. To identify Cm/Cd sites in pooled BC3 genomes, the multi‐sample SNP file was filtered for polymorphisms occurring within predicted genes (AUGUSTUS gene prediction; Stanke et al. 2006) with strong (e‐value <0.001) alignments to the curated Uniprot/SwissProt protein database. For each SNP (minimum depth = 8) between the predicted transcription start and stop sites of each gene, a heuristic hybridity estimator (HE) was assigned to approximate the proportion of individuals that were heterozygotes (Cm/Cd) within a pool. If major allele frequency for a SNP within a pool was between 0.45 and 0.55, HE was assigned a value of 0.75 (most individuals heterozygous), if it was between 0.55 and 0.70, HE was 0.5, if it was between 0.70 and 0.85, HE was assigned the value 0.25, and if the major AF was >0.85, HE was assigned 0 (all or nearly all individuals homozygous). This estimate was averaged across all the SNPs in each predicted gene sequence. When more than 50% of the SNPs in a genotype's sequence could not be scored, that genotype‐locus combination was considered missing data. Finally, HE was averaged across 10‐gene bins for each pool and compared, by bin, among pools. Binning was carried out to detect linked Cm haplotype blocks and avoid detecting single outlier genes as false‐positives. Loci potentially contributing to interspecific differences in seed dispersal were identified as those 10‐gene bins that had a difference in average HE values >2 standard deviations greater than the average HE difference between the strong‐dispersal pool and the moderate‐ and weak‐dispersal pools. Genes within these regions were annotated using the UniProt entries for aligned proteins from the UniProt KB/ SwissProt database (Xenarios 2016). Predicted molecular interactions were analyzed using the STRING protein database.

All the BC3 trees in our sample inherited 100% of their Cm alleles from “Clapper,” but only regions segregating in “Clapper” (loci with Cd/Cm genotypes) were informative, that is, about ~50% of the genome of “Clapper” (a BC1 tree), the other half were Cd/Cd. Thus, our inference on the genomic basis of interspecific difference in seed dispersal was limited to those heterozygous regions. Of loci that were hybrid in “Clapper,” any given BC3 descendant of “Clapper” was expected to retain a Cm allele at one in four loci, with the rest acquiring a second Cd allele in two rounds of meiosis. Therefore, at a locus known to have a Cd/Cm genotype in “Clapper,” a random sample of “Clapper”‐derived BC3s is expected to have one Cm allele observed out of every eight (one Cm/Cd and three Cd/Cd). As we genotyped BC3s in pools, opportunities for random sampling error were present; a given individual's genotype might be over‐represented at a locus, biasing allele frequency. Over‐representation of an individual could be due to differences in DNA quality, inaccurate estimates of DNA concentration prior to pooling or random inclusion of more DNA fragments from one individual during the high‐throughput sequencing process. We developed a Perl script to estimate the likelihood that more Cm alleles than expected by chance alone were present at a given SNP locus in the pooled data.

First, a panel of eight Cm genomes with no evidence of hybrid background, two Cd, and “Clapper” whole‐genome sequences (LaBonte et al. 2018) were used to filter a the pooled whole‐genome SNP file for loci with one allele fixed in Cd, one allele fixed in Cm, and a Cm/Cd genotype in “Clapper.” The coordinates of these loci were recorded as informative SNPs because markers at those loci allowed us to make inferences about the effects of Cm alleles on seed dispersal. Only informative SNPs were kept from the pooled genome SNP file for the analysis.

Next, the program made random draws from arrays of 100 binary values (0 for Cd, 1 for Cm) set to represent the expected species allele frequency at a given SNP locus for the strong‐dispersal, moderate‐dispersal, and weak‐dispersal pools. As the strong‐dispersal pool contained one Cm individual, the expected frequency of Cm alleles at a locus that was hybrid in “Clapper” was 3/8 rather than 1/8; therefore, the array of potential alleles contained 38 “1” values and 62 “0” values. The moderate‐dispersal pool only contained BC3s, so 1/8 was the expected fraction of Cm alleles. As the weak‐dispersal pool contained one Cd individual, the expected fraction of Cm alleles was slightly lower (1/10). To simulate the process of pooled DNA assembly, random draws were made from this distribution up to a simulated read depth of 8, and the number of Cm alleles in the sample was tallied. This process was repeated 1,000,000 times for each pool to create null distributions for Cm allele frequencies in BC3 pooled genomes at “Clapper” hybrid loci and at an assembly depth approximately equal to our actual assemblies.

Subsequently, a p‐value was assigned for each informative SNP in a pool, based on the percent of simulated SNP genotypes that had a count of Cm alleles greater than or equal to the observed number of Cm alleles at that SNP locus. If this percentile‐based p‐value was lower than 0.05, the null hypothesis that Cm alleles were randomly distributed at the locus in a pool was rejected; low p‐values were interpreted as evidence that more Cm alleles were present in the pool than expected by chance alone. For each predicted gene in the genome that contained informative SNPs, an average p‐value was computed using all informative SNPs within the predicted gene sequence. Predicted genes where the null hypothesis was rejected in the strong‐dispersal, but not in the moderate‐ or weak‐dispersal pools were included as potential candidates for influencing seed dispersal. p‐Values assigned to loci using this method were also used to validate candidates identified by the HE heuristic.

2.9Validation of predicted genes

To validate predicted genes from the whole‐genome analysis, cDNA data for a number of species in the order Fagales were aligned to predicted proteins from the Castanea mollissima genome (Carlson et al., 2014; Staton et al., 2014; Appendix S1). cDNA contig consensus sequences were aligned to a database of predicted Castanea protein sequences using the Diamond sequence aligner (Buchfink et al. 2015). A predicted gene was counted as having transcript support if at least one cDNA contig had the predicted gene's protein sequence as its best alignment. The Arabidopsis best hits for each predicted chestnut peptide in genome regions determined to be associated with seed dispersal by the HE method were submitted to gene ontology analysis using g:prolifer (Reimand et al. 2016).

3RESULTS

3.1Seed phenotypes

Dispersal trials were conducted for 13 BC3, one American, and one Chinese chestnut in 2014; 11 BC3, one American, and one Chinese chestnut in 2015; and 12 BC3 in 2016 (Table 1). The average mass (mean ± SD) of BC3 seed over the three years was 3.51 ± 1.47 g, ranging between 1.12 and 7.78 g. The average for American chestnut was 3.05 ± 0.17 g; for Chinese chestnut, the average was 7.82 ± 1.01 g. Average seed length of BC3 in 2014 and 2015 was 22.12 ± 2.43 mm, with a range of 17.73–25.86 mm; length for American chestnut was 20.49 ± 0.16, and for Chinese chestnut average length was 24.27 ± 0.69. Width across the wider axis of the attachment‐scar end of the nut was 20.63 ± 3.88 mm for BC3, ranging between 14.42 and 26.4 mm; for American chestnut the mean was 20.04 ± 0.70 mm and for Chinese chestnut 27.29 ± 0.46 mm. In 2015, Cd seeds lost more of their mass through drying (15.85%) than Cm (10.26%) over 2.5 months of cold‐room storage (October 28 to January 14). The individual half‐sib seed lots with the highest rate of caching (68%, 55%, and 25% of seeds recovered in caches rather than recovered eaten) lost moisture at rates similar to Chinese chestnut (8.99%, 8.55%, and 11.61% of moisture lost, respectively) while seeds that were less likely to be dispersed and cached had highly variable (5.41%–31.64%) loss of mass due to drying and an average rate of moisture loss (17.11%) closer to American than to Chinese chestnut.

3.2Seed dispersal

Average recovery rate (% of tagged seeds recovered after 4–5 days) was 66% in 2014, 42% in 2015, and 46% in 2016. Of seeds that were recovered, in 2014, 36.5% were eaten without being moved away from the feeding site, 49.3% were moved and eaten, and 20.5% were moved and cached. In 2015, these numbers were 40.8%, 36.1%, and 23%, respectively; in 2016, they were 66.1%, 29.0%, and 4.8% cached, respectively. In 2016, the apparent shift in proportions was driven by a low caching rate at the Indiana site rather than the addition of the Illinois site. Average dispersal distance for individual BC3s with more than one dispersal event ranged from 4.92 to 9.08 m, which was less than the average for Chinese chestnut (10.49 m) and greater than the single American chestnut that was cached (1.85 m). Chinese chestnuts were dispersed farther and cached more frequently (29.6% of the time) than American chestnuts (3.2% of the time) and most BC3s (averaged over 14 families with at least one dispersed seed: 16.5%). Pools created for genotyping reflected the wide range of variation in dispersal. The average dispersal distance for seeds of trees placed in the strong‐dispersal pool (seven BC3 and one Cm) was 9.08 ± 4.12 m and average caching frequency (no. cached / no. found) 25.3% (Table 1). For the moderate‐dispersal pool (seven BC3), the average distance dispersed was 4.12 m ± 0.78 m, with a caching frequency of 8.2%. In the weak‐dispersal pool (nine BC3 and one Cd), the average distance dispersed was 0.185 ± 0.59 m, and the caching frequency was 0.5%. Mean individual seed size was a statistically significant predictor of mean individual caching distance in a simple linear regression where individuals with average seed dispersal distance 0 (i.e., seeds that were only recovered eaten at the feeding platform) were excluded (t1,25 = 4.43, p = 0.0002, adjusted r2 = 0.42) and seeds cached/total number of seeds recovered (t1,25 = 2.26, p = 0.03, r2 = 0.14) (Figure 1). In a binomial regression, mean seed mass was not a significant predictor of whether an individual mother tree had at least one seed recovered in a cache (z value = 1.394, p = 0.163).

Figure 1

Scatterplot with simple linear regression line of average distance to caching (m) over average seed mass (g) for 25BC3, 2 Castanea dentata, and 2 C. mollissima mother trees measured 2014–2016

3.3Pooled genome SNP genotyping

Enough 100‐bp paired‐end reads (57–67 million) were obtained for each pool to cover the ~800 Mb chestnut genome between 7.2 and 8.5 times, so that each individual tree in each pool was represented by about one read at any locus in the genome. A small fraction of total bases (~2%) were removed from each sample by Trimmomatic due to low read quality prior to analysis. In the strong‐dispersal pool, 341363 informative SNPs with coverage >8 were identified; 177,884 were identified in the moderate‐dispersal pool, and 215,590 were identified in the weak‐dispersal pool. As expected, “Clapper” had a Cd/Cm genotype at 50% of the loci with one allele fixed in Cm and another in Cd, and a Cd/Cd genotype at the other 50%.

3.4Analysis of hybrid regions among pools

The mean value of the heuristic hybridity estimator (HE) over all SNPs in predicted genes with coverage ≥8 was highest for the strong‐dispersal pool (0.44 ± 0.123) (Figure 2) and lowest for the weak‐dispersal (0.294 ± 0.164) (Figure 2). For the moderate‐dispersal pool, the mean value of HE was 0.313 ± 0.174 over all predicted genes (Figure 2). When windows of 10 genes were used, mean difference in HE among windows was greatest between the high‐ and weak‐dispersal pool (0.155 ± 0.088), but the difference between the strong‐ and moderate‐dispersal pools was similar (0.137 ± 0.101) and both were much larger than the average difference between weak‐ and moderate‐dispersal pools (0.019 ± 0.085). Of 2,714 bins of ten predicted genes, there was one region where the difference in HE between the strong‐dispersal pool and the weak‐dispersal pool was >3 standard deviations greater than the mean difference, and 53 bins >2 standard deviations above the mean. There were two bins for which the difference in HE between the strong‐dispersal pool and the moderate‐dispersal pool was >3 SD above the mean, and 58 where the difference was >2 SD above the mean (Table 2).

Figure 2

Histograms of heterozygosity estimates forSNPs in the pooled genome sequences of (a) sevenBC3 and one Chinese chestnut with more frequent and longer‐distance nut dispersal (b) sevenBC3 chestnut with intermediate dispersal distance and low caching frequency and (c) eightBC3 and one American chestnut with low caching frequency and short dispersal distance

Table 2

Notable predicted genes within genome intervals identified based on lower major allele fraction (higher proportion of Cm alleles) in a pool of BC3 chestnut mother trees with long caching distance and high proportion of seeds cached, relative to other BC3 chestnuts in the study, including predicted molecular function and simulation‐based statistical support for an excess of Cm alleles at the predicted gene

LGaBP intervalbSDcGenedUniprot hiteAnnotationfSNPsgp < 0.05, pool AhP(A)iP(B)jP(C)kBC1 Het.lCd Het.m
LGA83,772,158–8,408,96743.32A.g10648LRX3_ARATH (77%)Extensin‐like protein; cell wall formation1020.170.590.590.7420.000
A.g10657VIL1_ARATH (48%)Involved in vernalization, flowering regulation1980.190.870.410.5220.042
LGC48,273,510–48,654,6883.09C.g6050EMF2_ARATH (46%)Polycomb group protein; flower development2680.240.370.820.8380.068
LGG40,408,926–40,687,7272.61G.g5214DLO2_ARATH (39%)Salicylic acid catabolic process640.060.910.030.4460.175
LGH46,084,023–46,409,4692.29H.g5097LRP1_ARATH (50%)Involved in formation of female flower parts920.380.840.820.0930.018
LGI25,672,061–25,978,4232.71I.g3291NLTL5_ARATH (33%)Lipid binding and transport100.050.00na0.4290.000
I.g3304C94A2_VICSA (52%)Hydroxylation of fatty acids100.640.910.640.6000.000
LGL59,955,058‐60,179,3312.92L.g7556PME51_ARATH (59%)Demethylesterification of cell wall pectin200.050.91na0.1600.031
L.g7302CESA2_ARATH (78%)Crystallization of cell wall microfibrils1000.480.910.860.2030.092
LGL65,537,114‐65,794,6832.89L.g8192TPS13_RICCO (62%)Sesquiterpene synthase50100.180.910.420.7000.022
L.g8191NES1_FRAAN (61%)Synthesis of volatile mono‐, sesquiterpenes310.430.910.350.6710.155
L.g8198NES1_FRAVE (58%)Synthesis of volatile mono‐, sesquiterpenes3130.390.910.270.5540.142
L.g8208NES2_FRAAN (60%)Synthesis of volatile mono‐, sesquiterpenes920.300.910.420.6670.123

aLinkage group and bbase position in the Chinese chestnut draft pseudochromosome assembly (Carlson et al., 2014); cNumber of standard deviations that the heterozygosity estimator (HE) for the strong‐dispersal pool exceeded either the weak‐dispersal or no‐dispersal pools; dNumeric code for a predicted gene (AUGUSTUS) in the Chinese chestnut draft reference genome (Carlson et al., 2014); eTop hit from the Uniprot/Swissprot curated protein sequence database for the predicted chestnut gene, with percent amino acid identity; fLikely function based on the top Uniprot hit; gInformative SNPs in the gene sequence; hSNPs where Cm allele frequency in the strong‐dispersal‐pool was in the top 5% of simulated random Cm allele frequencies; iAverage percentile value of all SNPs (smaller value = more Cm alleles than expected) in the predicted gene for the strong‐dispersal pool; jAverage percentile value of SNPs in the weak‐dispersal pool; kAverage percentile value of SNPs in the no‐dispersal pool; lHeterozygosity for all SNPs in the gene for “Clapper,” the Cm allele donor for all BC3 trees in the experiment; mHeterozgosity for all SNPs in the gene, averaged over two individual Cd samples.

John Wiley & Sons, Ltd

3.5Annotations of genes within hybrid regions

Candidate genes for differences in seed dispersal were analyzed for 18 bins with the largest deviations from the mean difference in the heterozygosity estimate between the strong‐dispersal pool and the moderate‐ and weak‐dispersal pools. Fourteen of these bins had a large difference in heterozygosity between the strong‐dispersal pool (pool A) and the others (pools B and C); three were identified based on the difference between the strong‐ and moderate‐dispersal pool; and one was identified based on the difference between strong‐ and weak‐dispersal pools while the strong and moderate‐dispersal pools showed no difference (Table 2). Of these 14 genome regions, seven that had additional support from the simulation‐based estimation of significance were chosen as the most likely candidates (Table 2). Additional individual candidate genes (rather than regions) were identified based on simulation‐based evidence of Cm alleles in the strong‐dispersal pool (Table 3). Examining annotations of predicted genes in these regions revealed several that have plausible roles in seed development and subsequent seed handling and dispersal by squirrels (Figure 3). Many of the predicted genes in these regions aligned to cDNA sequences from chestnuts and other nut‐bearing species in the order Fagales (Table 5). Gene ontology terms that were enriched in candidate regions, as determined by the difference in HE between strong‐dispersal and weak‐dispersal pools, included “regulation of cellular localization” (p = 0.0181), “membrane‐bounded organelle” (p = 0.0428), and “plasma membrane” (p = 0.0181).

Table 3
Individual predicted genes identified as candidates for differences in seed dispersal based on significant departures from expected allele frequencies in a pool of BC3 chestnut mother trees with long caching distance and high proportion of seeds cached, relative to other BC3 chestnuts in the study
GeneaStrong‐dispersal SNPsAbSignificant SNPscFSTdUniproteAnnotation
A.g1155690.820.343PME_PRUPEPectin methylesterase; cell wall remodeling
A.g185560.860.349GWD2_ARATHAlpha‐glucan water dikinase
A.g199151.000.655BH030_ARATHbHLH transcription factor
A.g324651.000.731C94A2_VICSACytochrome p450 oxidase; fatty acid hydrolysis
A.g518431.000.811BZR1_ARATHModulates ovule development; brassinosteroid signaling
A.g556280.890.319GEX1_ARATHGametophyte development and embyrogenesis
A.g614070.780.965AOP1C_ARATHDioxygenase potentially involved in producing glucosinolates
A.g773411.000.564GEML5_ARATHMaintenance of seed dormancy by abscisic acid
A.g809150.830.496LHT1_ARATHLysine histidine amino acid transporter
B.g305940.800.252PERR_RAUSEBiosynthesis of monoterpenoid indole alkaloids
E.g1046110.920.662C76AD_BETVUCytochrome p450 oxidase in the betalain synthesis pathway
E.g4071140.780.240WAK4_ARATHCell‐surface kinase that binds to pectin; cell expansion
E.g746731.000.589STC_RICCOSugar carrier protein C; carbohydrate transport
E.g822980.800.899HPPR_PLESUHydroxyphenylpyruvate reductase; Biosynthesis of rosmarinic acid
E.g91150.830.719GAIP_CUCMADELLA protein; repressor of gibberellin signal pathway
F.g2981210.810.865TRA1_MAIZEPutative AC transposase from transposon “Activator”
F.g92741.000.365RKS1_ARATHCarbohydrate‐binding serine/threonine protein kinase
F.g941100.770.885SD16_ARATHReceptor‐like kinase; regulation of cellular expansion
H.g31560.860.433BPG2_ARATHBrassinosteroid‐mediated transcriptional regulation
J.g603541.000.583AKR1_SOYBNAldo‐keto reductase
K.g162231.000.749BIOF_ARATH8‐amino‐7‐oxononoanoate synthase
K.g222140.800.565FRL4A_ARATHFRIGIDA‐like protein; flower development
L.g616340.800.910MTEF8_ARATHTranscription termination factor
L.g6184130.770.722GIF2_ARATHTranscription coactivator; active in cotyledon tissue

aAUGUSTUS predicted gene from the Chinese chestnut draft reference genome (Carlson et al., 2014) preceded by linkage group letter; the initial letter indicates the pseudochromosome in which the predicted gene was found; bSNPs in the upper 5% of the distribution of simulated random Cm allele frequencies in the strong‐dispersal pooled genome. No SNPs significant at this level were identified in these intervals in the weak‐ or no‐dispersal pools; cPercent of informative SNPs in the gene sequence, for the high‐dispersal pool, in the upper 5% of simulated Cm allele frequencies; dFST calculated for SNPs within the predicted gene among a sample of individual Chinese and American chestnut genome sequences; eTop hit for the predicted protein from the Uniprot/Swissprot curated protein database.

John Wiley & Sons, Ltd
Figure 3

Depiction of the role of candidate genes in Castanea nut development (upper right) and the perceptions of nut phenotypes by squirrels (Sciurus) that are hypothesized to cause someBC3 chestnuts to be dispersed farther and cached more frequently than others

4DISCUSSION

4.1Dispersal trials

Previous studies have indicated that seed‐caching rodents are more likely to cache (rather than eat) relatively large seeds, and carry larger seeds farther before caching (Jansen et al., 2002; Moore et al., 2007; Tamura & Hayashi, 2008; Xiao et al., 2005). The dormancy status of seeds is also likely a factor in caching decisions (Moore et al., 2007; Smallwood et al., 2001; Xiao, Gao, Jiang, & Zhang, 2009; Xiao, Gao, Steele, & Zhang, 2009): squirrels are more likely to eat seeds perceived as nearing germination and to cache seeds perceived as dormant. The value of squirrel caches to nut survival (fitness) has been amply demonstrated (Lichti, Steele, & Swihart, 2017), so seed phenotypes that influence squirrel caching have evolutionary significance for nut‐bearing trees. In our study, seed size, as measured by seed mass, was associated with both dispersal distance (Figure 1) and likelihood of caching vs. consumption. Seed mass was highly variable among the backcrossed trees in the study, which we deliberately chose based on seed size (Table 1) from a much larger population of BC3 chestnuts. Seeds with greater mass tended to exhibit greater dispersal distance and greater likelihood of caching vs. immediate consumption by squirrels. Chinese chestnuts in this study were much larger than American chestnuts (Table 1), so BC3 hybrids with larger seeds (similar to Chinese chestnut) were more likely to be cached. Variation in seed dispersal distance and caching likelihood among BC3 trees, however, was not fully explained by seed size (Figure 1). There is no documented difference in seed dormancy between American and Chinese chestnut—both species must undergo a dormant period of several months to germinate (Saielli et al. 2012), and in both species, the seed is metabolically active during its dormant phase because chestnuts are recalcitrant seeds (Leprince, Buitnik, & Hoekstra, 1999; Roach et al., 2009). The sugar content of chestnuts under cold storage increases while starches diminish (Ertan, Erdal, Gulsum, & Algul, 2015). Differences in genes that regulate dormancy or signals that communicate dormancy to squirrels, or binding sites for regulatory molecules, are therefore less likely to be false‐positives than structural or housekeeping genes.

4.2Genome scan for loci involved in caching decisions

Interspecific hybrid phenotypes are not always intermediate between parents (Woeste et al. 1998), and intermediate genotypes do not simply correspond to intermediate phenotypes. In hybrid chestnuts, for example, gene interactions can cause unpredictable dormancy‐related phenotypes: hybrid seeds derived from a cross with Allegheny chinkapin (Castanea pumila) as pollen parent and Cm as the seed parent exhibited reduced seed dormancy (Jaynes 1963; Metaxas 2013). No precocious germination of seeds was observed in the course of our experiment, but subtle phenotypes may have been present. In red oaks (Quercus section Lobatae), which are closely related to chestnuts, dormancy and germination appear to be controlled primarily by the pericarp, the dry fruit structure that makes up the hard outer “shell” of both oak acorns and chestnuts (Smallwood et al., 2001). There is evidence that squirrels make use of changes in the pericarp to sense impending germination in oaks and chestnuts. These changes include degradation of pericarp waxes and the release of low molecular weight volatile compounds from inside the pericarp (Paulsen et al., 2014; Sundaram et al. 2015, Sundaram, 2016). By demonstrating that a germinating white oak embryo inside a “dormant” red oak shell is perceived as dormant by squirrels, Steele et al. (2001) showed that signals at the pericarp surface may be more important than signals from the kernel. If some chestnut hybrids have a thicker pericarp wax layer than American chestnut, squirrels might perceive the seeds as reliably dormant and cache them more frequently.

By sequencing pools of chestnuts with different seed dispersal phenotypes, we attempted to identify regions of the genomes of BC3 hybrids where Chinese chestnut allele frequencies were higher than expected in the most frequently dispersed trees. The HE statistic seems to have captured the elevated heterozygosity that is characteristic of Cm/Cd hybrid gene loci. The inference space for our study was limited to the phenotypic effects, in a BC3 population, of loci where “Clapper” may have contributed a Cm allele, which only included half of the genome. Our ability to find genomic regions that were hybrid in one pool but not the others was impeded by the uncertainty associated with estimating heterozygosity in pooled sequence data. By comparing results from pooled data with individual chestnut genome sequences, we determined that the individual “Clapper” genome was significantly more heterozygous than individual Cd genomes at SNP loci in most of the seed dispersal candidate regions (Table 4), which indicated that these regions were plausible candidates. Differences in Cm allele frequencies among pools (Figure 2) were accounted for by the simulation‐based method for identifying outliers; only a small fraction of the many genes with divergent allele frequencies between Cd and Cm were included in candidate regions. The study design limited our inference to maternal effects, but as squirrel caching decisions appear to be influenced strongly by characteristics of the maternally derived pericarp (Steele et al. 2001), the paternal contribution to differences in dispersal is likely to be small. Finally, the small number of genotypes utilized (three pools derived from 24 individual trees) limits the strength of conclusions drawn from this study because the number of false‐positive candidate loci is inversely related to sample size. The importance of candidate genes that remained after statistical validation was rendered more plausible, however, because their predicted function often corresponded to factors known to influence squirrel caching decisions—seed size and the perception of dormancy. These remaining candidates point to new hypotheses on seed/seed‐disperser coevolution in hardwood trees. Seed size, the most obvious dispersal‐associated phenotype that distinguishes Cd and Cm, is a likely a complex trait in chestnut as it is in other plants (e.g., Gnan, Priest, & Kover, 2014). Several candidate loci identified in this study had annotations that point to a potential role in seed development and seed size. The EMBRYONIC FLOWER 2‐like (EMF) gene on LGC (c.g6050; Table 2) could directly influence seed size by regulation of development of female flower parts. EMF2 in Arabidopsis encodes a Polycomb group protein (Yoshida et al., 2001) that regulates vegetative growth and development by suppressing the flower‐development program, Yoshida et al., 2001). The predicted EMF2 gene in chestnut had strong transcript support from C. mollissima and C. dentata and was one of 5 EMF2‐like genes predicted (by AUGUSTUS) in the entire chestnut genome (Carlson et al., 2014). A candidate gene on LGL (g6184) was similar to GIF2 of Arabidopsis, which regulates the expansion of cotyledons (Kim & Kende, 2004). Cotyledons make up the majority of the mass of a chestnut seed. Several other candidates, including a LATERAL ROOT PRIMORDIUM 1 (Kuusk, Sohlberg, Magnus, & Sundberg, 2006) homolog on LGH, a VERNALIZATION‐3‐like gene (VIL1) on LGA from (Table 2), a gene similar to FRIGIDA‐like 4 of Arabidopsis on LGK, also may function in the regulation of flower development. The latter two loci are both involved in the FLOWERING LOCUS C (FLC) regulatory pathway in Arabidopsis (Greb et al., 2007; Michaels, Bezerra, & Amasino, 2004). Whether homologs of these flowering regulatory loci have effects on the development of floral parts, and thereby influence seed size in chestnut, is uncertain. It is also possible that they affect dispersal by modifying seed dormancy, given that FLC and its interactors have documented pleiotropic effects on the regulation of seed dormancy and germination (Chiang et al. 2009). If such pleiotropic effects exist, changes in seed size due to natural selection could also lead to changes in dormancy.

Table 4
Selected candidate genes with summary of evidence for involvement in seed dispersal. Boldface indicates candidate genes with the strongest evidence
SiteGenePool AaPool BbPool CcHet.d BC1Het.eCdUniProtg% ID
Sd01A.g47140.4880.0260.0670.1200.013DBR_TOBAC64
Sd02A.g86350.7500.0880.2420.1760.062ASR1_SOLLC82
Sd03A.g106480.6500.1450.2060.7420.000LRX3_ARATH77
Sd03A.g106570.5220.0770.1950.5220.042VIL1_ARATH48
Sd04A.g133570.5850.1240.1540.0260.055SEC5A_ARATH57
Sd04A.g133590.4710.1040.0630.0070.043SEC5B_ARATH71
Sd05B.g11180.4700.0930.0820.1080.035GOLS2_ARATH72
Sd06B.g34520.4220.0890.1190.1190.021SUP_ARATH41
Sd06B.g34580.5340.0310.1800.3050.088CSLG2_ARATH36
Sd06B.g34600.5220.0000.1740.2270.068CSLG2_ARATH38
Sd07B.g51640.4490.2890.2600.4590.183UXS2_ARATH70
Sd08C.g30740.4370.0770.0930.2360.042PME31_ARATH79
Sd09C.g37250.4320.3990.0620.7550.008AAPT1_ARATH85
Sd09C.g37280.5160.2870.0610.7870.025AAPT1_ARATH98
Sd10C. g60500.5830.3250.1550.8380.068EMF2_ARATH46
Sd12G.g43660.1750.0000.4830.8570.033ABR1_ARATH55
Sd12G.g43690.4840.0120.3690.550.130AMYA_VIGMU71
Sd12G.g43700.4600.0180.4340.5920.198LAG12_ARATH66
Sd13G.g52140.5570.0660.5800.4460.175DLO2_ARATH39
Sd14H.g59070.6470.0780.0720.0930.018LRP1_ARATH50
Sd15I. g32910.7300.0000.0700.4290.000NLTL5_ARATH33
Sd15I.g33040.5500.0000.0820.6000.000C94A2_VICSA52
Sd16L.g73020.5350.0660.1080.2030.092CESA2_ARATH78
Sd17L.g75560.5170.0500.1290.1600.031PME51_ARATH59
Sd18L.g81910.5160.2360.4190.6710.155NES1_FRAAN61
Sd18L.g81920.5970.1120.3160.7000.022TPS13_RICCO62
Sd18L.g81980.6670.0980.4290.5540.142NES1_FRAVE58
Sd18L.g82080.5270.0780.4550.6670.123NES2_FRAAN60

aPool of individuals with highest mean dispersal distance and largest % of seeds cached rather than consumed; bPool of individuals with lower mean dispersal distance and lower caching %; cPool of seeds that were rarely or never cached; dProportion of heterozygous snps in gene for “Clapper” calculated using Vcftools; eProportion of heterozygous snps in gene for Cd calculated in Vcftools; fPercentile of expected Cm allele frequency distribution based on 1,000,000 simulated pooled genotypes for each pool, averaged over all SNP loci within gene.

John Wiley & Sons, Ltd

4.3Genomic loci associated with differences in seed dispersal: pericarp‐mediated dormancy

In red oak acorns, which are anatomically similar to chestnuts, the pericarp prevents absorption of water by the embryo and allows germination only after the pericarp's permeability has increased following a period of cold storage (Peterson 1983; Steele et al., 2001). The pericarp is derived from the ovary walls of female chestnut flowers and consists of several layers of lignified cells with a waxy coating on the outermost layer. Both the breakdown of this waxy layer and the subsequent release of volatile compounds from the pericarp may serve as olfactory cues for squirrels that a seed is approaching germination and is therefore more perishable and a poor candidate for caching. The candidate genes we identified include some that may be involved in the formation of pericarp cells, some that may influence the composition of waxes on the pericarp surface, and others that may influence the release of volatile compounds from the pericarp. While none of the transcriptomic data from Fagales trees we aligned to the Cm predicted gene set was seed specific, several dispersal candidate genes were supported by cDNA contigs from several species (Table 5).

Table 5
Transcriptome alignments from members of the order Fagales, for selected predicted genes from chestnut genome regions associated with interspecific differences in seed dispersal
GeneUniprotCmaCdbCccCsdQaeQfcFgJrhJniCajArukArhlBetm
A.g1855GWD2_ARATH100n10094.598.710097.9o77.583.6
A.g1991BH030_ARATH10062.365.358.3
A.g3246C94A2_VICSA98.797.387.289.287.695.7
A.g5184BZR1_ARATH10010080.381.597.481.075.075.690.279.774.0
A.g7734GEML5_ARATH69.460.699.699.494.268.070.390.684.470.683.6
A.g8091LHT1_ARATH95.510078.1
A.g10648LRX3_ARATH10010097.998.293.195.195.594.395.794.392.590.3
A.g10657VIL1_ARATH10010010093.196.196.283.083.084.492.489.087.9
A.g11556PME_PRUPE10087.778.584.672.476.8
C.g6050EMF2_ARATH98.998.474.3
E.g7467STC_RICCO94.886.888.191.678.685.977.581.2
E.g8229HPPR_PLESU97.982.372.379.281.884.378.1
F.g927RKS1_ARATH10084.796.092.675.877.168.472.2
G.g5214DLO2_ARATH10099.472.5
H.g315BPG2_ARATH89.694.081.594.581.180.785.185.6
H.g5907LRP1_ARATH79.410080.269.610088.958.363.082.6
I.g3291NLTL5_ARATH89.0100.085.077.350.365.161.859.8
J.g6035AKR_SOYBN10084.972.388.286.480.7
K.g2221FRL4A_ARATH81.910085.586.110084.579.767.965.084.697.370.278.4
L.g6163MTEF8_ARATH10010072.672.568.985.9
L.g6184GIF2_ARATH93.074.598.040.257.170.849.176.7
L.g7302CESA2_ARATH85.610090.088.3100.087.386.084.488.5100.079.160.581.9
L.g8191NES1_FRAAN93.138.9
L.g8192TPS_RICCO100.0
L.g8198NES1_FRAVE92.195.7
L.g8208NES2_FRAAN10090.868.281.281.278.8

aHealthy stems, blight cankers, and whole‐plant tissues (Barakat et al. 2009, 2012) for Chinese chestnut, Castanea mollissima (Cm) and bAmerican chestnut, C. denata (Cd); cRoot transcriptomes for Japanese chestnut, C. crenata (Cc), and dEuropean chestnut, C. sativa (Cs) (Serrazina et al. 2015); Whole‐plant libraries of ewhite oak (Quercus alba; Qa), fnorthern red oak (Quercus rubra; Qr), gAmerican beech (Fagus grandifolia; Fg), hblack walnut (Juglans nigra; Jn), ired alder (Alnus rubra; Aru), and jwhite alder (Alnus rhomboids; Arh) downloaded from hardwoodgenomics.org; kPersian walnut (Juglans regia; Jr) whole‐plant transcriptome data (Martinez‐Garcia et al., 2016); lCommon hazelnut (Corylus avellana; Ca) all‐tissue library (Rowley et al., 2012); mBetula platyphylla (Mu et al., 2012); nPercent amino‐acid identity in a blastx alignment of the predicted chestnut protein with the cDNA transcript; oPredicted protein in chestnut was not the best blastx alignment for any transcript.

John Wiley & Sons, Ltd

Our analysis identified several predicted genes that may have a role in cell wall modification during nut development and ripening in chestnut. These include an extensin‐like predicted gene on LGA (LGA.sd03; g10648), which has a similar protein sequence to an LRX‐family gene in Arabidopsis (Baumberger et al., 2003); the LRX family has been implicated in the modification of plant cell walls (Draeger et al., 2015). Predicted genes similar to pectin methylesterases were identified on LGA (g11556) and LGL (g7556). The latter predicted gene (LGL.Sd17) was similar to a gene in Arabidopsis expressed in developing siliques (seed pods) (Louvet et al., 2006) and could be involved in the formation of the lignified chestnut pericarp. Pecinesterase genes are active in maturing (lignifying) wood in poplar (Mellerowicz, Baucher, Sundberg, & Boerjan, 2001). Conversely, these genes could be directly involved in the germination process: in yellow cedar (Chamaecyparis nootkaensis), pectinesterases are active in germinating seeds (Ren & Kermode, 2000), and Arabidopsis lines with overexpression of a PME inhibitor showed more rapid germination (Müller et al. 2005). Given the importance of the pericarp in squirrel perceptions of seed perishability, these cell‐wall modification genes may influence seed dispersal by acting in the maturing pericarp, rather than in the germinating embryo. Other candidate genes that could influence formation of the pericarp include a cellulose synthase (CESA2)—like gene (Beeckman et al., 2002; Li et al. 2009). Adjacent to the pectinesterase at SD17 on LGL and a gene similar to a sugar transport carrier in castor bean (Ricinus communis) (STC_RICCO; LGE_g7467). Several candidate genes we identified appear to have a role in lipid metabolism that may be related to the formation and/or degradation of pericarp wax layers (Pollard, Beisson, Li, & Ohlrogge, 2008). The potential importance of fatty‐acid metabolic processes in regulating squirrel dispersal was explored by Sundaram (2016), who showed that differences in the outer wax layer of the pericarp influence squirrel perception of seed dormancy. Nonspecific lipid transfer proteins (LGC.Sd15) in Arabidopsis are involved in the formation of suberin in crown galls (Deeken et al., 2016), various tissues of tomato in response to drought stress (Trevino & O'Connell, 1998), and the surface wax of broccoli leaves (Pyee, Yu, & Kolattukudy, 1994). Another lipid‐modifying gene, a cytochrome p450 oxidase (g3304), occurs at the same locus as the NLTL‐like predicted gene on LGC, and a second cytochrome p450 oxidase (LGA_g3246) was identified as a candidate gene on LGA. Based on alignments to the UniProt database, these genes are both similar to a gene from tare (Vicia sativa) that catalyzes hydroxylation of myristic acid and other fatty acids (Le Bouquin, Pinot, Benveniste, Salaun, & Durst, 1999).

Squirrels perceive volatile compounds from seeds as cues of metabolic activity and impending germination (Sundaram, 2016). Volatile compounds are thought to escape the pericarp as it becomes more porous and germination approaches. One particularly interesting locus appeared to contain a cluster of four volatile terpene synthase genes, which are most similar to terpene synthesis genes highly expressed in the fruits of strawberry (Aharoni et al., 2004) that are thought to influence the fruit's flavor and aroma profile. Nerolidol is a sesquiterpene compound found in many plants (Chan, Tan, Chan, Lee, & Goh, 2016). Sundaram (2016) found that release of beta‐amyrin, a triterpene, was associated with germination of chestnuts. While these compounds are distantly related, their synthesis may be metabolically linked by production of the intermediate squalene. Nerolidol synthase converts farnesyl diphosphate (FPP) to nerolidol. In a yeast study, overexpressing FPP synthase and squalene synthase greatly increased beta‐amyrin production (Zhang et al., 2015). Beta‐amyrin has been associated with wax degradation in other plants (Buschhaus & Jetter, 2012), so it could degrade the cuticular waxes of the outer pericarp as it is released, preparing the seed for germination (Sundaram, 2016). The genes found here do not directly influence beta‐amyrin production, but could influence production of substrate molecules or divert carbon away from beta‐amyrin production. Interestingly, of the three nerolidol synthase‐like genes at this locus, only one showed evidence of expression in Chinese chestnut and two others showed evidence of expression in American chestnut and oaks, but not Chinese chestnut (Table 4), possibly indicating interspecific differences in the expression of these genes. There was little evidence of their expression in the non animal‐dispersed taxa examined (Alnus, Betula) nor in Fagus. If the expression of multiple copies of nerolidol synthase in American chestnut leads to an increase in the activity of volatile organic compounds that degrade pericarp waxes, the result in BC3 seeds that express the Cd alleles could be a signal to squirrels to eat rather than cache these seeds. Our results support previous studies (e.g., Smallwood et al., 2001) that point to seed dormancy and germination as a primary influence on squirrel caching decisions. In particular, they support the notion that pericarp waxes and volatile compounds are important for conditioning squirrels’ perceptions of seed dormancy in nut‐bearing trees. The potential evolutionary role of loci with pleiotropic effects on flower development, seed size, and germination in nut‐bearing trees merits further study. The possibility that lipid‐ and secondary metabolite‐synthesis genes expressed in developing pericarp tissues are ultimately important for seed dispersal phenotypes should be investigated in squirrel‐dispersed tree species and interspecific hybrids. The results of the present study need to be validated and clarified using larger numbers of plants and individual, rather than pooled, genotypes. We hypothesize that genes controlling differences in seed dispersal are primarily expressed during flower and seed development and maturation (the formation of the pericarp) rather than during dormancy, when dispersal takes place.

5CONCLUSIONS

The ecological relationship between trees in Fagales and the scatter‐hoarding rodents and birds that disperse and consume their seeds is pivotal for the current canopy composition and future trajectory of forest ecosystems throughout the northern temperate and subtropical zones. As caching decisions made by squirrels determine whether or not a given seed has a chance of germinating and reproducing, the basis of these decisions has likely been a factor in the evolution and diversification of nut‐bearing tree lineages. Our work provides additional evidence that pericarp‐mediated dormancy plays a predominant role in influencing squirrel dispersal of seeds of the same or closely‐related species (Smallwood et al., 2001; Steele et al., 2001) and the first evidence of gene loci under selection in the coevolution of hardwood trees with scatter‐hoarding seed dispersers. The interplay between differences in seed size, seed dormancy, and the role these traits have played in the evolution, diversification, and speciation of nut‐bearing hardwood trees should be further investigated using more robust genome‐scale genotyping and additional interspecific hybrids. Given the evidence for expression of many of the candidate loci in trees in the order Fagales, it is possible that the predicted genes identified here have had a role in diversification and speciation of several nut‐producing lineages. Additional screening of these candidate genes in the chestnut, oak, and other animal‐dispersed Fagales lineages should further elucidate their role in the ecological coevolution of hardwood trees and their coevolved conditional mutualist seed dispersers.

CONFLICT OF INTEREST

None declared.

AUTHOR CONTRIBUTIONS

Dr. Woeste contributed study design and the core questions of the study. Dr. LaBonte carried out seed‐dispersal trials, seed measurements, genotyping, and data analysis.

DATA ACCESSIBILITY

Quality‐filtered Illumina whole‐genome sequencing reads are available as a Sequence Read Archive (https://www.ncbi.nlm.nih.gov/sra) at the addresshttps://www.ncbi.nlm.nih.gov/bioproject/PRJNA474773). Chinese chestnut genome scaffold sequences and predicted proteins are available at hardwoodgenomics.org.

Supporting information

ECE3-8-10638-s001.docxClick here for additional data file.

ACKNOWLEDGMENTS

The research described was funded by a Frederick M. van Eck Research Scholarship at Purdue University. The authors wish to thank Dr. Nate Lichti, Dr. Mekala Sundaram, and Dr. Rob Swihart for their expert advice on squirrel‐feeding experiments, and to thank Jim McKenna and Claude Diehl for their help collecting, cleaning, and measuring chestnuts. We would also like to thank Nathaniel Cannon and Dr. John Carlson for allowing us to use the draft Chinese chestnut reference genome assembly in our analysis, Dr. Laura Georgi of TACF for a “Clapper” DNA sample, and the Purdue Genomics Core facility for their assistance in carrying out sequencing. Mention of a trademark, proprietary product, or vendor does not constitute a guarantee or warranty of the product by the U.S. Department of Agriculture and does not imply its approval to the exclusion of other products or vendors that also may be suitable.

References

  • 1.Aharoni, A.,Giri, A. P.,Verstappen, F. W.,Bertea, C. M.,Sevenier, R.,Sun, Z., …Bouwmeester, H. J.(2004). Gain and loss of fruit flavor compounds produced by wild and cultivated strawberry species. The Plant Cell, 16, 31103131. [PubMed][Google Scholar]
  • 2.Barakat, A.,DiLoreto, D. S.,Zhang, Y.,Smith, C.,Baier, K.,Powell, W. A.,Wheeler, N.,Sederoff, R., &Carlson, J. E.(2009). Comparison of the transcriptomes of American chestnut (Castanea dentata) and Chinese chestnut (C. mollissima) in response to the chestnut blight infection. BMC Plant Biology, 9, 51[PubMed][Google Scholar]
  • 3.Barakat, A.,Staton, M.,Cheng, C.‐H., et al. (2012). Chestnut resistance to the blight disease: insights from transcriptome analaysis. BMC Plant Biology, 12, 38[PubMed][Google Scholar]
  • 4.Baskin, C. C., &Baskin, J. M.(1998). Seeds: Ecology, biogeography, and evolution of dormancy and germination (p. 666). New York City, NY: Elsevier.
  • 5.Baumberger, N.,Doesseger, B.,Guyot, R.,Diet, A.,Parsons, R. L.,Clark, M. A., …Keller, B.(2003). Whole‐genome comparison of leucine‐rich repeat extensions in Arabidopsis and rice. A conserved family of cell wall proteins form a vegetative and reproductive clade. Plant Physiology, 131(3), 13131326. [PubMed][Google Scholar]
  • 6.Beeckman, T.,Przemeck, G. K.,Stamatiou, G.,Lau, R.,Terryn, N.,De Rycke, R., …Berleth, T.(2002). Genetic complexity of cellulose synthase a gene function in Arabidopsis embryogenesis. Plant Physiology, 130(4), 18831893. [PubMed][Google Scholar]
  • 7.Blythe, R. M.,Lichti, N. I.,Smyser, T. J., &Swihart, R. K.(2015). Selection, caching and consumption of hardwood seeds by forest rodents: Implications for restoration of American chestnut. Restoration Ecology, 23(4), 473481. [PubMed][Google Scholar]
  • 8.Brodin, A.(2010). The history of scatter hoarding studies. Philosophical Transactions of the Royal Society B: Biological Sciences, 365, 869881. [PubMed][Google Scholar]
  • 9.Buchfink, B.,Xie, C., &Huson, D.(2015). Fast and sensitive protein alignment using DIAMOND. Nature Methods, 12, 5960. [PubMed][Google Scholar]
  • 10.Buschhaus, C., &Jetter, R.(2012). Composition and physiological function of the wax layers coating Arabidopsis leaves: B‐Amyrin negatively affects the intracuticular water barrier. Plant Physiology, 160(2), 11201129. [PubMed][Google Scholar]
  • 11.Calhane, V. H.(1942). Caching and recovery of food by the western fox squirrel. The Journal of Wildlife Management, 6(4), 338352. [PubMed][Google Scholar]
  • 12.Carlson, J. E.,Cannon, N.,Staton, M. E.,Henry, N.,Islam‐Faridi, N.,Addo‐Quaye, C., …Hebard, F. V.(2014). The Chinese chestnut genome and QTL sequences, V1.0, January 2014. Retrieved from http://www.hardwoodgenomics.org/chinese-chestnut-genome
  • 13.Chan, W.‐K.,Tan, L. T.‐H.,Chan, K.‐G.,Lee, L.‐H., &Goh, B.‐H.(2016). Nerolidol: A sesquiterpene alcohol with multi‐faceted pharmacological and biological activities. Molecules, 21(5), 529[PubMed][Google Scholar]
  • 14.Chiang, G. C. K.,Barua, D.,Kramer, E. M.,Amasino, R. M., &Donohue, K.(2009). Major flowering time gene, FLOWERING LOCUS C, regulates seed germination in Arabidopsis thaliana. Proceedings of the National Academy of Sciences USA, 106(28), 1166111666. [PubMed][Google Scholar]
  • 15.Connor, K.,Donahoo, J., &Schafer, G.(2006).How does prolonged exposure to natural conditions affect acorn moisture and viability?In ConnorK. F. (Ed.), 2006. Proceedings of the 13th biennial southern silvicultural research conference. Gen. Tech. Rep. SRS–92. Asheville, NC: U.S. Department of Agriculture, Forest Service, Southern Research Station640 p.
  • 16.Deeken, R.,Saupe, S.,Klinkenberg, J.,Riedel, M.,Leide, J.,Hedrich, R., &Mueller, T. D.(2016). The nonspecific lipid transfer protein AtLtpl‐4 is involved in suberin formation of Arabidopsis thaliana crown galls. Plant Physiology, 172(3), 19111927. [PubMed][Google Scholar]
  • 17.DePristo, M.,Banks, E.,Garimella, K., et al. (2011). A framework for variation discovery and genotyping using next‐generation DNA sequencing data. Nature Genetics, 43, 491498. [PubMed][Google Scholar]
  • 18.Doyle, J. J., &Doyle, J. L.(1987). A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemistry Bulletin, 19, 1115.[Google Scholar]
  • 19.Draeger, C.,Ndinyanka Fabrice, T.,Gineau, E.,Mouille, G.,Kuhn, B. M.,Moller, I., …Ringli, C.(2015). Arabidopsis leucine‐rich repeat extension (LRX) proteins modify cell wall composition and influence plant growth. BMC Plant Biology, 15, 155[PubMed][Google Scholar]
  • 20.Ertan, E.,Erdal, E.,Gulsum, A., &Algul, B. E.(2015). Effects of different postharvest storage methods on the quality parameters of chestnuts (Castanea sativa Mill.). HortScience, 50(4), 577581.[Google Scholar]
  • 21.Fox, J. F.(1982). Adaptation of gray squirrel behavior to autumn germination by white oak acorns. Evolution, 36(4), 800809. [PubMed][Google Scholar]
  • 22.Friis, E. M.,Crane, P. R., &Pederson, K. R.(2011). Early flowers and angiosperm evolution. Cambridge, UK: Cambridge University Press[PubMed]
  • 23.Gnan, S.,Priest, A., &Kover, P. X.(2014). The genetic basis of natural variation in seed size and seed number and their trade‐off using Arabidopsis thaliana MAGIC lines. Genetics, 198(4), 17511758. [PubMed][Google Scholar]
  • 24.Greb, T.,Mylne, J. S.,Crevillen, P.,Geraldo, N.,An, H.,Gendall, A. R., &Dean, C.(2007). The PHD finger protein VRN5 functions in the epigenetic silencing of Arabidopsis FLC. Current Biology, 17, 7378. [PubMed][Google Scholar]
  • 25.Hirsch, B. T.,Kays, R., &Jansen, P. A.(2012). A telemetric thread tag for tracking seed dispersal by scatter‐hoarding rodents. Plant Ecology, 213, 933943. [PubMed][Google Scholar]
  • 26.Howe, H. F., &Smallwood, J.(1982). Ecology of seed dispersal. Annual Review of Ecology and Systematics, 13, 201228. [PubMed][Google Scholar]
  • 27.Jacobs, D. F.(2007). Toward development of silvical strategies for forest restoration of American chestnut (Castanea dentata) using blight‐resistant hybrids. Biological Conservation, 137(4), 497506. [PubMed][Google Scholar]
  • 28.Jacobs, D. F.,Dalgleish, H. J., &Nelson, C. D.(2012). A conceptual framework for restoration of threatened plants: The effective model of American chestnut (Castanea dentata) reintroduction. New Phytologist, 197, 378393.[PubMed][Google Scholar]
  • 29.Jansen, P. A.,Bartholomeus, M.,Bongers, F.,Elzinga, J. A.,den Ouden, J., &Van Wieren, S. E.(2002). Chapter 14: The role of seed size in dispersal by a scatter‐hoarding rodent. Seed Dispersal and Frugivory: Ecology, Evolution, and Conservation, p. 209226.[Google Scholar]
  • 30.Janzen, D. H.(1971). Seed predation by animals. Annual Review of Ecology and Systematics, 2, 465492. [PubMed][Google Scholar]
  • 31.Jaynes, R. A.(1963). Biparental determination of nut characters in Castanea. Journal of Heredity, 54(2), 8488. [PubMed][Google Scholar]
  • 32.Johnson, W. C., &Webb, T.III(1989). The role of blue jays (Cyanocitta cristata L.) in the postglacial dispersal of fagaceous trees in eastern North America. Journal of Biogeography, 16(6), 561571. [PubMed][Google Scholar]
  • 33.Kim, J. H., &Kende, H.(2004). A transcriptional coactivator, AtGIF1, is involved in regulating leaf growth and morphology in Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America, 101, 1337413379. [PubMed][Google Scholar]
  • 34.Kuusk, S.,Sohlberg, J. J.,Magnus, E. D., &Sundberg, E.(2006). Functionally redundant SHI family genes regulate Arabidopsis gynoecium development in a dose‐dependent manner. The Plant Journal, 47, 99111. [PubMed][Google Scholar]
  • 35.Larson‐Johnson, K.(2015). Phylogenetic investigation of the complex evolutionary history of dispersal mode and diversification rates across living and fossil Fagales. New Phytologist, 209(1), 418435.[PubMed][Google Scholar]
  • 36.Le Bouquin, R.,Pinot, F.,Benveniste, I.,Salaun, J. P., &Durst, F.(1999). Cloning and functional characterization of CYP94A2, a medium chain fatty acid hydroxylase from Vicia sativa. Biochemical and Biophysical Research Communications, 261(1), 156162. [PubMed][Google Scholar]
  • 37.Leopold, D. J.,McComb, W. C., &Muller, R. N.(1998). Trees of the Central Hardwood Forests of North America: An identification and cultivation guide (p. 509). Portland, OR: Timber Press p. 148149.
  • 38.Leprince, O.,Buitnik, J., &Hoekstra, F. A.(1999). Axes and cotyledons of recalcitrant seeds of Castanea sativa Mill. exhibit contrasting responses of respiration to drying in relation to desiccation sensitivity. Journal of Experimental Botany, 50(338), 15151524. [PubMed][Google Scholar]
  • 39.Li, H., &Durbin, R.(2009). Fast and accurate short read alignment with Burrows‐Wheeler transform. Bioinformatics, 35, 17541760. [PubMed][Google Scholar]
  • 40.Li, M.,Xiong, G.,Li, R.,Cui, J.,Tang, D.,Zhang, B., …Zhou, Y.(2009). Rice cellulose synthase‐like D4 is essential for normal cell‐wall biosynthesis and plant growth. The Plant Journal, 60(6), 10551069. [PubMed][Google Scholar]
  • 41.Lichti, N. I.,Steele, M. A., &Swihart, R. K.(2017). Seed fate and decision‐making processes in scatter‐hoarding rodents. Biological Reviews, 92(1), 474504. [PubMed][Google Scholar]
  • 42.Lichti, N. I.,Steele, M. A.,Zhang, H., &Swihart, R. K.(2014). Mast species composition alters seed fate in North American rodent‐dispersed hardwoods. Ecology, 95(7), 17461758. [PubMed][Google Scholar]
  • 43.Louvet, R.,Cavel, E.,Gutierrez, L.,Guenin, S.,Roger, D.,Gillet, F., …Pelloux, J.(2006). Comprehensive expression profiling of the pectin methylesterase gene family during silique development in Arabidopsis thaliana. Planta, 224, 782791. [PubMed][Google Scholar]
  • 44.Martinez‐Garcia, P. J.,Crepeau, M. W.,Puiu, D.,Gonzalez‐Ibeas, D.,Whalen, J.,Stevens, K. A., …Neale, D. B.(2016). The walnut (Juglans regia) genome sequence reveals diversity in genes coding for the biosynthesis of non‐structural polyphenols. The Plant Journal, 87(5), 507532. [PubMed][Google Scholar]
  • 45.Mellerowicz, E. J.,Baucher, M.,Sundberg, B., &Boerjan, W.(2001). Unravelling cell wall formation in the woody dicot stem. Plant Molecular Biology, 47, 239274. [PubMed][Google Scholar]
  • 46.Metaxas, A.(2013). Chestnut (Castanea spp.) cultivar evaluation for commercial chestnut production in Hamilton County, Tennessee. M.S.E.S Thesis, University of Tennessee at Chattanooga, 135 pp.
  • 47.Michaels, S. D.,Bezerra, I. C., &Amasino, R. M.(2004). FRIGIDA‐related genes are required for the winter‐annual habit in Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America, 101, 32813285. [PubMed][Google Scholar]
  • 48.Moore, J. E.,McEuen, A. B.,Swihart, R. K.,Contreras, T. A., &Steele, M. A.(2007). Determinants of seed removal distance by scatter‐hoarding rodents in deciduous forests. Ecology, 88(10), 25292540. [PubMed][Google Scholar]
  • 49.Mu, H.‐Z.,Liu, Z.‐J.,Lin, L.,Li, H.‐Y.,Jiang, J., &Liu, G.‐F.(2012). Transcriptomic analysis of phenotypic changes in birch (Betula platyphylla) autotetraploids. International Journal of Molecular Science, 13, 1301213029. [PubMed][Google Scholar]
  • 50.Müller, K.,Levesque‐Tremblay, G.,Bartels, S.,Weitbrecht, K.,Wormit, A.,Usadel, B., …Kermode, A. R.(2005). Demethylesterification of cell wall pectins in Arabidopsis plays a role in seed germination. Plant Physiology, 161, 305316.[Google Scholar]
  • 51.Paulsen, T. R.,Hogstedt, G.,Thompson, K.,Vandvik, V.,Eliassen, S., &Leishman, M.(2014). Conditions favoring hard seededness as a dispersal and predator escape strategy. Journal of Ecology, 102(6), 14751484. [PubMed][Google Scholar]
  • 52.Pereira‐Lorenzo, S.,Costa, R.,Anagnostakis, S.,Serdar, U.,Yamamoto, T.,Saito, T., …Carlson, J. E.(2016).Chapter 15‐ Interspecific hybridization of chestnutIn MasonAnnaliese S. (Ed.), Polyploidy and hybridisation for crop improvement (pp. 377407). Boca Raton, FL: CRC Press.
  • 53.Peterson, J. K.(1983). Mechanisms involved in delayed germination of Quercus nigra L. seeds. Annals of Botany, 52(1), 8192. [PubMed][Google Scholar]
  • 54.Pollard, M.,Beisson, F.,Li, Y., &Ohlrogge, JB.(2008). Building lipid barriers: Biosynthesis of cutin and suberin. Trends in Plant Science, 13(5), 236246. [PubMed][Google Scholar]
  • 55.Pyee, J.,Yu, H. S., &Kolattukudy, P. E.(1994). Identification of a lipid transfer protein as the major protein in the surface wax of broccoli (Brassica oleracea) leaves. Archives of Biochemistry and Biophysics, 311(2), 460468. [PubMed][Google Scholar]
  • 56. R Core Team(2015). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing Retrieved from https://www.R-project.org/
  • 57.Reimand, J.,Arak, T.,Adler, P.,Kolberg, L.,Reisberg, S.,Peterson, H., &Vilo, J.(2016). g:Profiler‐ a web server for functional interpretation of gene lists (2016 update). Nucleic Acids Research, 44(W1), W83W89. [PubMed][Google Scholar]
  • 58.Ren, C., &Kermode, A. R.(2000). An increase in pectin methyl esterase activity accompanies dormancy breakage and germination of yellow cedar seeds. Plant Physiology, 124(1), 231242. [PubMed][Google Scholar]
  • 59.Richardson, K. B.,Licthi, N. I., &Swihart, R. K.(2013). Acorn‐foraging preferences of four species of free‐ranging avian seed predators in eastern deciduous forests. The Condor, 115(4), 863873. [PubMed][Google Scholar]
  • 60.Roach, T.,Beckett, R. P.,Minibayeva, F. V.,Colville, L.,Whitaker, C.,Chen, H., …Kranner, I.(2009). Extracellular superoxide production, viability, and redox poise in response to desiccation in recalcitrant Castanea sativa seeds. Plant, Cell & Environment, 33, 5975.[Google Scholar]
  • 61.Saielli, T. M.,Schaberg, P. G.,Hawley, G. J.,Halman, J. M., &Gurney, K. M.(2012). Nut cold hardiness as a factor influencing the restoration of American chestnut in northern latitudes and high elevations. Canadian Journal of Forest Research, 42(5), 859857.[Google Scholar]
  • 62.Schlötterer, C.,Tobler, R.,Kofler, R., &Nolte, V.(2014). Sequencing pools of individuals‐ mining genome‐wide polymorphism data without big funding. Nature Reviews Genetics, 15(11), 749763. [PubMed][Google Scholar]
  • 63.Schupp, E. W.,Jordano, P., &Gomez, J. M.(2010). Seed dispersal effectiveness revisited: A conceptual review. New Phytologist, 188, 333353. [PubMed][Google Scholar]
  • 64.Serrazina, S. M. T.,Santos, C.,Machado, H.,Pesquita, C.,Vicentini, R.,Pais, M.,Sebastiana, M., &Costa, R. L.(2015). Castanea root transcriptome in response to Phytophthora cinnamomi challenge. Tree Genetics and Genomes, 11(1) . [PubMed][Google Scholar]
  • 65.Smallwood, P. D.,Steele, M. A., &Faeth, S. H.(2001). The ultimate basis of the caching preferences of rodents, and the oak‐dispersal syndrome: Tannins, insects, and seed germination. American Zoologist, 41, 840851.[Google Scholar]
  • 66.Smith, C. C., &Stapanian, M. A.(2002).Squirrels and oaksIn McSheaW. J., & HealyW. M. (Eds.), The Ecology and Management of Oaks for Wildlife (pp. 256266). Baltimore, MD: Johns Hopkins University Press.
  • 67.Stapanian, M. A., &Smith, C. C.(1978). A model for seed scatterhoarding: Coevolution of fox squirrels and black walnuts. Ecology, 59, 884896. [PubMed][Google Scholar]
  • 68.Stanke, M.,Schoeffmann, O.,Morgenstern, B., &Waack, S.(2006). Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics, 7, 62[PubMed][Google Scholar]
  • 69.Staton, M. E.,Addo‐Quaye, C.,Cannon, N.,Tomsho, L. P.,Drautz, D.,Wagner, T. K., …Carlson, J. E.(2014). The Chinese chestnut (Castanea mollissima) genome version 1.1. Retrieved from http://www.hardwoodgenomics.org/chinese-chestnut-genome
  • 70.Steele, M. A.,Smallwood, P. D.,Spunar, A., &Nelsen, E.(2001). The proximate basis of the oak dispersal syndrome: Detection of seed dormancy by rodents. American Zoologist, 41, 852864.[Google Scholar]
  • 71.Sundaram, M.(2016). The role of seed attributes in eastern gray squirrel foraging. Ph.D. Disseration, Purdue University, West Lafayette, IN.
  • 72.Sundaram, M.,Willoughby, J. R.,Licthi, N. I.,Steele, M. A., &Swihart, R. K.(2015). Segregating the effects of seed traits and common ancestry of hardwood trees on eastern gray squirrel foraging decisions. PLoS ONE, 10, e0130942[PubMed][Google Scholar]
  • 73.Tamura, N.,Hashimoto, Y., &Hayashi, F.(1999). Optimal distances for squirrels to transport and hoard walnuts. Animal Behaviour, 58, 635642. [PubMed][Google Scholar]
  • 74.Tamura, N., &Hayashi, F.(2008). Geographic variation in walnut seed size correlates with hoarding behavior of two rodent species. Ecological Research, 23, 607614. [PubMed][Google Scholar]
  • 75.Theimer, T. C.(2005).Chapter 17. Rodent scatterhoarders as conditional mutualistsIn ForgetP. M., LambertJ. E., HulmeP. E. & Vander WallS. B. (eds.), Seed fate (p. 283296). Wallingford, UK: CAB International.
  • 76.Thompson, D. C., &Thompson, P. S.(1980). Food habits and caching behavior of urban grey squirrels. Canadian Journal of Zoology, 58(5), 701710. [PubMed][Google Scholar]
  • 77.Thomson, F. J.,Moles, A. T.,Auld, T. D., &Kingsford, R. T.(2011). Seed dispersal distance is more strongly correlated with plant height than with seed mass. Journal of Ecology, 99(6), 12991307. [PubMed][Google Scholar]
  • 78.Trevino, M. B., &O'Connell, M. A.(1998). Three drought‐responsive members of the nonspecific lipid‐transfer protein gene family in Lycopersicon pennellii show different developmental patterns of expression. Plant Physiology, 116(4), 14611468. [PubMed][Google Scholar]
  • 79.Van der Auwera, G. A.,Carneiro, M.,Hartl, C., et al. (2013). From FastQ data to high‐confidence variant calls: the Genome Analysis Toolkit Best Practices Pipeline. Current Protocols in Bioinformatics, 43, 11.10.111.10.33.[PubMed][Google Scholar]
  • 80.Vander Wall, S. B.(2001). The evolutionary ecology of nut dispersal. Botanical Review, 67(1), 74117. [PubMed][Google Scholar]
  • 81.Vander Wall, S. B.,Kuhn, K. M., &Beck, M. J.(2005). Seed removal, seed predation, and secondary dispersal. Ecology, 86(3), 801806. [PubMed][Google Scholar]
  • 82.Woeste, K.,McGranahan, G., &Bernatzky, R.(1998). Low correlation between genomic and morphological introgression estimates in a walnut backcross. Journal of the American Society for Horticultural Science, 123, 258263.[Google Scholar]
  • 83.Worthen, L. M.,Woeste, K. E., &Michler, C. H.(2010). Breeding American chestnuts for blight resistance. Plant Breeding Reviews, 33, 305339.[Google Scholar]
  • 84.Xenarios, I.(2016). UniProtKB/Swiss‐Prot, the manually annotated section of the UniProt KnowledgeBase: How to use the entry view. Methods in Molecular Biology, 1374, 2354.[PubMed][Google Scholar]
  • 85.Xiao, Z.,Gao, X.,Jiang, M., &Zhang, Z.(2009). Behavioral adaptation of Pallas's squirrels to germination schedule and tannins in acorns. Behavioral Ecology, 20, 10501055. [PubMed][Google Scholar]
  • 86.Xiao, Z.,Gao, X.,Steele, M. A., &Zhang, Z.(2009). Frequency‐dependent selection by tree squirrels: Adaptive escape of nondormant white oaks. Behavioral Ecology, 21, 169175. [PubMed][Google Scholar]
  • 87.Xiao, Z.,Gao, X., &Zhang, S.(2013). Sensitivity to seed germination schedule by scatter‐hoarding Pere David's rock squirrels during mast and non‐mast years. Ethology, 119(6), 472479. [PubMed][Google Scholar]
  • 88.Xiao, Z.,Jansen, P. A., &Zhang, Z.(2006). Using seed‐tagging methods for assessing post‐dispersal seed fate in rodent‐dispersed trees. Forest Ecology and Management, 223, 1823. [PubMed][Google Scholar]
  • 89.Xiao, S.,Zhang, Z., &Wang, Y.(2005). Effects of seeds size on dispersal distance in five rodent‐dispersed fagaceous species. Acta Oecologica, 28, 221229. [PubMed][Google Scholar]
  • 90.Yoshida, N.,Yanai, Y.,Chen, L.,Kato, Y.,Hiratsuka, J.,Miwa, T., …Takahashi, S.(2001). EMBRYONIC FLOWER2, a novel polycomb group protein homolog, mediates shoot development and flowering in Arabidopsis. The Plant Cell, 13, 24712481. [PubMed][Google Scholar]
  • 91.Zhang, G.,Cao, Q.,Liu, J.,Liu, B.,Li, J., &Li, C.(2015). Refactoring beta‐amyrin synthesis in Saccharomyces cerevisiae. AIChE Journal, 61(10), 31723179. [PubMed][Google Scholar]
  • 92.Zwolak, R., &Crone, E. E.(2012). Quantifying the outcome of plant‐granivore interactions. Oikos, 121, 2027. [PubMed][Google Scholar]
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.