
Analysis of the Rdr1 gene family in different Rosaceae genomes reveals an origin of an R-gene cluster after the split of Rubeae within the Rosoideae subfamily


Authors: Ina Menz aff001;  Deepika Lakhwani aff002;  Jérémy Clotault aff002;  Marcus Linde aff001;  Fabrice Foucher aff002;  Thomas Debener aff001
Authors’ place of work: Institute for Plant Genetics, Leibniz Universität Hannover, Hannover, Germany aff001;  IRHS, Agrocampus-Ouest, INRA, Université d’Angers, Beaucouzé, France aff002
Published in the journal: PLoS ONE 15(1)
Category: Research Article
doi: https://doi.org/10.1371/journal.pone.0227428

Summary

The Rdr1 gene confers resistance to black spot in roses and belongs to a large TNL gene family, which is organized in two major clusters at the distal end of chromosome 1. We used the recently available chromosome scale assemblies for the R. chinensis ‘Old Blush’ genome, re-sequencing data for nine rose species and genome data for Fragaria, Rubus, Malus and Prunus to identify Rdr1 homologs from different taxa within Rosaceae. Members of the Rdr1 gene family are organized into two major clusters in R. chinensis and at a syntenic location in the Fragaria genome. Phylogenetic analysis indicates that the two clusters existed prior to the split of Rosa and Fragaria and that one cluster has a more recent origin than the other. Genes belonging to cluster 2, such as the functional Rdr1 gene muRdr1A, were subject to a faster evolution than genes from cluster 1. As no Rdr1 homologs were found in syntenic positions for Prunus persica, Malus x domestica and Rubus occidentalis, a translocation of the Rdr1 clusters to the current positions probably happened after the Rubeae split from other groups within the Rosoideae approximately 70–80 million years ago during the Cretaceous period.

Keywords:

Plant genomics – Genome analysis – Sequence alignment – Phylogenetic analysis – genomic medicine – Amino acid sequence analysis – Homologous chromosomes – Roses

Introduction

Roses, together with species from the genera Fragaria, Rubus, Potentilla, and Malus, belong to the family Rosaceae and are therefore related to economically important fruit crops such as apple and peach [1, 2]. The genus Rosa, which includes approximately 200 species, shows a complex evolutionary history due to frequent hybridizations, multiple polyploidizations and recent radiation. The genus is subdivided into four subgenera (Hulthemia, Platyrhodon, Hesperhodos and Rosa) by some authors, whereas others question the subgeneric status of Hulthemia and Platyrhodon and propose to include them in the subgenus Rosa. The subgenus Rosa itself includes up to 10 sections and approximately 95% of the species [1–5].

Until recently, only fragmented rose genomes were available [6,7]. Two chromosome-scale reference sequences for the diploid Rosa chinensis cultivar ‘Old Blush’ have since been published [8,9]. The two sequences were obtained independently from cell cultures regenerated from microspores and therefore represent haploid segregants of the diploid cultivar ‘Old Blush’. The rose genome displays extensive synteny with the Fragaria vesca genome with only two major rearrangements [9]. Synteny between Fragaria and Rosa genes has been observed for TNL genes (TIR (Drosophila Toll and mammalian interleukin (IL)-1 receptors), NBS (nucleotide-binding site) and LRR (leucine-rich repeat)) [10].

NBS-LRR genes, which include TNLs and CNLs (CC (coiled-coil)-NBS-LRR), are the largest classes of R-genes in plants. They are characterized by three domains with different functions: the N-termini are thought to be involved in protein-protein interactions, the NBS domain is required for ATP (adenosine triphosphate) binding and hydrolysis, and the LRR-domain is involved in protein-protein interactions and ligand binding [11–14]. NBS-LRR genes have been detected in organisms from green algae to flowering plants and often occur in clusters of related paralogues or as single loci. The number of NBS coding genes in the genome varies widely among different species within the dicots and monocots. Whereas CNLs are found in both monocots and dicots, TNLs occur only in dicots [15]. Among the dicots, the Caricaceae (Carica papaya: 54) and Cucurbitaceae (Cucumis sativus: 59–71, C. melo: 80, C. lanatus: 45) families have very low numbers of NBS-encoding genes, whereas the number seems to be greater for some members of Rosaceae (198 NBS genes in F. vesca [16] and up to 1303 NBS genes in Malus x domestica [17,18]). The number of NBS-encoding genes also varies considerably within species, as shown for Oryza sativa lines (328–1120 NBS genes) or Gossypium herbaceum (268–1465 NBS genes) [19]. Different evolutionary dynamics have been postulated, with some clusters comprising fast-evolving genes and others comprising slow-evolving genes [20,14].

In grapevine and poplar, the number of NBS-LRR genes in multi-gene clusters varies between 2 and 10 (mean 4.43) and between 2 and 23 (mean 5.33), respectively [21]. In Medicago truncatula approximately 50% of NBS domains occur in clusters of at least five genes; the largest cluster (14 genes) occurs on chromosome 6, with a sliding window size of 100 kb. The phylogenetic tree for the 333 non-redundant NBS-LRRs of M. truncatula showed that most groups were dominated by sequences from one chromosome and usually from one or a small number of genomic clusters [22]. Molecular characterization of the soybean Rsv3 resistance locus against multiple soybean mosaic virus strains revealed a cluster of seven highly homologous CNL genes intermixed with 16 other genes in the genotype Williams 82. All seven were also identified in the same order in the genotype Zaoshu 18. The five most likely resistance gene candidates (NBS_A-E) were also sequenced in ten additional soybean cultivars and showed very high sequence similarities [23].

In a R. multiflora hybrid (88/124-46), the single dominant TNL gene Rdr1 (muRdr1A), a member of a multigene family of at least nine highly similar clustered TNLs (muRdr1A-muRdr1I), confers broad-spectrum resistance against black spot [24,25]. The sizes of all TNLs for the Rdr1 locus, except muRdr1D (interrupted by 6957-bp transposable-element insertion within intron), range from 4085 to 5920 bp with sequence similarities between 78.0% and 99.5%. The domain structure of typical TNL proteins is reflected by the following intron-exon structure: the first exon contains the TIR domain, the second exon contains the NBS domain, and the fourth (or in case of TNL–muRdr1I, the third exon) contains an LRR domain [25].

A region from R. rugosa (subsection Cinnamomeae), homologous to the Rdr1 locus in R. multiflora (subsection Synstylae), was identified with a high degree of synteny that included some flanking non-TNL genes coding for a yellow stripe-like protein, ubiquitin and a TOPLESS-RELATED protein [10]. An analysis of 20 TIR-NBS-LRR (TNL) genes obtained from R. rugosa and R. multiflora revealed illegitimate recombination, gene conversion, unequal crossing over, indels, point mutations and transposable elements as mechanisms of diversification. Additionally, an orthologous locus in F. vesca (strawberry) was identified that contains a homologous TNL gene family and the flanking genes. In contrast, in Prunus persica (peach) and Malus x domestica (apple), only the flanking genes can be detected in syntenic positions, and the genes homologous to the Rdr1 family are distributed on two different chromosomes. Phylogenetic analysis of TNL genes from five Rosaceae species showed that most of the genes occur in single species clades, indicating that recent TNL gene diversification began prior to the split of the Rosoideae (Rosa, Fragaria) from the Spiraeoideae (Malus, Prunus) [26,18]; however, the genomic positions of individual genes were not considered in much detail.

With the availability of chromosome-scale assemblies of the R. chinensis ‘Old Blush’ genome, we were interested in analysing the full complexity of the Rdr1 gene family at the genomic level, including the number of paralogues and their positions in the genome. Furthermore, we tried to elucidate the advent of this TNL family in the Rosaceae with respect to its emergence in the taxonomic lineage leading to the genus Rosa and its evolutionary dynamics after the taxonomic separation of Fragaria and Rosa as well as within the genus Rosa. For this, we used data from different taxonomic levels, including re-sequencing data for nine rose species recently published along with the ‘Old Blush’ genome sequences.

Results

Rdr1 homologs in ‘Old Blush’ and F. vesca

The screening of the haploid genomes derived from ‘Old Blush’ for TNLs homologous to Rdr1 from R. multiflora resulted in seven complete TNLs for HapOB1 (OB1-A-G) and 21 for HapOB2 (OB2-A-U). A comparative analysis of TNLs from HapOB1 and HapOB2 showed that the following are identical: OB1-B and OB2-D, OB1-C and OB2-I and OB1-D and OB2-G. The sequences of all Rdr1 homologs are listed in S1 Dataset.

Phylogenetic analysis of the TNLs from R. multiflora, HapOB1 and HapOB2 using the maximum likelihood method resulted in the tree shown in S1 Fig. The phylogram shows that a group of three TNLs (OB1-G, OB2-T, OB2-U) is clearly separated from all other TNLs. OB1-G is located on chromosome 5, and OB2-T and OB2-U are located on chromosome 2. All other TNLs from HapOB1 and HapOB2 are located on chromosome 1 and are clustered in two distinct groups (1 and 2) that are highly supported by a bootstrap value of 100%. TNLs from the R. multiflora Rdr1 cluster are clustered in group 2. The genomic organization of HapOB1- and HapOB2-TNLs on chromosome 1 is shown in Fig 1. For HapOB2, all but three (OB2-A, -B, -I) of the 19 complete TNLs on chromosome 1 are organized in two clusters at the distal end of the chromosome. Cluster 1 (with a size of 76 kb) contains 37 protein-coding genes, of which 28 displayed significant similarities to entries in the GenBank database, including six complete TNL genes (OB2-C to OB2-H) and some truncated TNL genes (three TIR-domains, one LRR-domain and two NBS-LRR genes). Cluster 2 (with a size of 163 kb) contains 28 protein-coding genes, of which 23 displayed significant similarities to entries in the GenBank database, including ten TNL genes (OB2-J to OB2-S) and one additional LRR-domain. TNLs from HapOB1 are also organized in two clusters at the distal end of chromosome 1. Gene prediction identified three TNLs (OB1-B through OB1-D) for cluster 1 and two TNLs (OB1-E and OB1-F) for cluster 2. Additionally, two TIR-domains, two NBS-LRR genes and one LRR-domain could be found within the cluster.

Fig. 1. Genomic organization for Rdr1 homologs in HapOB1, HapOB2, Fragaria and Rubus.
Shown are: chromosome 1 of HapOB1 and HapOB2 with the upper cluster 1 (OB1-B through OB1-D; OB2-C through OB2-H) and the lower cluster 2 (OB1-E and OB1-F; OB2-J through OB2-S); chromosome 7 of F. vesca with cluster 1 (F.ve-1 through F.ve-8) and cluster 2 (F.ve-9 through F.ve-17); and chromosome 7 of Rubus occidentalis (no Rdr1 homologs found). Positions of three syntenic genes (glucan synthase-like 3, RING/U-box superfamily protein, protein kinase superfamily protein) and the Rdr1 flanking genes YSL (yellow-stripe-like protein), Ubiquitin and TOPLESS-RELATED protein are shown in grey.

To determine the reasons for the unusually small number of Rdr1-TNLs at the two cluster positions in the HapOB1 genome, we analysed DNA from the haploid tissue that had been used for sequencing the HapOB2 genome as well as DNA from the original diploid OB cultivar with the Rd1LRR microsatellite marker from the LRR region of the gene family. Seven of the 19 genes of HapOB2 contained perfect primer binding sites and were detected on high-resolution polyacrylamide gels, whereas 21 fragments were detected in DNA of the diploid OB (S2 Fig). The small number of Rdr1 genes in the HapOB1 genome is likely to be an artefact, possibly caused by an assembly problem or by recombination events prior to the isolation of the independent haploid callus line from microspores. Therefore, the HapOB1 sequence was not considered in further analyses.

The genomic organization of the TNLs on the chromosome in the two clusters corresponds to the two groups formed in the phylogenetic tree shown in S1 Fig. OB2-C through OB2-H are clustered in group 1, whereas OB2-J through OB2-S are clustered in group 2.

Analysis of the genes surrounding the clusters revealed a high level of synteny between HapOB1, HapOB2 and F. vesca (S1 Table).

The separation of the clusters in two different groups in the phylogenetic tree is further supported by a number of diagnostic sites in the derived amino acid sequences. At two positions (90 and 166), sequences of groups 1 and 2 display unique differences. At three additional positions (348, 688 and 868), one of the two groups displays unique amino acids that are replaced by two or more different sites in the other group.
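The two kinds of diagnostic site described above can be illustrated with a minimal Python sketch. It is not the procedure used in this study; the function name and the toy alignment are hypothetical, and gap handling is ignored. A site is treated as "fixed" when each group carries a single residue and the two residues differ, and as "semi-fixed" when one group carries a single residue that never occurs in the other, variable group.

```python
def diagnostic_sites(group1, group2):
    """Classify alignment columns (1-based) that separate two groups.

    fixed:      both groups carry one residue each, and the residues differ.
    semi_fixed: one group carries a single residue, the other group carries
                two or more residues, none of which matches it.
    Sequences must be aligned (equal length). Illustrative helper only.
    """
    fixed, semi_fixed = [], []
    for col in range(len(group1[0])):
        res1 = {seq[col] for seq in group1}
        res2 = {seq[col] for seq in group2}
        if len(res1) == 1 and len(res2) == 1:
            if res1 != res2:
                fixed.append(col + 1)
        elif (len(res1) == 1 or len(res2) == 1) and not (res1 & res2):
            semi_fixed.append(col + 1)
    return fixed, semi_fixed


# Hypothetical toy alignment, not taken from the Rdr1 data.
group1 = ["MKTLAV", "MKTLAV", "MKSLAV"]
group2 = ["MRTLGI", "MRTLGL", "MRTLGI"]
fixed, semi_fixed = diagnostic_sites(group1, group2)
print(fixed, semi_fixed)
```

In the toy data, columns 2 and 5 are fixed differences and column 6 is semi-fixed; column 3 is variable in group 1 but shares a residue with group 2, so it is not diagnostic.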

In addition, the pattern of nucleotide diversity differs between the two groups. Although the average total number of nucleotide differences within each group is similar (327 for group 1 and 339 for group 2), the ratio of non-synonymous to synonymous sites is higher in group 2 (2.75) than in group 1 (1.81).

In addition to the TNLs from HapOB1 and HapOB2, the F. vesca genome was screened for Rdr1 homologs. A total of 19 Rdr1 homologs were found in F. vesca, of which 17 are located on chromosome 7 and two are located on chromosomes 1 and 2 (F.ve-19 and F.ve-18, respectively). The 17 TNLs on chromosome 7 are organized in clusters at the distal end of the chromosome: cluster 1 contains seven TNLs, and clusters 2 and 3 contain four TNLs (Fig 1). Phylogenetic analysis of the TNLs from HapOB1, HapOB2 and F. vesca shows that the rose genes for the two clusters from chromosome 1 are grouped with the Fragaria genes from the clusters on chromosome 7, whereas the genes located on other chromosomes are clearly separated from this group (Fig 2). Chromosome 1 of rose is syntenic with chromosome 7 of Fragaria [9]. Furthermore, the rose genes in cluster 2 form a group (group 2) with the Fragaria genes in clusters 2 and 3, and the genes of each species form a distinct single-species clade within this group. In contrast, the genes from cluster 1 do not form strictly single-species clades within group 1, but rather one mixed-species clade and two single-species clades.

Fig. 2. Phylogenetic analysis of the amino acid sequence of HapOB1-, HapOB2- and F. vesca-TNLs homologous to Rdr1 in R. multiflora.
The Maximum Likelihood method based on the JTT matrix-based model was used to calculate the phylogenetic tree. The phylogeny was tested using the bootstrap method with 500 replicates. Branches reproduced in less than 75% of bootstrap replicates are collapsed. Bootstrap values are indicated as triangles, with the smallest triangle representing 82% and the largest 100%.

TNL structure in other Rosaceae

In a previous study [10], no Rdr1 homologs could be observed in P. persica and M. domestica genomes at syntenic positions. Updated genome assemblies have been released since then, and these might have been corrected for assembly errors around repeat regions. We therefore analysed the genomes again for the presence of Rdr1 homologs at syntenic positions. Rose chromosome 1 (where Rdr1 is located) shows good synteny with chromosome 2 in peach and chromosomes 1, 2 and 7 in apple [9]. Nevertheless, no homologous sequences for the Rdr1 gene were found at these positions, confirming the previous results.

In addition, we also analysed syntenic positions in Rubus occidentalis, a species from the Rosoideae sub-family, for which a chromosome scale assembly recently became available [27]. Synteny analysis of the genes surrounding the TNL clusters revealed no Rdr1 homologs in syntenic positions for P. persica, M. x domestica and R. occidentalis (S1 Table). In Prunus and Malus, more distantly related Rdr1 homologs were only detected in non-syntenic positions (S3 Fig), whereas in a draft genome from Potentilla micrantha, another species from the Rosoideae, several contigs contained Rdr1 homologs. The genes P.mi-12 and -13 are located on contig 1260 together with genes coding for a yellow stripe-like protein, ubiquitin and a TOPLESS-RELATED protein flanking the Rdr1 locus in R. multiflora and R. rugosa, indicating that Rdr1 homologs are present at syntenic positions in P. micrantha. Analysis for Rdr1 homologs identified 19 for F. vesca, three for R. occidentalis, 10 for Malus x domestica, 17 for P. persica and 11 for P. micrantha (S2 Table).

Phylogenetic analysis showed that the non-syntenic Rdr1 homologs from P. persica, M. x domestica and R. occidentalis are clearly separated from the Rdr1 homologs of OB and F. vesca located on chromosome 1 (OB) and chromosome 7 (F. vesca) (Fig 3). Furthermore, some of the P. micrantha Rdr1 homologs group together with these OB and F. vesca TNLs, which is consistent with the presence of these genes at positions syntenic to the Rdr1 clusters.

Fig. 3. Phylogenetic analysis of the amino acid sequence of TNLs from different Rosaceae family members homologous to Rdr1 of R. multiflora.
The Maximum Likelihood method based on the JTT matrix-based model was used to calculate the phylogenetic tree. The phylogeny was tested using the bootstrap method with 500 replicates. Branches reproduced in less than 60% of bootstrap replicates are collapsed. Bootstrap values are indicated as triangles, with the smallest triangle representing 70% and the largest 100%. For better visualization, Rdr1 homologs for the different Rosaceae family members are coloured as follows: HapOB1/2 (OB1+2: dark green), M. domestica (M.do: red), F. vesca (F.ve: black), P. persica (P.pe: orange), P. micrantha (dark blue), R. occidentalis (purple). The protein alignments are shown in S2 Dataset.

Rdr1 homologs from other rose species

Analysis of seven additional recently available genome sequences [8] identified 15 Rdr1 homologs for R. damascena, three for R. persica, eight for R. moschata, 13 for R. xanthina spontanea, 13 for R. chinensis var. spontanea, nine for R. laevigata and 12 for R. minutifolia alba (Table 1). To date, only highly fragmented genome assemblies are available for these rose species, which makes a chromosomal assignment of the TNL homologs of Rdr1 difficult.

Tab. 1. List of Rdr1 homologs found in different rose species.

Based on the observation that Fragaria Rdr1 homologs from syntenic clusters form phylogenetic groups with rose homologs of Rdr1, we computed a phylogenetic tree to identify homologs from other rose species (Fig 4). For R. multiflora and R. rugosa, previously obtained TNLs [25] were used. The most conspicuous group (group 3), with high bootstrap support, contains single TNLs from HapOB1/2 and Fragaria located on different chromosomes outside the two syntenic clusters. They are grouped together with two R. chinensis, two R. minutifolia, two R. moschata genes and one R. xanthina gene, which also most likely represent genes from outside the syntenic clusters. All Rdr1 homologs of HapOB, R. multiflora [25], R. rugosa [10] and Fragaria, known to derive from cluster 2, fall into one highly supported large group (group 2) that also includes sequences from all other rose species.

Fig. 4. Phylogenetic analysis of the amino acid sequence of Rdr1 homologs from different rose species and Fragaria.
The Maximum Likelihood method based on the JTT matrix-based model was used to calculate the phylogenetic tree. The phylogeny was tested using the bootstrap method with 500 replicates. Branches reproduced in less than 75% of bootstrap replicates are collapsed. Bootstrap values are indicated as triangles, with the smallest triangle representing 76% and the largest 100%. For better visualization, Rdr1 homologs for the different species are coloured as follows: HapOB1/2 (OB1/2: dark green), R. multiflora (R.mu: red), F. vesca (F.ve: black), R. rugosa (R.ru: orange), R. damascena (R.da: dark blue), R. persica (R.pe: grey), R. moschata (R.mo: pink), R. xanthina (R.xa: dark purple), R. chinensis (R.ch: neon green), R. laevigata (R.la: purple), R. minutifolia (R.mi: light blue). The protein alignments are shown in S3 Dataset.

Within group 2, Rdr1 homologs from Fragaria form a distinct sub-group, whereas most of the other rose sequences form mixed sub-groups with no clear single species clades. In contrast, sequences clustered in group 1 do not form genus-specific sub-groups, but Fragaria and rose sequences form mixed sub-groups.

Discussion

More than 50% of the NBS-encoding genes are organized in clusters in the genome for many species such as Arabidopsis (64–71%), rice (50–74%), potato (73%), Medicago (80%) and apple (80%) [17,28]. Furthermore, these clusters are not evenly distributed between chromosomal positions. In Medicago truncatula, chromosome 6 contains approximately 34% of all TNLs, and chromosome 3 harbours approximately 40% of all CNLs [22]. In apple, approximately 56% of all identified RGAs are located on six of the 17 chromosomes, with 25% on chromosome 2 alone; whereas in grapevine, 80% of TNLs were located on chromosomes 5, 12 and 18 [28,21]. In tomato, the majority of NBS-LRRs are located close to the telomeres, where recombination occurs frequently, while few were detected in regions called “cold spots” for recombination [29]. An accumulation of RGAs in sub-telomeric regions was also described for apple [28].

Previously, we characterized members of the Rdr1 gene family, among which the Rdr1 gene confers resistance to black spot [24,25] and forms a cluster of closely related genes. As no complete genome was available at that time, our analyses were restricted to the region captured by BAC contigs and previous versions of the Fragaria genome (and others). This research used the high-quality chromosome-scale assembly of the OB genome to analyse the structure of this gene family in more detail. Recently, two high-quality sequences at the chromosome scale from two independent haploids from the same cultivar ‘Old Blush’ were obtained [8,9]. However, even a high-quality assembly might contain assembly errors around regions of highly similar paralogues for large gene families. Evidence for this is provided by our analysis of the HapOB1 genome [8], which only predicts seven Rdr1 paralogues at the chromosome 1 positions in contrast to the situation in the HapOB2 genome [9], where 21 TNLs were annotated. Our access to source DNA was restricted to the original ‘Old Blush’ diploid genotype and the haploid material used to generate the HapOB2 genome; therefore, we can only state that the total number of amplified copies of the Rdr1 paralogues from the original diploid is twice as high as that from the HapOB2 genome (S2 Fig). Thus, the fact that the HapOB1 genome contains only seven paralogues might either be due to assembly errors or recombination events prior to the isolation of the independent haploid callus line from microspores in which meiosis had already occurred. Both processes could result in the elimination of some copies of the Rdr1 family members in the genome sequence. However, this remains unclear because only a fraction of the Rdr1 paralogues can be amplified with our primer combination and we do not have access to the HapOB1 DNA to analyse this DNA directly.

Our analysis shows that two major clusters and two single genes are located on chromosome 1 and further relatives are located on chromosome 2 of OB. Phylogenetic analysis shows that the two major clusters form different groups, which indicates an independent development of the two clusters. Related sequences found on chromosome 2 are clearly distinct from those on chromosome 1 and are therefore not treated as members of the same family.

Re-analysis of the Fragaria genome reveals a similar structure with TNL clusters at syntenic positions. A phylogram of complete Rdr1 sequences from the Fragaria and OB genomes shows that Fragaria group 2 and rose group 2 are closer to each other than to Fragaria group 1 and rose group 1. Furthermore, genes from group 1 form both mixed-species and single-species clades, whereas the genes from group 2 form only single-species clades. As both clusters were present before the taxa emerged, the likely cause is a faster evolution within group 2. This could be due to the known processes by which R-genes evolve (including higher rates of recombination, gene conversion and birth and death processes), which led to a concerted evolution of genes in group 1. A similar observation has been made for inbred lines of maize, in which some paralogues are organized in genotype-specific subgroups [30]. A possible reason for the greater dynamics of group 2 in both Rosa and Fragaria might be its more telomeric position relative to group 1, a factor known to promote higher rates of recombination.

A re-analysis of the latest versions of the apple and peach genomes confirmed earlier results [10] that there are no Rdr1-like TNL clusters at syntenic positions in these genomes. The earlier conclusion therefore stands that the Rdr1 clusters must have emerged after the Amygdaloideae split from the Rosoideae. Because a high-quality genome of R. occidentalis recently became available, we also checked for the presence of the clusters in Rubus; no cluster was found at a syntenic position.

The genome information for P. micrantha consists of a larger number of fragments, of which five contigs contain Rdr1 homologs.

One of these contigs (contig no. 1260) contains two Rdr1 homologs and conserved genes flanking group 1 in roses [10]. This indicates that Rdr1 homologs in Potentilla are in a putative syntenic position to the group 1 cluster in roses.

The other genes fall into groups of OB sequences that are in both clusters as well as on chromosome 2 in roses. This agrees with the Rosaceae phylogeny which places Potentilla and Fragaria into sister groups of the Potentilleae within the Rosoideae. The timeline for the evolution of the Rosaceae [26] led us to conclude that the Rdr1 cluster was translocated to its current position after the Rubeae split from other groups within the Rosoideae approximately 70–80 million years ago during the Cretaceous period.

A larger phylogram, including 137 sequences from ten species of Rosa, shows that, with few exceptions, all rose sequences form mixed clusters. Therefore, single-species clades for the rose genes within group 1 have not yet developed. Not all rose species can be easily differentiated taxonomically, and most are highly interfertile; this underlines a close relationship between these taxa and may be one reason for the lack of differentiation of group 1 genes.

This study is a first step in the analysis of the evolution of genes from the Rdr1 family in roses. However, we must keep in mind that the assembly of clustered duplicated genes is prone to errors. We can therefore hypothesize that some of the genes studied could represent consensus sequences of several genes that actually exist. As shown with HapOB2, an assembly obtained from long reads should result in a high-quality chromosome-scale assembly for these regions. However, the lower than expected number of Rdr1 homologs in the HapOB1 assembly, developed from PacBio reads, shows that this holds only as a general principle and not in every case.

Material and methods

Origin of sequences

For R. multiflora (HQ455834.1) and R. rugosa (JQ791545), previously published contigs spanning the Rdr1 locus were used [10,25]. The genomes of ‘Old Blush’, HapOB1 [8] and R. damascena Mill. were downloaded from NCBI (https://www.ncbi.nlm.nih.gov/), whereas the haploid genome of ‘Old Blush’, HapOB2 [9] was downloaded from a genome browser (https://iris.angers.inra.fr/obh/). The whole genomes of F. vesca, Malus x domestica, P. persica, R. occidentalis and P. micrantha were downloaded from the Genome Database for Rosaceae (https://www.rosaceae.org/).

Additionally, sequences of the rose species R. persica, R. moschata, R. xanthina spontanea, R. chinensis var. spontanea, R. laevigata and R. minutifolia alba were used ([9], assemblies unpublished). The origins of all used sequences are listed in Table 2.

Tab. 2. Origin of sequences used in this study.

Analysis of the Rd1LRR microsatellite marker in ‘Old Blush’

The Rdr1-TNLs in the ‘Old Blush’ genome were amplified from DNA of the haploid tissue that had been used for sequencing the HapOB2 genome as well as from DNA of the original diploid OB cultivar using the Rd1LRR microsatellite marker, which is present in the coding sequences of the NBS-LRR members, and analysed on a LiCor 4300 DNA analyser as previously described [3].

Gene prediction and annotation

Regions homologous to the Rdr1 locus were identified for all species using local BLAST searches implemented in BioEdit [36]. BLASTn searches were conducted with the muRdr1A sequence as the query and an E-value threshold of 1.0E-20.
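As an illustration of the E-value threshold, the following Python sketch filters BLASTn hits at 1.0E-20. It assumes, purely for illustration, that hits are available in the 12-column tabular layout written by command-line BLAST+ (`-outfmt 6`); the searches in this study were run within BioEdit, so this is not the workflow that was used. The example rows and the function name are hypothetical.

```python
EVALUE_CUTOFF = 1.0e-20  # threshold stated in the text


def filter_blast_hits(tabular_text, cutoff=EVALUE_CUTOFF):
    """Keep hits whose E-value does not exceed the cutoff.

    Assumed column order (BLAST+ -outfmt 6):
    qseqid sseqid pident length mismatch gapopen
    qstart qend sstart send evalue bitscore
    """
    kept = []
    for line in tabular_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        evalue = float(fields[10])
        if evalue <= cutoff:
            # Report subject coordinates in ascending order for either strand.
            start, end = sorted((int(fields[8]), int(fields[9])))
            kept.append({"subject": fields[1], "start": start,
                         "end": end, "evalue": evalue})
    return kept


# Hypothetical hits; values are invented for the example.
rows = [
    ("muRdr1A", "chr1", "92.1", "3500", "270", "6", "1", "3500",
     "120500", "124000", "0.0", "4800"),
    ("muRdr1A", "chr1", "85.4", "900", "120", "11", "10", "910",
     "301900", "301000", "3e-150", "700"),
    ("muRdr1A", "chr4", "78.0", "60", "13", "0", "2000", "2059",
     "5100", "5159", "2e-05", "45.4"),
]
example = "\n".join("\t".join(row) for row in rows)
hits = filter_blast_hits(example)
print([(h["subject"], h["start"], h["end"]) for h in hits])
```

The third hit (E-value 2e-05) is discarded, while the two chromosome 1 hits pass the threshold and would mark candidate regions for gene prediction.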

Gene prediction and annotation were performed using FGENESH and AUGUSTUS (http://www.softberry.com; http://augustus.gobics.de/). The protein domains were determined using PfamScan ([37], https://www.ebi.ac.uk/Tools/pfa/pfamscan/). Only genes with a size larger than 2 kb and coding for all three protein domains (TIR, NB-ARC, LRR) were used for further phylogenetic analyses.
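The selection rule for complete TNLs can be expressed as a short Python sketch. The record layout (a dictionary with `length` and `domains`) and the gene names are hypothetical; the only criteria taken from the text are the 2 kb size limit and the requirement for TIR, NB-ARC and LRR domains. Collapsing Pfam family names such as LRR_8 to their prefix is an assumption made here for simplicity.

```python
REQUIRED_DOMAINS = {"TIR", "NB-ARC", "LRR"}
MIN_GENE_LENGTH = 2000  # bp; genes must be larger than 2 kb


def is_complete_tnl(gene):
    """Return True if a predicted gene passes both selection criteria.

    `gene` is a hypothetical record: {"name", "length", "domains"}, where
    "domains" lists Pfam family names. Names such as "LRR_8" or "TIR_2" are
    reduced to the part before the underscore (an assumption of this sketch).
    """
    families = {name.split("_")[0] for name in gene["domains"]}
    return gene["length"] > MIN_GENE_LENGTH and REQUIRED_DOMAINS <= families


# Hypothetical predictions, not taken from the supplementary data.
predictions = [
    {"name": "geneA", "length": 4300, "domains": ["TIR", "NB-ARC", "LRR_8"]},
    {"name": "geneB", "length": 1500, "domains": ["TIR", "NB-ARC", "LRR_3"]},
    {"name": "geneC", "length": 5200, "domains": ["NB-ARC", "LRR_8"]},
]
complete = [g["name"] for g in predictions if is_complete_tnl(g)]
print(complete)
```

Only geneA is retained: geneB is too short, and geneC lacks a TIR domain and would count as a truncated NBS-LRR gene.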

Sequence alignment and construction of phylogenetic trees

The predicted amino acid sequences of the Rdr1 homologs of R. multiflora, R. rugosa, F. vesca, HapOB1, HapOB2, R. damascena, R. chinensis var. spontanea, R. laevigata, R. minutifolia alba, R. persica, R. moschata and R. xanthina spontanea were aligned in MEGAX using MUSCLE (Multiple sequence comparison by log-expectation, [38]) with default options.

For the aligned Rdr1 homologs from the different species, phylogenetic trees were constructed in MEGAX [39] using the maximum likelihood (ML) method with the Jones-Taylor-Thornton matrix-based model using a discrete gamma distribution with empirical frequencies (JTT+G+F) [40]. The best model was estimated using MEGAX. Initial trees for the heuristic search were obtained automatically. The tree topology was tested via a bootstrap analysis with 500 replicates. For better visualization of the phylogenetic trees, the Interactive Tree Of Life (iTOL) software version 4.2.3 [41] (https://itol.embl.de/) was used. Nucleotide diversity within groups of sequences was computed in MEGAX using nucleotide differences among aligned sequences.
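The within-group measure based on nucleotide differences can be sketched as the mean number of pairwise differences among aligned sequences. The Python example below is an illustration under stated assumptions rather than a reproduction of the MEGAX computation: columns with a gap in either sequence are skipped for that pair (a pairwise-deletion choice made here), and the toy sequences are hypothetical.

```python
from itertools import combinations


def pairwise_differences(seq_a, seq_b):
    """Count differing positions between two aligned sequences.

    Positions with a gap ('-') in either sequence are ignored, which is an
    assumption of this sketch (pairwise deletion).
    """
    return sum(
        1 for a, b in zip(seq_a, seq_b)
        if a != b and a != "-" and b != "-"
    )


def mean_pairwise_differences(sequences):
    """Average the difference counts over all unordered sequence pairs."""
    pairs = list(combinations(sequences, 2))
    if not pairs:
        raise ValueError("at least two sequences are required")
    return sum(pairwise_differences(a, b) for a, b in pairs) / len(pairs)


# Hypothetical toy alignment.
group = ["ATGCA", "ATGCT", "TTGCT"]
print(mean_pairwise_differences(group))
```

For the three toy sequences the pairwise counts are 1, 2 and 1, giving a mean of 4/3 differences per pair.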

The analysis of synonymous and non-synonymous sites was performed in MEGAX by aligning the amino acid sequences of sets of coding DNA sequences and analysing the DNA differences with the Nei-Gojobori model [42] for 1314 positions in the final dataset.

Synteny analysis

For the synteny analysis of the two clusters, genes surrounding the clusters were selected based on the rose reference sequence [9]. Reciprocal BLAST searches were performed against the most recent available Rosaceae genomes: Fragaria vesca [31, 45], Prunus persica [34], Malus domestica [32] and Rubus occidentalis [43]. The order of the homologous genes was checked on the genome browser of the GDR website (https://www.rosaceae.org/tools/jbrowse, [44]).
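The reciprocal BLAST logic can be summarized with a small Python sketch. It assumes that hits from both search directions have already been reduced to (query, subject, bit score) tuples and that the best hit is the one with the highest bit score; both are assumptions of this illustration, and all gene identifiers are hypothetical.

```python
def best_hits(hits):
    """Map each query to its highest-scoring subject.

    `hits` is an iterable of (query, subject, bitscore) tuples.
    """
    best = {}
    for query, subject, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(forward, reverse):
    """Return sorted (gene_a, gene_b) pairs that are each other's best hit."""
    fwd = best_hits(forward)
    rev = best_hits(reverse)
    return sorted((a, b) for a, b in fwd.items() if rev.get(b) == a)


# Hypothetical hits: rose flanking genes against Fragaria, and back.
rose_vs_fragaria = [
    ("rose_YSL", "fv_g1", 900.0),
    ("rose_YSL", "fv_g2", 300.0),
    ("rose_UBQ", "fv_g3", 500.0),
    ("rose_TPR", "fv_g4", 700.0),
]
fragaria_vs_rose = [
    ("fv_g1", "rose_YSL", 880.0),
    ("fv_g3", "rose_X", 520.0),
    ("fv_g3", "rose_UBQ", 400.0),
    ("fv_g4", "rose_TPR", 690.0),
]
pairs = reciprocal_best_hits(rose_vs_fragaria, fragaria_vs_rose)
print(pairs)
```

In this toy example, rose_UBQ is not retained because its best Fragaria hit points back to a different rose gene. The gene order of the retained pairs would then be compared between genomes, as was done here on the GDR genome browser.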

Supporting information

S1 Fig [tif]
Phylogenetic analysis of the amino acid sequence for . -TNLs and homologous TNLs of HapOB1 and HapOB2.

S2 Fig [tif]
Results from Rd1LRR microsatellite PCR.

S3 Fig [tif]
Genomic organizations of TNLs homologous to from and x .

S1 Table [xlsx]
Results of micro-synteny analysis outside the family clusters.

S2 Table [docx]
Positions and annotation of TNLs homologous to on the different chromosomes of Old Blush (OB1+2), . (F.ve), (P.pe), (M.do), (R.oc) and (P.mi).

S1 Dataset [txt]
Coding sequences of all used genes in this study.

S2 Dataset [txt]
Muscle alignment of protein sequences used for the phylogram shown in .

S3 Dataset [txt]
Muscle alignment of protein sequences used for the phylogram shown in .


References

1. Brands SJ Systema Naturae 2000: The Taxonomicon. [cited 06.04.2018] Available from: http://taxonomicon.taxonomy.nl/.

2. Koopman WJM, Wissemann V, Cock K de, van Huylenbroeck J, Riek J de, Sabatino GJ et al. AFLP markers as a tool to reconstruct complex relationships: A case study in Rosa (Rosaceae). Am J Bot. 2008; 95 (3): 353–366.

3. Terefe D, Debener T. An SSR from the leucine-rich repeat region of the rose Rdr1 gene family is a useful resistance gene analogue marker for roses and other Rosaceae. Plant Breed. 2011; 130 (2): 291–293.

4. Wissemann V, Ritz CM. The genus Rosa (Rosoideae, Rosaceae) revisited: Molecular analysis of nrITS-1 and atpB-rbcL intergenic spacer (IGS) versus conventional taxonomy. Bot J Linn Soc. 2005; 147 (3): 275–290.

5. Wissemann V. Conventional Taxonomy (Wild Roses). In: Roberts A, editor. Encyclopedia of rose science. Cambridge: Academic Press; 2003. pp. 111–117.

6. Debener T, Linde M. Exploring Complex Ornamental Genomes: The Rose as a Model Plant. CRC Crit Rev Plant Sci. 2009; 28 (4): 267–280.

7. Nakamura N, Hirakawa H, Sato S, Otagaki S, Matsumoto S, Tabata S et al. Genome structure of Rosa multiflora, a wild ancestor of cultivated roses. DNA res. 2018; 25 (2): 113–121. doi: 10.1093/dnares/dsx042 29045613

8. Raymond O, Gouzy J, Just J, Badouin H, Verdenaud M, Lemainque A et al. The Rosa genome provides new insights into the domestication of modern roses. Nat genet. 2018; 50 (6): 772–777. doi: 10.1038/s41588-018-0110-3 29713014

9. Hibrand Saint-Oyant L, Ruttink T, Hamama L, Kirov I, Lakhwani D, Zhou NN et al. A high-quality genome sequence of Rosa chinensis to elucidate ornamental traits. Nat plants. 2018; 4 (7): 473–484. doi: 10.1038/s41477-018-0166-1 29892093

10. Terefe-Ayana D, Kaufmann H, Linde M, Debener T. Evolution of the Rdr1 TNL-cluster in roses and other Rosaceous species. BMC Genomics. 2012; 13: 409. doi: 10.1186/1471-2164-13-409 22905676

11. Belkhadir Y, Subramaniam R, Dangl JL. Plant disease resistance protein signaling: NBS-LRR proteins and their partners. Curr Opin Plant Biol. 2004; 7 (4): 391–399. doi: 10.1016/j.pbi.2004.05.009 15231261

12. Jones DA, Jones JDG. The Role of Leucine-Rich Repeat Proteins in Plant Defences. In: Tommerup IC, Andrews JH, editors. Advances in botanical research: Incorporating advances in plant pathology. London: Academic. 1997. pp. 89–167.

13. Leipe DD, Koonin EV, Aravind L. STAND, a class of P-loop NTPases including animal and plant regulators of programmed cell death: Multiple, complex domain architectures, unusual phyletic patterns, and evolution by horizontal gene transfer. J Mol Biol. 2004; 343 (1): 1–28. doi: 10.1016/j.jmb.2004.08.023 15381417

14. McHale L, Tan X, Koehl P, Michelmore RW. Plant NBS-LRR proteins: Adaptable guards. Genome Biol. 2006; 7 (4): 212. doi: 10.1186/gb-2006-7-4-212 16677430

15. Pan Q, Wendel J, Fluhr R. Divergent evolution of plant NBS-LRR resistance gene homologues in dicot and cereal genomes. J Mol Evol. 2000; 50 (3): 203–213. doi: 10.1007/s002399910023 10754062

16. van Eck L, Bradeen JM. The NB-LRR Disease Resistance Genes of Fragaria and Rubus. In: Hytönen T, Graham J, Harrison R, editors. The Genomes of Rosaceous Berries and Their Wild Relatives. Heidelberg: Springer; 2018. pp. 63–75.

17. Sekhwal MK, Li P, Lam I, Wang X, Cloutier S, You FM. Disease Resistance Gene Analogs (RGAs) in Plants. Int J Mol Sci. 2015; 16 (8): 19248–19290.

18. Jia YX, Yuan Y, Zhang Y, Yang S, Zhang X. Extreme expansion of NBS-encoding genes in Rosaceae. BMC Genetics. 2015; 16: 48. doi: 10.1186/s12863-015-0208-x 25935646

19. Zhang M, Wu Y-H, Lee M-K, Liu Y-H, Rong Y, Santos TS et al. Numbers of genes in the NBS and RLK families vary by more than four-fold within a plant species and are regulated by multiple factors. Nucleic acids res. 2010; 38 (19): 6513–6525. doi: 10.1093/nar/gkq524 20542917

20. Kuang H, Woo S-S, Meyers BC, Nevo E, Michelmore RW. Multiple genetic processes result in heterogeneous rates of evolution within the major cluster disease resistance genes in lettuce. Plant Cell. 2004; 16 (11): 2870–2894. doi: 10.1105/tpc.104.025502 15494555

21. Yang S, Zhang X, Yue J-X, Tian D, Chen J-Q. Recent duplications dominate NBS-encoding gene expansion in two woody species. Mol Genet Genomics. 2008; 280 (3): 187–198. doi: 10.1007/s00438-008-0355-0 18563445

22. Ameline-Torregrosa C, Wang B-B, O’Bleness MS, Deshpande S, Zhu H, Roe B et al. Identification and characterization of nucleotide-binding site-leucine-rich repeat genes in the model plant Medicago truncatula. Plant Physiol. 2008; 146 (1): 5–21.

23. Ma F-F, Wu M, Liu Y-N, Feng X-Y, Wu X-Z, Chen J-Q et al. Molecular characterization of NBS-LRR genes in the soybean Rsv3 locus reveals several divergent alleles that likely confer resistance to the soybean mosaic virus. Theor Appl Genet. 2018; 131 (2): 253–265. doi: 10.1007/s00122-017-2999-9 29038948

24. Menz I, Straube J, Linde M, Debener T. The TNL gene Rdr1 confers broad-spectrum resistance to Diplocarpon rosae. Mol Plant Pathol. 2018; 19 (5): 1104–1113. doi: 10.1111/mpp.12589 28779550

25. Terefe-Ayana D, Yasmin A, Le TL, Kaufmann H, Biber A et al. Mining disease-resistance genes in roses: functional and molecular characterization of the Rdr1 locus. Front Plant Sci. 2011; 2: 35. doi: 10.3389/fpls.2011.00035 22639591

26. Xiang Y, Huang C-H, Hu Y, Wen J, Li S, Yi T et al. Evolution of Rosaceae Fruit Types Based on Nuclear Phylogeny in the Context of Geological Times and Genome Duplication. Mol Biol Evol. 2017; 34 (2): 262–281. doi: 10.1093/molbev/msw242 27856652

27. VanBuren R, Wai CM, Colle M, Wang J, Sullivan S, Bushakra JM et al. A near complete, chromosome-scale assembly of the black raspberry (Rubus occidentalis) genome. Gigascience. 2018; 7 (8).

28. Perazzolli M, Malacarne G, Baldo A, Righetti L, Bailey A, Fontana P et al. Characterization of resistance gene analogues (RGAs) in apple (Malus × domestica Borkh.) and their evolutionary history of the Rosaceae family. PloS One. 2014; 9 (2): e83844. doi: 10.1371/journal.pone.0083844 24505246

29. Nieri D, Di Donato A, Ercolano MR. Analysis of tomato meiotic recombination profile reveals preferential chromosome positions for NB-LRR genes. Euphytica. 2017; 213 (9): 1027.

30. Chavan S, Gray J, Smith SM. Diversity and evolution of Rp1 rust resistance genes in four maize lines. Theor Appl Genet. 2015; 128 (5): 985–998. doi: 10.1007/s00122-015-2484-2 25805314

31. Edger PP, VanBuren R, Colle M, Poorten TJ, Wai CM, Niederhuth CE et al. Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity. Gigascience. 2018; 7 (2): 1–7.

32. Daccord N, Celton J-M, Linsmith G, Becker C, Choisne N, Schijlen E et al. High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development. Nat Genet. 2017; 49 (7): 1099–1106. doi: 10.1038/ng.3886 28581499

33. Verde I, Abbott AG, Scalabrin S, Jung S, Shu S, Marroni F et al. The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution. Nat Genet. 2013; 45 (5): 487–494. doi: 10.1038/ng.2586 23525075

34. Verde I, Jenkins J, Dondini L, Micali S, Pagliarani G, Vendramin E et al. The Peach v2.0 release: High-resolution linkage mapping and deep resequencing improve chromosome-scale assembly and contiguity. BMC Genomics. 2017; 18 (1): 225. doi: 10.1186/s12864-017-3606-9 28284188

35. Buti M, Moretto M, Barghini E, Mascagni F, Natali L, Brilli M et al. The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry). Gigascience. 2018; 7 (4): 1–14.

36. Hall TA. BioEdit: A user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999; 95–98. doi: 10.1093/nass/42.1.95

37. Mistry J, Bateman A, Finn RD. Predicting active site residue annotations in the Pfam database. BMC Bioinformatics. 2007; 8: 298. doi: 10.1186/1471-2105-8-298 17688688

38. Edgar RC. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004; 32 (5): 1792–1797. doi: 10.1093/nar/gkh340 15034147

39. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol. 2018; 35 (6): 1547–1549. doi: 10.1093/molbev/msy096 29722887

40. Nei M, Kumar S. Molecular evolution and phylogenetics. Oxford University Press; 2000.

41. Letunic I, Bork P. Interactive tree of life (iTOL) v3: An online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016; 44 (W1): W242–5. doi: 10.1093/nar/gkw290 27095192

42. Nei M, Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 1986; 3 (5): 418–426. doi: 10.1093/oxfordjournals.molbev.a040410 3444411

43. VanBuren R, Bryant D, Bushakra JM, Vining KJ, Edger PP, Rowley ER et al. The genome of black raspberry (Rubus occidentalis). Plant J. 2016; 87 (6): 535–547. doi: 10.1111/tpj.13215 27228578

44. Jung S, Lee T, Cheng C-H, Buble K, Zheng P, Yu J et al. 15 years of GDR: New data and functionality in the Genome Database for Rosaceae. Nucleic Acids Res. 2019; 47 (D1): D1137–D1145. doi: 10.1093/nar/gky1000 30357347

45. Li Y, Pi M, Gao Q, Liu Z, Kang C. Updated annotation of the wild strawberry Fragaria vesca V4 genome. Hortic Res. 2019; 6: 61. doi: 10.1038/s41438-019-0142-6 31069085

