#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

Genome-wide identification and expression analysis of the PHD-finger gene family in Solanum tuberosum


Authors: Mingyue Qin aff001;  Wenbin Luo aff004;  Yan Zheng aff001;  Huazhong Guan aff002;  Xiaofang Xie aff001
Authors place of work: College of Life Sciences, Fujian Agriculture & Forestry University, Fuzhou, China aff001;  Key Laboratory for Genetics, Breeding and Multiple Utilization of Crops, Ministry of Education, Fujian Agriculture & Forestry University, Fuzhou, China aff002;  Fujian Key Laboratory of Crop Breeding by Design, Fujian Agriculture & Forestry University, Fuzhou, China aff003;  The Crop Institute, Fujian Academy of Agricultural Sciences, Fuzhou, Fujian, China aff004
Published in the journal: PLoS ONE 14(12)
Category: Research Article
doi: https://doi.org/10.1371/journal.pone.0226964

Summary

Plant homeodomain (PHD) proteins are prevalent in eukaryotes and play important roles in plant growth, development and abiotic stress response. In this study, the comprehensive study of the PHD family (StPHD) was performed in potato (Solanum tuberosum L.). Seventy-two PHD genes (named StPHD1-72) were identified and grouped into 10 subfamilies based on phylogenetic analysis. Similar structure organizations were found within each subfamily according to the exon/intron structures and protein motif analysis. These genes were unequally scattered on the chromosomes of potato, with 9 pairs of segmental duplicated genes and 6 pairs of tandem duplicated genes showing that both segmental duplicated and tandem duplicated events contributed to the expansion of the potato PHD family. The gene ontology (GO) analysis suggests that StPHD mainly functioned at the intracellular level and was involved in various binding, metabolic and regulation processes. The analysis of expression patterns of StPHD genes showed that these genes were differentially expressed in 10 different tissues and responded specifically to heat, salt and drought stress based on the FPKM (Fragments per kilobase of transcript per million mapped reads) values of the RNA-seq data. Furthermore, the real-time quantitative PCR for 12 selected StPHD genes revealed the various levels of gene expression corresponding to abiotic stress. Our results provide useful information for a better understanding of PHD genes and provide the foundation for additional functional exploration of the potato PHD gene family.

Keywords:

Gene expression – Maize – Sequence alignment – Phylogenetic analysis – Sequence motif analysis – Potato – Arabidopsis thaliana – Duplicated genes

Introduction

Zinc-finger proteins are widely dispersed in eukaryotic organisms. Zinc-finger domains are rich in cysteine or histidine and have been classified into several types, including RING (Really Interesting New Genes), LIM (Lin11, Isl-1 and Mec-3), and plant homeodomain (PHD) [1]. The majority of PHD proteins are in the nucleus [2]. A typical PHD protein usually contains one to several PHD-finger domains, and each of the proteins contains approximately 60 amino acids with the structure composition of Cys4-His-Cys3 zinc-binding motif [3]. The residues of the pair of cysteines or between cysteine and histidine are usually conserved, whereas the second amino acid residue before the final pair of cysteines is usually tryptophan or another aromatic amino acid [4]. The combination of the core amino acid residues and two zinc ions plays a fundamental role in maintaining a firm spatial framework for the domain, a roughly globular domain in a three-dimensional conformation [5]. PHDs proteins have been widely studied in plant species since the first PHD protein HAT3.1 was identified in Arabidopsis [6]. Studies have shown that PHD proteins perform critical roles in plant development [7]. In Arabidopsis, the PHD protein VIL1 regulates the expression of floral repressors through photoperiod and vernalization pathways. DUET, a PHD protein, is also essential for chromosome organization and progression during spermatogenesis in Arabidopsis [8]. The PHD protein VIM1 contains the histone residue and participates in the regulation of the chromatin state [9]. As a PHD domain containing protein, SIZ1 codes a SUMO E3 ligase, which contains the PHD domain and regulates the anther dehiscence for spikelet fertility. Furthermore, SIZ1 regulates both vegetative and reproductive development in plants [10, 11]. The Ehd3 gene, which encodes a PHD protein, is an important promoter of rice flowering [12]. Recent studies showed that PHD proteins are essential epigenetic members in methylation maintenance [13, 14]. For example, the PHD proteins ATX1 and ATX2 control the expression of genes that encode histone methyltransferase [15]. In addition, some PHD proteins are involved in abiotic stress response [16]. Six soybean PHD proteins, encoding the Alfin1-type PHD protein, showed differential expression in response to cold, ABA and drought treatment, and displayed resistance to salt stress in transgenic Arabidopsis [7].

The PHD gene families have been studied in several plant species, such as Arabidopsis (Arabidopsis thaliana), maize (Zea mays) [17], poplar (Populus trichocarpa) [18], Chinese pear (Pyrus bretschneideri) [19] and bamboo (Phyllostachys edulis) [20]. Potato (Solanum tuberosum) is one of the most important food crops in the world, with more than 380 million tons of field production worldwide (http://www.fao.org). However, the PHD gene family in potato has not been thoroughly examined. Genome sequence and annotation resources of the potato genome provides an excellent opportunity for understanding the PHD gene family [21]. In this study, a comprehensive investigation of the PHD gene family in the potato genome was conducted, including PHD gene structure, chromosomal localization, gene duplication, phylogenetic relationship, gene ontology, tissue-specific expression profile, and expression patterns following exposure to drought, heat and salt stress. The objectives of this study were to identify the genome-wide sequence structures and the evolutionary relationship of the potato PHD gene family and provide a better understanding of functional mechanisms of the PHD-finger gene family in plants.

Materials and methods

Identification of PHD genes

The PHD protein sequences of Arabidopsis thaliana were used as the queries to identify the amino acid orthologs in potato through the BLASTP tool of SpudDB [22] and Phytozome v12.1 (https://phytozome.jgi.doe.gov). Potato candidate PHD proteins were identified with more than 30% similarity to the query sequence and an E value less than E-10. The genomic sequences of Arabidopsis thaliana PHD proteins were downloaded from the Phytozome database (https://phytozome.jgi.doe.gov/pz/portal.html). The domains for PHD proteins were confirmed by using the Conserved Domain Database (CDD) from NCBI (https://www.ncbi.nlm.nih.gov/cdd/) [23] with an E value of 1e-10. Finally, the sequences with complete PHD domains were retained and were assigned as potato PHD (StPHD) genes. The information for gene IDs, physical position, sequence of gene and protein, and coding sequences (CDS) were retrieved from Phytozome. These StPHD genes were renamed according to the order of their physical position.

Gene structure and conserved motif identification

The parameters for the final confirmed PHD proteins were calculated using online ExPASy programs (http://web.expasy.org/protparam/) [24]. The exon-intron structures for the StPHD genes were identified on the Gene Structure Display Server (GSDS, http://gsds.cbi.pku.edu.cn/) [25]. The conserved motifs were identified by the MEME program (version 5.0.3, http://alternate.meme-suite.org/tools/meme) [26] using an optimum motif width of 50–250 residues and 20 as the number of maximum motifs. The function of the identified motifs was annotated with the CDD program [23].

Chromosomal location and gene duplication

The chromosomal location image of StPHD genes was generated by the MapInspect tool (http://www.plantbreeding.wur.nl/uk/software_mapinspect.html) based on gene physical position data. Gene duplication was determined by the length and similarity (>70%) [27, 28]. Tandem duplicated genes were defined when the physical interval of two duplicated gene was less than 100 kb and they were separated by less than five genes [29].

Phylogenetic analysis

Alignment of all potato PHD-finger protein sequences was generated using ClustalX1.83 [30]. A phylogenetic tree with 1,000 bootstrap replicates was constructed based on the IQ-TREE maximum-likelihood method [31]. BLASTN was used to identify the orthologs among potato, Arabidopsis and maize [32]. The two sequences with the best alignment were confirmed as orthologs if the alignment was more than 300 bp and shared at least 40% similarity.

GO annotation and RNA-seq data analysis

GO analysis for StPHD genes was performed using PlantRegMap (http://plantregmap.cbi.pku.edu.cn/go.php). FPKM values of StPHD for various tissues and treatments derived from RNA-seq (DM_v4.03) [21] were extracted from SpudDB. The heatmap2 function of the R package for heatmap functions were used to analyze the expression of StPHD genes [33].

Plant growth and treatments

T Virus-free plantlets (S. tuberosum L. autotetraploid cultivar Zhongshu 3) were produced from in vitro cuttings. Using the nodal cutting method, the potato shoots were cultured in full MS solid media with a 16-h day temperature of 22°C and an 8-h night temperature of 18 °C for one month. The plantlets were then grown in a tray with a half-strength modified Hoagland solution [34] for 6 days prior to treatment. Small foam squares were used to suspend plantlets in the solution, which was adjusted to cover the roots. Plantlets were exposed to abiotic stress conditions included heat (35°C), drought (260 mM mannitol) and salt (150 mM NaCl) treatment, untreated plantlets were used as control groups. All treated plantlets and controls were collected 6 h after treatment and stored at −80°C before RNA samples were extracted.

Quantitative real-time PCR analysis

The total RNA of the plantlets was extracted using TRIzol reagent (Invitrogen, http://www.invitrogen.com). The cDNA samples were assayed by qRT-PCR using SYBR Premix Ex Taq (Takara). The primers used for qPCR analysis are listed in S1 Table. Actin was used as an internal control. Three biological replicates (each replicate contained 6 plants) and three technical replicates were tested. The relative expression level of a gene was measured using the 2−ΔΔCt method [35].

Results

Identification and characterization of PHD genes

A BLASTP search was performed using 70 PHD protein sequences of Arabidopsis [36] as the queries against the potato genome database to analyze PHD genes in potato. In addition, the CDD analysis was used to validate the presence of the PHD conserved domain in the predicted sequences. Finally, a total of 72 PHD genes were confirmed in potato and these genes were named StPHD1 through StPHD72 based on their physical position on chromosomes (Table 1). However, a single gene (StPHD1) was located on the unassembled scaffolds, and therefore, StPHD1 could not be assigned to any of the potato chromosomes. The StPHD genes varied greatly in terms of the number of amino acids and molecular weights. The StPHD proteins contain 115 (StPHD42) to 1,569 (StPHD38) amino acids, with an average of 532.7 amino acids, whereas the molecular weights ranged from 12.94 kDa (StPHD42) to 175.35 kDa (StPHD38) and the theoretical isoelectric points (pI) varied between 4.43 (StPHD24) and 9.27 (StPHD63). The detailed information for StPHD genes, including the gene ID, protein length (aa), molecular mass (MS), pI and location are listed in Table 1.

Tab. 1. The PHD family genes in Solanum tuberosum.
The <i>PHD</i> family genes in <i>Solanum tuberosum</i>.

Phylogenetics and structure of PHD protein in potato

Phylogenetic analysis was performed, and an unrooted phylogenetic tree was generated using the 72 potato PHD protein sequences, with 1,000 bootstrap replicates, to reveal the evolutionary relationship for the PHD family (Fig 1A). Moreover, the exon/intron structures for individual PHD genes were investigated (Fig 1B). Together with the results of gene structure analysis, the StPHD family genes were clustered into 10 groups (I through X) with the bootstrap values (≥60%) on the phylogenetic tree. However, 4 StPHD genes (StPHD12, -10, -27 and -38) could not be grouped into any subfamily for the 10 groups due to the low bootstrap values (<60%). Among these 10 groups: group X was the largest group, which contained 18 members; group I, V and VIII had 8 members; group IX had 7 members, and these five clades represented 68.06% of the total StPHD proteins. In contrast, groups III and VII only contained two to three members.

Phylogenetic relationships and gene structures for the StPHD family.
Fig. 1. Phylogenetic relationships and gene structures for the StPHD family.
A. Phylogenetic tree generated by maximum-likelihood method with bootstrapping analysis (1,000 replicates) based on the protein sequences of StPHD genes (percentage of bootstrap value was displayed at each node). The branches of different groups are shown in different colors. B. The graphic displays exon-intron structures for StPHD gene family members using GSDS. The horizontal black lines and the blue boxes show introns and exons, and the scale displays the relative length and position of the introns and exons.

Structure analysis of StPHD genes (Fig 1B) showed the number of introns ranged from 0 to 20. Among them, 7 genes had no introns (StPHD24, -25, -17, -70, -67, -42 and -1) and StPHD51 had the largest number of introns (20). Most of the StPHD genes that shared the same group showed similar exon/intron structure, including the intron numbers and exon length, and however, exceptions were also found among these genes. For example, StPHD genes in group I contained 2 to 6 introns, and the members in StPHD gene groups II contained 5 to 9 introns. Interestingly, the members in group V had larger gene sequences and contained a wide range of introns (7–20) whereas they exhibited great diversity in exon length.

Potato PHD-finger domain and motifs

To further investigate the similarity between the potato PHD-finger domains, 72 PHD-finger domain sequences were aligned (S1 Fig). The highlighted figures in black and pink colors were the Zn ion binding sites with seven cysteines and one histidine residue (Cys4-His-Cys3), and the result indicated that the PHD-finger domain of potato is very conservative. Based on the result of alignment, the consensus sequence of the potato PHD-finger domain was C–X(1–2)-C–X(4–24)-C–X(2–17)-C–X(4–15)-H-X2-C–X(2–25)-C–X(0–4)-C. The consensus sequence of potato PHD domain was basically consistent with previous research [17, 18, 37]

The online MEME software was used to investigate the conserved motifs of 72 StPHD proteins within each subfamily to investigate the diversity of motif components among StPHDs. A total of 20 distinct motifs were identified (Fig 2), and the detailed sequences for each motif is listed in S2 Table. By searching CDD database, 11 putative motifs were acquired functional annotation and motifs 1, 2, 3, 5 and 8 were annotated for the components of the conserved PHD domain (Fig 2), and however, no functional annotation was found for the other 9 putative motifs. The majority of StPHD proteins in the same subfamily have similar motif components and distribution, suggesting that the StPHD proteins share the same groups may possess similar functions. For example, proteins in group I possessed motifs 4, 19 and 3, while group E members contained motifs 9, 3 and 6. However, significant differences were also observed between different groups, and some motifs were exclusively presented in a particular group, suggested that these motifs might play specific functions in the group. For instance, motifs 4 and 19 contain a histone-binding component, which is common for all members in group I and play an important role in transcription start [38]. Motifs 6 and 9 present in group E members, and motif 6 is a type of N-Acetyltransferases (NAT) and mostly catalyzes the transfer of an acyl group. Motif 9 possesses Jas domain and this motif appears to bind to the Groucho/Tup1-type co-repressor TOPLESS (TPL) and TPL-related proteins (TPRs) and with an important role in the jasmonate signalling pathway [39]. Motif 7 is the Bromo-adjacent Homology domain (BAH) in group A members and plays an important role in DNA methylation, replication and transcriptional regulation [40]. While many members in group J contain motif 10, it is Oberon Coiled-coil region (Oberon_cc) and plays an important role for maintenance and establishment of both the shoot and root apical meristems [41].

Schematic diagram of conserved motifs for StPHD proteins.
Fig. 2. Schematic diagram of conserved motifs for StPHD proteins.
Different motifs are displayed with different colored boxes and numbers (1–20). The annotation of each motif listed on the right, which was identified by CDD program.

Chromosomal locations and duplications of StPHD genes

A total of 71 out of the 72 PHD genes were assigned to the 11 potato chromosomes, except chromosome 12 (Fig 3). Chromosome 7 contained the largest number of PHD genes (10), followed by chromosomes 9 (9 PHD genes), whereas chromosomes 2 and 8 possessed the smallest number of PHD gene members (2). A similar uneven distribution pattern of PHD gene families was also observed in Arabidopsis [36] and Populus trichocarpa [18]. The regions on the proximate or the distal ends of potato chromosomes exhibited a dense distribution of StPHD genes, such as the bottom of chromosomes 1, 6, 7, 9, 10 and 11. Both tandem duplication and segmental duplication perform an important role in the expansion of the gene family [42]. The potential duplication events were investigated to reveal the detailed mechanism for the PHD gene family. According to the phylogenetic and comparative analysis of the StPHD genes, 9 gene pairs (StPHD6/40, StPHD 9/15, StPHD 6/36, StPHD 9/64, StPHD 15/64, StPHD 14/53, StPHD52/60, StPHD52/61 and StPHD51/63) were found to be involved in the segmental duplication events (Fig 3). A biased distribution pattern was also found among the 9 segmental duplication pairs and no pairs were distributed on chromosomes 2, 4, 5, 8, 11 and 12. In addition, 6 gene pairs (StPHD2/3, StPHD21/22, StPHD67/68, StPHD33/34, StPHD33/35 and StPHD34/35) were confirmed to be tandem duplicated genes, and they were distributed on chromosomes 1, 4, 5 and 11. These results suggest that segmental duplication events and segmental duplication may play important roles in the amplification of the potato PHD gene family.

Chromosomal locations of potato <i>PHD</i> genes.
Fig. 3. Chromosomal locations of potato PHD genes.
The left scale represents the length of potato chromosomes (Mb). The tandem duplicated gene pairs are marked with gray boxes, and the segmental duplicated gene pairs are marked with colored boxes and are linked by corresponding color lines.

Phylogenetic analysis of the StPHD gene family

To explore the evolutionary process of the PHD gene family in potato, 72 StPHD protein sequences were aligned with 70 PHD proteins from Arabidopsis [36] and 67 PHD proteins from maize [17]. The phylogenetic tree was classified into 11 distinct groups (group A through K) with high bootstrap values (≥60%) support (Fig 4). The PHD proteins with low bootstrap values were not included with any subfamily. Group C was the largest subfamily, with 42 proteins, whereas the smallest groups, E, G and I, contained 8 to 9 proteins. StPHD proteins were more closely related to those in the same subfamily from Arabidopsis and maize than those of the other potato PHD proteins. Recent reports showed that PHD proteins for many plant species were highly similar to those of Arabidopsis [43]. Based on our analysis, most PHD protein structures in potato, Arabidopsis and maize were highly similar. A total of 18 members of PHD subfamily K were identified and 5 times more members were found in potato than Arabidopsis; only 3 members were identified in Arabidopsis. Only 2 members of subfamily K were identified in maize and 8 more times were found in potato than in maize (Fig 4).

Phylogenetic analysis of potato, Arabidopsis and maize PHDs.
Fig. 4. Phylogenetic analysis of potato, Arabidopsis and maize PHDs.
The tree was constructed by maximum-likelihood method. Each PHD subfamily was marked by a specific color, except for black, which represented genes that did not belong to any subfamily due to low bootstrap values. The square, circle and diamond represent maize, potato and Arabidopsis PHD proteins, respectively.

In total, 77 ortholog pairs (St-At) were identified between potato and Arabidopsis (S3 Table). However, only 68 ortholog pairs were found between potato and maize (St-Zm) (S4 Table). The number of orthologs between potato and Arabidopsis were greater than that between potato and maize, which may be due to the closer evolutionary distance between potato and Arabidopsis. One StPHD gene identified two or more orthologs from Arabidopsis or maize. For example, AT1G14510, AT2G02470, AT3G42790, AT5G26210 and AT3G11200 are orthologs to StPHD6, and these paralogs may perform important roles in the expanding process of the PHD gene family in plants.

GO annotation for StPHD proteins

Previous reports showed that PHD is involved in many biological processes at the nuclear level [4], including chromatin modification and mediation of molecular interactions in gene transcription. In addition, PHD displayed the complicated histone sequence reading ability mediated by histone modifications [44]. Diverse functions were found by the GO analysis of StPHD proteins (Fig 5). The majority of StPHD proteins were involved in intracellular activities, whereas some of them were located in the nucleus, others were located in the organelle. Some StPHD proteins had molecular function of binding, including ion, protein, DNA and chromatin. Meanwhile, some StPHD proteins showed function for acetyltransferase activity. In terms of biological processes, StPHD proteins participated in various biological pathways and regulated the various metabolic and biological processes, such as mediation of various organizations of cellular components, chromosomes, macromolecules, complex subunits, and organelles. In addition, some StPHD proteins engaged in gene expression. The diverse biological functions of StPHD proteins may be due to diverged sequences rather than the conserved StPHD domain.

GO annotation of StPHD proteins.
Fig. 5. GO annotation of StPHD proteins.
The annotation was performed on three categories, including biological process (blue color), cellular component (green color), and molecular function (yellow color).

Tissue expression patterns for StPHD genes

To reveal the potential functions of the StPHD genes, the expression patterns of StPHD genes were investigated based on the FPKM values extracted from SpudDB from 10 main tissues (root, shoot, petal, carpel, sepal, stamen, tuber, leaf, flower and petiole) (Fig 6). However, 20 genes were excluded from the heat map analysis because of low expression (FPKM < 0.5) or lack of expression in all examined tissues. Based on the heatmap, considerable variations in expression were found among individual genes from different tissues. A total of 52 StPHD genes were clustered into 3 groups (Fig 6), and 16 genes (StPHD52, -6, -40, -65, -47, -59, -61, -13, -20, -4, -60, -5, -43, -30, -29 and -8) were included in group II, which showed high expression levels in most of the analyzed tissues. Among them, two genes (StPHD29 and StPHD8) were especially abundant. In addition, 12 genes were included in group III and showed lower expression levels in most of the tissues. Some StPHD genes showed tissue-specific expression patterns. For example, StPHD27 was found to show abundant expression in root, shoot and stamen but had lower expression levels in petal, carpel and leaf. The expression patterns for these genes provided preliminary information for their functions.

Expression heatmap of <i>StPHD</i> genes in different tissues.
Fig. 6. Expression heatmap of StPHD genes in different tissues.
FPKM values for StPHD genes were transformed by log10.

Expression profiles for StPHD genes in response to abiotic stress

To further explore the response of the 52 StPHD genes following exposure to various types of stress (heat, salt and drought), a heatmap was generated using the relative expression level (represented by the FPKM values of stress/control) (Fig 7). In response to heat stress, most of the StPHD genes were downregulated. StPHD15, StPHD39, StPHD70 and StPHD11 showed significant downregulation, and a few StPHD genes (e.g., StPHD49 and StPHD24) exhibited significant upregulation. In contrast, the variation in expression levels in response to salt and drought stress were not as divergent as that of heat stress. Most StPHD genes were upregulated; StPHD10, StPHD7 and StPHD24 displayed the most significant upregulation in response to salt. However, StPHD59 showed significant downregulation in response to drought stress. A few StPHD genes (e.g., StPHD10, StPHD69 and StPHD46), showed contrasting expression patterns among the three stress types. StPDH46 was not sensitive to salt stress, but it was upregulated when exposed to heat stress and downregulated with drought stress. The StPHD genes with diverse stress responses and expression patterns may be due to the functional dissimilation of StPHD proteins.

Expression heatmap for <i>StPHD</i> genes exposed to heat (H), salt (S) and drought (D) stress.
Fig. 7. Expression heatmap for StPHD genes exposed to heat (H), salt (S) and drought (D) stress.

It has been confirmed that a subfamily of the maize PHD gene family was involved in abiotic stress response (namely, GRMZM5G893976, GRMZM2G017142, GRMZM2G148810, GRMZM2G158918, GRMZM2G016817, GRMZM2G172001, GRMZM2G080917, GRMZM2G050495, GRMZM2G156088, GRMZM2G063864, GRMZM2G107807, GRMZM2G008259, GRMZM2G115424, GRMZM2G153087, AC225147.4, GRMZM2G110952 and GRMZM2G047316) [17]. In our study, phylogenetic analysis showed that all of these members in this maize subfamily were distributed in group J (Fig 4), suggesting the members in this group shared a close evolutionary relationship. As the functions were related to their protein structures, we concluded the 7 StPHD genes (StPHD40, -36, -19, -6, -52, -61 and -60) in group J (Fig 4) may also perform a similar role in response to various stress types in potato. Therefore, to further understand the function of these StPHD genes, qRT-PCR was used to study the expression patterns of 7 StPHD genes in response to abiotic stress (heat, salt and drought). In addition, 5 StPHD genes (StPHD10, -24, -38, -46 and -49) that exhibited significant response to abiotic stress, based on the FPKM values from SpudDB (Fig 7), were also selected for qRT-PCR analysis. The expression patterns for the 12 StPHD genes in response to heat, salt and drought are shown in Fig 8. In total, StPHD48 showed a slight response to all three abiotic treatments. Of the 12 PHD-finger genes, most of the genes were upregulated in response to heat stress, except StPHD52 and StPHD60. StPHD10, StPHD19 and StPHD61 were significantly upregulated (>2 fold). StPHD49 showed extreme sensitivity (>5-fold) in response to heat stress (Fig 7). Although 9 genes were upregulated in response to salt treatment, StPHD10, StPHD38 and StPHD48 were slightly downregulated compared to that of the control. In response to drought stress, 5 (StPHD6, StPHD19, StPHD24, StPHD60 and StPHD61) out of the 9 upregulated genes showed significant variation in expression level (>2 fold), whereas StPHD46, StPHD48 and StPHD49 were slightly downregulated.

Relative expression level of 12 <i>StPHD</i> genes in response to heat, salt and drought stress at 6 h.
Fig. 8. Relative expression level of 12 StPHD genes in response to heat, salt and drought stress at 6 h.
The error bars indicated standard deviation. Asterisks indicated that the expression level was significantly different from the value of the control (*P<0.05, **P<0.01).

Discussion

Previously reported PHD proteins, which participate in many biological processes including chromatin modification and mediating the molecular interactions in gene transcription and histone modifications, played important roles for the growth and development of plants [4, 44]. To comprehensively analyze the PHD gene family, genome-wide identification and annotation of PHD genes have been performed for many species including moso bamboo (60), maize (67) [17], Populus trichocarpa (73) [18], rice (58) and Arabidopsis (70) [45]. The total members of PHD and genome size appeared not to be always related when the PHD gene family of the reported species was compared. For example, moso bamboo has much larger genome sizes than that of poplar and Arabidopsis but possesses fewer PHD genes than these two species. Therefore, PHD gene families may undergo differential expansion in different species. It was hypothesized that PHD gene family expansion might be related to three whole-genome duplication events in Arabidopsis [46] and the specific “Salicoid” duplication event in poplar [47]. In maize, the segmental duplication events were found to perform an important role in the amplification of the PHD gene family during the evolutionary process [17]. It was reported that potato genome had gone through at least two rounds of genome duplication; one ancient event might have occurred after the divergence between dicots and monocots approximately 185 ± 55 million years ago, and the recent duplication occurred approximately 67 million years ago [48]. In our study, a total of 72 nonredundant potato PHD genes were identified. Of these genes, 18.06% were identified from segmental duplication suggesting that whole genome duplication events performed essential roles during the amplification process of PHD gene family. Compared to random duplicated genes in maize (3.0%) and poplar (5.5%), more than two times the tandem duplicated genes (12.5%) were found in potato. These results suggest that the random duplicated event performed an important role during the PHD gene family expansion process in potato, which differed from that of maize and poplar. According to the phylogenetic tree (Fig 4), we found potato had many more members than Arabidopsis (5 times) and maize (8 times) in group K, furthermore, two paralog pairs (StPHD2/3 and StPHD21/22) were derived from random duplicated events distributed in this subfamily. These results further support the important role of random duplicated events during the PHD gene family expansion process in potato.

Protein structure is closely linked with protein function. In our study, most of the StPHD proteins included in the same subfamily exhibited similar domain architectures, which indicated that they could perform similar functions. Furthermore, the structures and motif constitutions for the StPHD protein within each group were basically matched with the phylogenetic analysis (Figs 1 and 2). However, a few branches of the phylogenetic tree failed to categorize into any subfamily because of their low bootstrap values, which may be due to the relatively few feature positions except the highly conserved PHD domain. A similar phenomenon was found in other species, such as maize [17], Arabidopsis and poplar [18]. StPHD members appeared to have different patterns for gaining or losing introns since their origin, and the proteins in the same group were not entirely identical in terms of their structure and motifs (Figs 1 and 2). These results imply that the highly diverse characteristics of sequences among these genes, other than that of the conserved PHD-finger domain, have contributed to the functional diversity of the PHD in potato.

Based on the RNA-seq data for potato, the expression of StPHD genes exhibited incongruous expression profiles in various tissues (Fig 6). Two genes, StPHD8 and StPHD29, showed the specific action of housekeeping expression and they displayed significant expression levels in most of the tissues. Studies for 6 pairs of tandem duplicated genes found that many of these paralogs (StPHD22, -67, -68, -33, -34 and -35) exhibited lower or no expression level in the investigated tissues; these results suggest that they may be expressed at specific developmental stages or have gone through pseudogenization. Most paralogs displayed similar expression patterns, and only a few segmental duplicated StPHD gene pairs displayed diverse expression patterns with each other (e.g., StPHD8 and StPHD44), suggesting that the functional differentiation may be a main character for the surviving duplicated genes [49]. It was reported that some PHD proteins were involved in abiotic stress responses [16]. For example, a subfamily of the maize PHD gene family was reported to be involved in abiotic stress response by RT-PCR [17]. According to the phylogenetic analysis, 7 StPHD proteins were grouped in the same cluster with these maize subfamily members (Fig 4), indicating that these 7 StPHD genes have the potential function of abiotic stress response in potato. This conclusion was verified by qRT-PCR results (Fig 8), and 7 StPHD genes showed varying response to heat, salt and drought treatments.

To analyze the trend of the gene expression derived from qRT-PCR (Fig 8) and the FPKM values, we compared the results from these two different platforms (Fig 7). Similar tendency of the gene expression was found between the two different platforms. Two genes (StPHD6 and StPHD10) had the same expression trends in response to the three abiotic treatments. Four genes (StPHD24, -38, -46 and -49) had similar expression patterns in response to heat and salt stress. StPHD49 was extremely sensitive in both sets of results, whereas three genes (StPHD52, -60 and -61) had similar expression patterns in response to salt and drought stress. For other genes, the same direction changes were identified in response to one of the abiotic stress types. However, the results of qRT-PCR did not totally agree with the pattern of gene expression from RNA-seq data, and this observation may be due to several putative reasons. First, the potato varieties used in the two experiments were different; a doubled monoploid potato variety (DM) was used in RNA-seq [21], whereas autotetraploid cultivar Zhongshu 3 was employed in qRT-PCR. Second, the experimental treatments were different. The plants for qRT-PCR were exposed to a photoperiod of 16 h light/8 h dark in this study, whereas the plants for RNA-seq were placed in the dark environment [21]. Third, the relative expression data was generated by using the 2−ΔΔCt method depending on the SYBR, and the RNA-seq data used in the present study was indicated by FPKM. The FPKM value should be a good method to decrease sample differences, but it may cause deviation for highly expressed genes [50]. The bias of the FPKM value may result in different expression compared to that of qRT-PCR.

Supporting information

S1 Table [docx]
Potato gene-specific primers used for qRT-PCR analysis.

S2 Table [docx]
Sequences of 20 predicted motifs of StPHD proteins.

S3 Table [xlsx]
Information about orthologs between potato and Arabidopsis.

S4 Table [xlsx]
Information about orthologs between potato and maize.

S1 Fig [tif]
Multiple sequence alignment of PHD-finger domain of 72 potato PHD proteins.


Zdroje

1. Takatsuji H. Zinc-finger transcription factors in plants. Cell Mol Life Sci. 1998; 54(6):582–596. doi: 10.1007/s000180050186 9676577

2. Aasland R, Gibson TJ, Stewart AF. The PHD finger: Implications for chromatin-mediated transcriptional regulation. Trends Biochem Sci. 1995; 20(2):56–59. doi: 10.1016/s0968-0004(00)88957-4 7701562

3. Kaadige MR, Ayer DE. The polybasic region that follows the plant homeodomain zinc finger 1 of Pf1 is necessary and sufficient for specific phosphoinositide binding. J Biol Chem. 2006; 281(39):28831–28836. doi: 10.1074/jbc.M605624200 16893883

4. Bienz M. The PHD finger, a nuclear protein-interaction domain. Trends Biochem Sci. 2006; 31(1):35–40. doi: 10.1016/j.tibs.2005.11.001 16297627

5. Nakamura Y, Umehara T, Hamana H, Hayashizaki Y, Inoue M, Kigawa T, et al. Crystal Structure Analysis of the PHD Domain of the Transcription Co-activator Pygopus. J Mol Biol. 2007; 370(1):80–92. doi: 10.1016/j.jmb.2007.04.037 17499269

6. Schindler U, Beckmann H, Cashmore AR. HAT3.1, a novel Arabidopsis homeodomain protein containing a conserved cysteine-rich region. Plant J. 1993; 4(1):137–150. doi: 10.1046/j.1365-313x.1993.04010137.x 8106082

7. Wei W, Huang J, Hao YJ, Zou HF, Wang HW, Zhao JY, et al. Soybean GmPHD-Type Transcription Regulators Improve Stress Tolerance in Transgenic Arabidopsis Plants. PLoS ONE. 2009; 4(9):e7209. doi: 10.1371/journal.pone.0007209 19789627

8. Reddy TV, Kaur J, Agashe B, Sundaresan V, Siddiqi I. The DUET gene is necessary for chromosome organization andprogression during male meiosis in Arabidopsis and encodes a PHD finger protein. Development. 2003; 130(24):5975. doi: 10.1242/dev.00827 14573517

9. Woo HR, Pontes O, Pikaard CS, Richards EJ. VIM1, a methylcytosine-binding protein required for centromeric heterochromatinization. Genes Dev. 2007; 21(3):267–277. doi: 10.1101/gad.1512007 17242155

10. Thangasamy S, Guo CL, Chuang MH, Lai MH, Chen JC, and Jauh GY. Rice SIZ1, a SUMO E3 ligase, controls spikelet fertility through regulation of anther dehiscence. New Phytol. 2011; 189(3):869–882. doi: 10.1111/j.1469-8137.2010.03538.x 21083564

11. Wang HD, Makeen K, Yan Y, Cao Y, Sun SB, and X GH. OsSIZ1 Regulates the Vegetative Growth and Reproductive Development in Rice. Plant Mol Biol Rep. 2011; 29(2):411–417.

12. Matsubara K, Yamanouchi U, Nonoue Y, Sugimoto K, Wang ZX, and Minobe Y. Ehd3, encoding a plant homeodomain finger-containing protein, is a critical promoter of rice flowering. Plant J. 2011; 66(4):603–612. doi: 10.1111/j.1365-313X.2011.04517.x 21284756

13. Barlow JH, Faryabi RB, Callen E, Wong N, Malhowski A, Chen HT, et al. Identification of early replicating fragile sites that contribute to genome instability. Cell. 2013; 152(3):620–632. doi: 10.1016/j.cell.2013.01.006 23352430

14. Liang C, Zhang XL, Song SS, Tian CY, Yin YX, Xing GC, et al. Identification of UHRF1/2 as new N-methylpurine DNA glycosylase-interacting proteins. Biochem Biophy Res Co. 2013; 433(4):415–419.

15. Saleh A, Alvarez-Venegas R, Yilmaz M, Le O, Hou GC, Sadder M, et al. The highly similar Arabidopsis homologs of trithorax ATX1 and ATX2 encode proteins with divergent biochemical functions. Plant Cell. 2008; 20(3):568–579. doi: 10.1105/tpc.107.056614 18375658

16. Wang Y, Wang QQ, Zhao Y, Han GM, Zhu SW. Systematic analysis of maize class III peroxidase gene family reveals a conserved subfamily involved in abiotic stress response. Gene. 2015; 566(1):95–108. doi: 10.1016/j.gene.2015.04.041 25895479

17. Wang QQ, Liu JY, Wang Y, Zhao Y, Jiang HY, Cheng BJ. Systematic Analysis of the Maize PHD-Finger Gene Family Reveals a Subfamily Involved in Abiotic Stress Response. Int. J. Mol. Sci. 2015; 16(10):23517. doi: 10.3390/ijms161023517 26437398

18. Wu SN, Wu M, Dong Q, Jiang HY, Cai RH, Xiang Y. Genome-wide identification, classification and expression analysis of the PHD-finger protein family in Populus trichocarpa. Gene. 2015; 575(1):75. doi: 10.1016/j.gene.2015.08.042 26314912

19. Cao YP, Han YH, Meng DD, Abdullah M, Li DH, Jin Q, et al. Systematic analysis and comparison of the PHD-Finger gene family in Chinese pear (Pyrus bretschneideri) and its role in fruit development. Funct Integr Genomics. 2018; 18(5):1–13.

20. Gao YM, Liu HL, Wang YJ, Li F, Xiang Y. Genome-wide identification of PHD-finger genes and expression pattern analysis under various treatments in moso bamboo (Phyllostachys edulis). Plant Physiol Bioch. 2017; 123:378.

21. Potato Genome Sequencing Consortium. Genome sequence and analysis of the tuber crop potato. Nature. 2011.

22. Hirsch CD, Hamilton JP, Childs KL, Cepela J, Crisovan E, Vaillancourtb B, et al. Spud DB: A Resource for Mining Sequences, Genotypes, and Phenotypes to Accelerate Potato Breeding. Plant Genome. 2014; 7(1):93–113.

23. Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki CJ, Lu SN, et al. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. 2017; 45(D1):D200–D203. doi: 10.1093/nar/gkw1129 27899674

24. Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A. ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003; 31:3784–3788. doi: 10.1093/nar/gkg563 12824418

25. Guo AY, Zhu QH, Chen X. GSDS: a gene structure display server. Hereditas. 2007; 29(8):1023–1026. 17681935

26. Bailey TL, Bodén M, Buske FA, Frith A, Grant CE, Clementi L, et al. MEME SUITE: tools for motif discovery and searching, Nucleic Acids Research, 2009; 37:202–208.

27. Gu Z, Cavalcanti A, Chen FC, Bouman P, Li WH. Extent of Gene Duplication in the Genomes of Drosophila, Nematode, and Yeast. Mol Biol Evol. 2002; 19(3):256–262. doi: 10.1093/oxfordjournals.molbev.a004079 11861885

28. Yang S, Zhang X, Yue JX, Tian D, Chen JQ. Recent duplications dominate NBS-encoding gene expansion in two woody species. Mol Genet Genomics. 2008; 280(3):187–198. doi: 10.1007/s00438-008-0355-0 18563445

29. Wang LQ, Guo K, Li Y, Tu YY, Hu HZ, Wang BR, et al. Expression profiling and integrative analysis of the CESA/CSL superfamily in rice. BMC Plant Biol. 2010; 10(1):282.

30. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997.

31. Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015; 32 (1):268–274. doi: 10.1093/molbev/msu300 25371430

32. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997; 25(17):3389–3402. doi: 10.1093/nar/25.17.3389 9254694

33. Warnes G, Bolker B, Bonebakker L, Gentleman R, Huber W, Liaw A, et al. Various R programming tools for plotting data. 2015; https://cran.r-project.org/.

34. Hammer PA, Tibbitts TW, Langhans RW, Mcfarlane JC. Baseline growth studies of ‘Grand Rapids’ lettuce in controlled environments. Journal-American Society for Horticultural Science (USA). 1978.

35. Livak K, Schmittgen T. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001; 25(4):402–408. doi: 10.1006/meth.2001.1262 11846609

36. Zhang F, Yang ZN, Zhang S. Genome-wide analysis of PHD-finger protein family in Arabidopsis thalinana. Acta Bot. 2009; 45(3):227–238.

37. Capili AD, Schultz DC, Rauscher FJ, Borden KL. Solution structure of the PHD domain from the KAP-1 corepressor: Structural determinants for PHD, RING and LIM zinc-binding domains. EMBO J. 2001; 20: 165–177. doi: 10.1093/emboj/20.1.165 11226167

38. Winicov II, Bastola DR. Transgenic overexpression of the transcription factor alfin1 enhances expression of the endogenous MsPRP2 gene in alfalfa and improves salinity tolerance of the plants. Plant physiol. 1999; 120(2):473–480. doi: 10.1104/pp.120.2.473 10364398

39. Pauwels L, Barbero GF, Geerinck J, Tilleman S, Grunewald W, Pérez AC, et al. NINJA connects the co-repressor TOPLESS to jasmonate signalling. Nature. 2010; 464(7289):788–791. doi: 10.1038/nature08854 20360743

40. Callebaut I, Courvalin JC, Mornon JP. The BAH (bromo-adjacent homology) domain: A link between DNA methylation, replication and transcriptional regulation. FEBS Lett. 1999; 446:189–193. doi: 10.1016/s0014-5793(99)00132-5 10100640

41. Saiga S, Furumizu C, Yokoyama R, Kurata T, Sato S, Kato T, et al. The Arabidopsis OBERON1 and OBERON2 genes encode plant homeodomain finger proteins and are required for apical meristem maintenance. Development. 2008; 135(10):1751–1759. doi: 10.1242/dev.014993 18403411

42. Cannon SB, Mitra A, Baumgarten A, Young ND, May G. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biology. 2004; 4(1):10.

43. Li H, Yuan Z, Vizcay-Barrena G, Yang CY, Liang WQ, Zong J, et al. PERSISTENT TAPETAL CELL1 encodes a PHD-finger protein that is required for tapetal cell death and pollen development in rice. Plant physiol. 2011; 156(2):615–630. doi: 10.1104/pp.111.175760 21515697

44. Sanchez R, Zhou MM. The PHD finger: a versatile epigenome reader. Trends Biochem Sci. 2011; 36(7):364–372. doi: 10.1016/j.tibs.2011.03.005 21514168

45. Ying F, Liu QP, Xue QZ. Comparative Phylogenetic Analysis of the Rice and Arabidopsis PHD-finger Proteins. Acta Genetica Sinica. 2004; 31(11):1284–1293. 15651682

46. Barker MS, Vogel H, Schranz ME. Paleopolyploidy in the Brassicales: Analyses of the Cleome Transcriptome Elucidate the History of Genome Duplications in Arabidopsis and Other Brassicales. Genome Biol Evol. 2009; 1(1):391–399.

47. Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, et al. The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006.

48. Peng ZH, Lu Y, Li LB, Zhao Q, Feng Q, Gao ZM, et al. The draft genome of the fast-growing non-timber forest species moso bamboo (Phyllostachys heterocycla). Nature Genetics. 2013; 45(4):456–461. doi: 10.1038/ng.2569 23435089

49. Guillaume B, Wolfe KH. Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution. Plant Cell. 2004; 16(7):1679–1691. doi: 10.1105/tpc.021410 15208398

50. Zhao P, Wang DD, Wang RQ, Kong NN, Zhang C, Yang CH, et al. Genome-wide analysis of the potato Hsp20 gene family: identification, genomic organization and expression profiles in response to heat stress. BMC Genomics. 2018; 19(1):61. doi: 10.1186/s12864-018-4443-1 29347912


Článek vyšel v časopise

PLOS One


2019 Číslo 12
Nejčtenější tento týden
Nejčtenější v tomto čísle
Kurzy Podcasty Doporučená témata Časopisy
Přihlášení
Zapomenuté heslo

Zadejte e-mailovou adresu, se kterou jste vytvářel(a) účet, budou Vám na ni zaslány informace k nastavení nového hesla.

Přihlášení

Nemáte účet?  Registrujte se

#ADS_BOTTOM_SCRIPTS#