Skip to main content


Identification and expression analysis of genes with pathogen-inducible cis-regulatory elements in the promoter regions in Oryza sativa

Article metrics



Complex co-regulatory networks in plants may elicit responses during pathogen infections. A number of genes are activated when these responses take place. Identification of these genes would shed new light on understanding the mechanisms of rice response to pathogen infections and the elucidation of crosstalk among diverse signaling networks in rice disease resistance/susceptibility.


Here we report the identification of genes with pathogen-inducible cis-regulatory elements (PICEs) (AS-1, G-box, GCC-box, and H-box) in the promoter regions in rice. Our results showed that a set of 882 rice genes contained these four elements in their promoter regions. Of these genes, 190 encode disease resistance/susceptibility related proteins, and 70 encode transcription factors. Analyses of the available microarray data demonstrated that 357 transcripts were differentially expressed after pathogen infections. 48 out of 53 differentially expressed transcription factors are up-regulated or down-regulated by more than 1.1-fold in response to pathogen infections. Analyses of the public mRNA-Seq data showed that 327 transcripts were differently expressed after pathogen infections. A total of 100 up-regulated genes and 37 down-regulated genes were found in common between the microarray and mRNA-Seq data.


We report here a set of rice genes that contain the four PICEs, i.e., AS-1, G-box, GCC-box, and H-box, in their promoter regions, of which, 53.5% were up- or down-regulated when pathogens attack. The PICEs in the gene promoters are critical for rice response to pathogen infections. They are also useful markers for identification of rice genes involved in response to pathogen infections.


Rice plants are exposed to a number of pathogens that may negatively affect their growth and development. To adjust to pathogen infections in their environment, rice plants have evolved complex innate immunity systems. It has been established that plants evolved pathogen-associated molecular pattern (PAMP) -triggered immunity (PTI) and effector-triggered immunity (ETI) to defend against pathogen infections (Jones and Dangl 2006; Chisholm et al. 2006). PTI is the first layer of plant defense, and it can be activated by the perception of the PAMPs through plasma membrane-localized pattern recognition receptors (PRRs) (Li et al. 2016; Macho and Zipfel 2014). Then the downstream components such as mitogen-activated protein kinase (MAPK) cascades are activated and responsible for leading to defense responses (Chisholm et al. 2006). The second layer of plant defense is ETI, which would be activated by the specific recognition of pathogen effectors through resistance (R) proteins, e.g., nucleotide-binding and leucine-rich repeat (NB-LRR) proteins, in plants. The defense response mediated by the R proteins is characteristic of hypersensitive response (HR), which is usually linked to the cellular production of reactive oxygen species (ROS) (Maekawa et al. 2011). Now, it is known that multiple signaling mechanisms, such as those mediated by salicylic acid (SA), jasmonic acid (JA), and ethylene (ET), are necessary in the transduction of the signal of pathogen perception into rapid defense responses (Vlot et al. 2009; Robert-Seilaniantz et al. 2011).

Numerous genes in rice plants are rapidly or strongly activated at the transcriptional level by various pathogen attacks. Some of these genes are vital in the perception of pathogen infection (Nasir et al. 2017). Some other genes can be induced by plant hormones that include SA, JA, ET and brassinosteroid (Shigenaga and Argueso 2016; Yang et al. 2013). These plant hormones have been demonstrated as secondary signals after pathogen attack. Some genes are implicated in different primary and secondary metabolic pathways (Cho and Lee 2015). Another group of genes, named pathogenesis-related (PR) genes, has often been found to be induced in defense against pathogens (van Loon et al. 2006). Some genes, including transcription factor genes and PR genes, in other species, were observed to be strongly activated after inoculation with pathogens (Zander et al. 2014; Sohn et al. 2006; Meng et al. 2013). Similarly, previous study has shown that in incompatible interaction between rice and blast pathogen, rice genes implicated in defense were more rapidly and strongly induced at early infection stage, and this is in contrast to the genes in compatible interaction (Wang et al. 2014). It was reported that OsWRKY45 could be strongly activated after blast pathogen infection, and it could be also induced by salicylic acid and benzothiazole, leading to strongly increased resistance to Magnisea oryzae or Xanthomonas oryzae pv. oryzae (Shimono et al. 2007).

Many pathogen-inducible genes were found to contain specific cis-acting element sequences within their promoters (Yin et al. 1997; Lebel et al. 1998; Rushton and Somssich 1998). Deletion analyses of pathogen-inducible promoters have identified such sequences that are sufficient to confer responses to pathogens. Up to date, much effort has been made to identify diverse cis-acting elements within pathogen-inducible promoters. AS-1 (activation sequence-1, TGACG) cis-element was originally found in some viral and bacterial T-DNA promoters (Lam et al. 1989; Bouchez et al. 1989). The equivalent and homologous sequences are also named ocs or nos cis-regulatory elements (Ellis et al. 1993; Kim et al. 1993). This element is characterized by two TGACG motifs. G-box (with the core sequence ACGT) was first characterized in the promoters of plant light-regulated genes (Giuliano et al. 1988). Since then, as a ubiquitous cis-element, this cis-element motif was identified in the promoters in many various plant genes (Menkens et al. 1995). An H-box cis-element was found to be adjacent to G-box in the promoter of one soybean chalcone synthase encoding gene. H-box [CCTACC(N)7CT] was first identified in the promoter of the bean chs15 gene, which encodes chalcone synthase (Loake et al. 1992). This cis-element is critical for the induction of the expression of the chs15 gene by elicitors and other stress stimuli. GCC-box (AGCCGCC) was found over-represented in the promoters of many PRs (Sato et al. 1996; Zhou et al. 1997; Manners et al. 1998). The cis-element has been shown to be responsive to ET and jasmonate, and functions after pathogen infection, followed by the induction of plant defense responses (Brown et al. 2003; Manners et al. 1998; Ohme-Takagi et al. 2000).

There are a number of cis-regulatory elements found in pathogen-inducible plant promoters. Some of the elements are occurring with high frequency, others with low frequency. The above four cis-regulatory elements have been shown present with moderate frequency in the pathogen-inducible promoters. Also, they are together involved in SA, ET, and JA signaling pathways.

Rice is one of the important food crops. Currently, the rice genome sequence is available, which provides an excellent platform to discover pathogen-inducible cis-regulatory elements (PICEs) within the promoters at the genome level. In this paper, we identified the AS-1, G-box, GCC-box and H-box cis-regulatory elements in the whole promoters in Oryza sativa cv. japonica (O. sativa) using in silico approaches. Analyses on the functional classification of the putative proteins encoded by the identified transcripts revealed a number of disease resistance/susceptibility proteins, transcription factors, protein kinases, and unknown/hypothetical proteins.

To test the significance of these four PICEs present in the promoter regions, we compared the transcript levels of the rice genes controlled by the promoters with these four elements after pathogen infections to those of these genes after mock inoculation. We found that the transcripts from a great number of genes controlled by the promoters with these four elements were up-regulated or down-regulated after pathogen infections, indicating that the rice genes containing these four PICEs in their promoters are responsive to pathogen infections. Our results indicated that the genes with PICEs in their promoters are important for rice response to pathogen infections.


Identification of rice genes with PICEs in the promoter regions

The pathogen-inducible genes in host play an important role in disease resistance/susceptibility. To identify all the genes in O. sativa that contain AS-1, G-box, GCC-box and H-box cis-regulatory elements (Table 1) in their promoter regions, upstream sequences of 2 kb of 44,609 genes of O. sativa were downloaded from the EnsemblPlants database. The transposable elements, pseudo-genes, and noncoding RNA genes were discarded from the sequence dataset.

Table 1 Pathogen-inducible cis-elements, sequences and interacting factors

The above four elements were identified collectively by custom Perl scripts in upstream genomic sequences. Application of this search led to the identification of total 882 sequences with the four cis-regulatory elements (Fig. 1). Cis-regulatory elements may occur on the direct strands or on the reverse strands of the promoter sequence contexts. In this study, we identified 487 sequences with the four cis-regulatory elements on the direct strand of the promoters, and 407 sequences with the four cis-regulatory elements on the reverse strand of the promoters. We found there were 12 sequences overlap between the two groups. We defined these 882 sequences as putative pathogen-inducible promoters (PIPs) in rice. Our results showed that the number of the identified PIPs present in each rice chromosome varied, ranging from 39 to 120 (Fig. 1). Then, we identified 833 genes (encoding 882 transcripts) driven by the PIPs from the O. sativa genomic sequences (IRGSP-v1.0.37) (Additional file 1: Table S1).

Fig. 1

Chromosomal distribution of putative rice PIPs. PIPs: Pathogen-inducible promoters

Functional classification of identified genes driven by PIPs

The sequences and annotations of these genes were obtained from the rice genome annotation project database (RGAP, Also, they were annotated through the InterProScan program. There were 743 transcripts whose annotations were obtained. The identified candidate genes/transcripts were classified on the basis of their putative functions. Analysis on Gene Ontology (GO) biological process of these transcripts showed that the maximum percentage of transcripts implicated in cellular process (GO:0009987) (28.7%), followed by metabolic process (GO:0008152) (27.5%), biosynthetic process (GO:0009058) (15.9%), nucleobase, nucleoside, nucleotide and nucleic acid metabolic process (GO:0006139) (13.6%), response to stress (GO:0006950) (13.2%), and cellular protein modification process (GO:0006464) (11.8%) (Fig. 2). These data suggest that the rice response to pathogen infection may activate metabolic and biosynthetic processes, as well as stimuli response. The pathogen infection may be a stress factor that promotes rice reproductive process. Classification analysis on GO molecular function presented that the top three group transcripts were protein binding (GO:0005515) (14.1%), catalytic activity (GO:0003824) (14.0%), and binding (GO:0005488) (14.0%). Also, transcripts encoded by candidate genes driven by PIPs involved in nucleotide binding (GO:0000166), hydrolase activity (GO:0016787) (9.8%), kinase activity (GO:0016301) and transcription factor activity (GO:0003700) (8.3%) (Additional file 2: Figure S1A). According to the GO cellular component annotation, the maximum percentage of transcripts localize to membrane (GO:0016020) (14.0%), plastid (GO:0009536) (12.1%) and plasma membrane (GO:0005886) (10.9%), followed by nucleus (GO:0005634) (10.2%) (Additional file 2: Figure S1B). Of the transcripts, 190 disease resistance/susceptibility-related (DRR/DSR) proteins and 70 transcription factors were identified through InterProScan annotation (Additional file 3: Table S2, Additional file 4: Table S3). Out of the DRR/DSR proteins, the maximum belongs to P-loop proteins, then followed by protein kinase domain proteins, serine-threonine/tyrosine-protein kinases, armadillo-type fold proteins, leucine-rich repeat domain proteins, cytochrome P450 and so on (Additional file 3: Table S2). Thus rice protein kinase, receptor and R protein encoding genes are involved in defense response to pathogen infections. It may be a rational explanation that the PIPs contribute to the activation of these genes during pathogen infections. Of the predicted transcription factors, zinc finger family proteins accounted for approximately 44% (31/70) and were the maximum group, followed by ethylene insensitive 3 or similar proteins (7%, 5/70) (Additional file 4: Table S3). Therefore, the zinc finger proteins are essential for rice response to pathogen infections, and ethylene signaling would be activated by pathogen infections.

Fig. 2

Functional classification based on GO-biological process for 743 identified candidate rice genes driven by PIPs

A further functional analysis of the identified DRR/DSR proteins showed they were probably involved in many disease resistance mechanisms (Fig. 3). 16.79% (92) were involved in signaling pathways, including MAPK, SA, JA, ET, GA, ABA, etc. A group of 77 proteins (14.05%) was related in defense responses. Another group of 77 proteins (14.05%) was implicated in ETI. 12.59% (69) of the proteins were related to programmed cell death (PCD) or HR. 9.31% (51) were identified as transcription factors. 7.48% (41) were associated with degradation. Two groups of 25 proteins (4.56%) were implicated in PTI and perception of pathogen, respectively. Other proteins were involved in other than the above mechanisms, including R proteins, PRR, trichome formation, redox, and so forth (Fig. 3).

Fig. 3

Analysis on disease resistance mechanisms of the identified DRR/DSR proteins encoded by the rice genes under the control of PIPs

Expression analysis of genes controlled by PIPs through microarray and mRNA-Seq data

We have identified 882 rice genes with the PICEs in their promoters. To answer the question whether the PICEs lead to the transcriptional activation or repression of these genes after pathogen infections, we analyzed the microarray and mRNA-Seq data available in GEO database. We searched 38 experiments (32 from microarray data and 6 from mRNA-Seq data) querying pathogen-activated or repressed gene expression changes. Some experiments were rejected for the reason that few differentially expressed genes were obtained from these experiments. In the end, 27 experiments were employed and these experiments consisted of 21 microarray as well as six mRNA-Seq data sets, including interactions between rice and Magnaporthe oryzae, Rhizoctonia solani, Xanthomonas oryzae pv. oryzae, Meloidogyne incognita, and Meloidogyne graminicola (Additional file 5: Table S4).

In the microarray data analyses, a total of 357 differentially expressed transcripts were identified. Of these, 209 transcripts were detected to be up-regulated, and 148 were down-regulated (Additional file 6: Table S5). Analyses of mRNA-Seq data detected 327 differentially expressed transcripts. Out of these, 225 transcripts were shown to be up-regulated, and 102 were down-regulated (Additional file 7: Table S6). It is notable that 212 differentially expressed transcripts were identified in common from the microarray and mRNA-Seq data (Additional file 8: Table S7). However, the expression patterns of some of the transcripts, e.g., Os01t0229200–01 and Os01t0628000–01, are not consistent between these two experiments. This inconsistency may be caused by different conditions between the microarray and the mRNA-Seq experiments. The PICEs within the 2000 bp upstream regions of the genes encoding the 212 transcripts were highlighted (Additional file 9: Figure S2). A total of 100 up- and 37 down-regulated transcripts were discovered to be overlapped between the microarray and mRNA-Seq data (Fig. 4).

Fig. 4

Overlap of differentially expressed genes between rice microarray and RNA-Seq data. (A) Overlap of up-regulated genes between rice microarray and RNA-Seq data. (B) Overlap of down-regulated genes between rice microarray and RNA-Seq data

The predicted 70 transcription factors were separately checked for relative fold change in transcription during pathogen infections. The results showed that 53 transcription factors were differentially expressed, and a total of 90% (48/53) predicted candidate transcription factors were expressed more than 1.1 fold, by induction with pathogen infections (Table 2).

Table 2 The differentially expressed transcription factors encoded by the rice genes under the control of PIPs

We randomly selected a pool of 50 genes without the four PICEs in their promoters from different chromosomes as random controls to examine their expression levels in these microarray experiments. In our analysis, no significant differentiation at the transcriptional level was found for most of the random control genes between the pathogen infected samples and mock samples (Additional file 10: Table S8, Additional file 11: Table S9). We performed an enrichment analysis, and found a higher incidence of the differentially expressed rice genes driven by PIPs with the PICEs in the total rice genes driven by PIPs than that of the differentially expressed rice genes in the total rice genes (Fig. 5; Additional file 12: Table S10, Additional file 13: Table S11). Thus there is a strong association between the PICEs and the differential expression of rice genes driven by PIPs after pathogen attack. Therefore, we infer that the PICEs in the promoters led to the differential expression of the genes under the control of PIPs after pathogen infection.

Fig. 5

The enrichment analysis of the differentially expressed rice genes driven by PIPs. *P < 0.05, **P < 0.01 by post hoc Fisher’s exact test comparing each pair of data. The numbers of total reference genes were listed in Additional file/Table S10. DEGs differentially expressed genes


In this research, we performed a promoterome-wide search for AS-1, G-box, GCC-box and H-box cis-elements in rice and identified 833 genes, which potentially encode 882 putative transcripts under the control of pathogen-inducible promoters with these four cis-elements. Of the proteins encoded by these transcripts, DRR/DSR proteins account for 21.5% (190/882), and transcription factors constitute 7.9% (70/882). More than half (472/882) of the transcripts, encoded by the genes with PICEs in their promoter regions, were attested to be pathogen-inducible by the analyses of the available microarray and mRNA-Seq data. It can be expected that more of the rice genes, with the PICEs in the promoter regions, would be demonstrated to be pathogen-inducible if more analyses are carried out on the increasing microarray and mRNA-Seq data. Therefore, the PICEs are useful markers for identification of rice genes involved in response to pathogen infections.

AS-1 cis-element has been shown to be required for transcriptional activation of some genes. A functional plant ocs-element, a homologue of AS-1 element, was identified in the soybean Gmhsp26-A gene (Ellis et al. 1993). AS-1 cis-element was also found present in the promoters of some SA-inducible glutathione S-transferase genes (GSTs), which are considered as defense genes and can be activated by SA, auxins, methyl jasmonate, hydrogen peroxide and pathogens (Ulmasov et al. 1994; Chen and Singh 1999; Yang et al. 1997; Xiang et al. 1996; Dudler et al. 1991). In addition, some pathogenesis-related protein encoding genes (PRs) were also found to contain AS-1 cis-elements in their promoters (Strompen et al. 1998). As has been demonstrated, the AS-1 cis element is an oxidative stress-responsive element and can be activated by SA with the help of oxidative species (Garretón et al. 2002). In plants, it can be bound by the TGA family of the basic/Leu zipper transcription factors (bZIPs) (Table 1) (Xiang et al. 1997; Johnson et al. 2001; Kim and Delaney 2002; Sarkar et al. 2017; Després et al. 2000). bZIP proteins have been demonstrated to play important roles in activating a number of defense genes (Alves et al. 2013; E et al. 2014).

G-boxes have been demonstrated to be implicated in the induced expression of some genes after pathogen attacks. Usually, these boxes play roles in concert with other cis-regulatory elements. G-box and H-box have been revealed to shape in combination the complex patterns of the bean chs15 gene expression (Loake et al. 1992). These two boxes are also implicated in pathogen induction and other defense genes. Two proteins, named KAP-1 and KAP-2 respectively, were identified to bind to H-boxes in the promoter of the bean chs15 gene (Yu et al. 1993). G-box and H-box could be bound by a bZIP protein, designated G/HBF-1, for activation of initial plant defense against pathogen attack (Droge-Laser et al. 1997). G-box also might be bound by the basic/helix-loop-helix (bHLH) proteins or the bHLH-leu zipper MYC proteins (Table 1) (Toledo-Ortiz et al. 2003; Boter et al. 2004).

ERFs (ethylene-responsive element binding proteins, i.e., ethylene response factors), belonging to APETALA2 (AP2) /ERF family, are ethylene-inducible DNA binding proteins that could recognize and bind the GCC-box element (Table 1) (Ohme-Takagi and Shinshi 1995; Brown et al. 2003; Ohme-Takagi et al. 2000). In addition, AP2/ERFs have been also revealed to modulate the expression levels of jasmonate-responsive genes through the interaction with the GCC-box (Brown et al. 2003).

Other than the above four cis-elements, GT-1 (GAAAAA; GGTTAA), MRE [A(A/C)C(A/T)A(A/C)C; ACC(A/T)A(A/C)(T/C)], and W-box (TTGACY), which were observed to be over-represented within the rice promoters in our analysis, are also common pathogen-inducible cis-regulatory elements (Park et al. 2004; Gurr and Rushton 2005; Rushton and Somssich 1998). Elicitor-responsive element (ERE) [AATTGACC; (C/T)TGAC(C/T); GTCAGAAAGTCAG] is another pathogen-inducible cis-regulatory element (Heise et al. 2002; Fukuda and Shinshi 1994). These four PICEs are as well occurring within the 2000 bp upstream sequences that match the 212 differentially expressed transcripts captured from the microarray and RNA-Seq data analyses (Additional file 9: Figure S2).

It is well established on the basis of various experiments that promoters with particular cis-elements respond to specific triggers. The above PICEs are the target binding sites for specific transcription factors, such as bZIPs/bHLHs/MYC, AP2/ERFs, MYB (with recognition of MRE), and WRKY (interacting with W-box), which direct pathogen-inducible expression. We think that the combinatorial interactions of these PICEs in the rice promoters with their corresponding transcription factors are pivotal processes regulating spatio-temporal expressions of rice DRR/DSR genes during pathogen attacks. In other words, these genes respond to pathogen attacks through specific transcription factors which interact with the PICEs present in their promoters (Fig. 6).

Fig. 6

A working model for rice DRR/DSR genes under the governing of a pathogen-inducible promoter with the PICEs. DRR/DSR gene: disease resistance/susceptibility-related gene

Rice genes with the PICEs in their promoter regions are likely implicated in multiple defense mechanisms. A great number of the identified DRR/DSR proteins are protein kinase, protein kinase-like and phosphatase, which are potentially involved in signaling pathways, including mitogen-activated protein kinases. Many genes, e.g., encoding F-box-like domain proteins and ubiquitin-like domain proteins, are related to degradation. Some proteins, e.g., serine/threonine kinases, and NB-ARC domain proteins are likely involved in ETI. The leucine-rich repeat domain proteins are probably relevant to the perception of pathogen and PTI. There are some proteins functioning in defense response (e.g. cytochrome P450, thaumatin, chalcone/stilbene synthase). A lot of proteins belong to transcription factors, e.g., zinc finger proteins, heat shock transcription factors, bZIP proteins, and bHLH domain proteins. Several proteins, e.g., SKP1/BTB/POZ domain proteins, homeobox domain proteins, and ERF domain proteins, may be involved in SA, JA and ET signaling pathways, respectively. Other proteins, e.g., thioredoxin-like proteins, WAT1-related protein, and Barwin domain proteins, participate in redox reactions, cell wall modification, and HR, respectively. In our analysis, there are 17.1% (151/882) of the transcripts with unknown function. The genes encoding these transcripts contain PICEs in their promoter regions, indicating some unknown response mechanisms of rice to pathogen infections remain to be revealed. Our study sheds new light on rice molecular response mechanisms to pathogen infections.

The genes with the PICEs in the promoter regions in rice are likely involved in more than one regulation mechanism concerning disease resistance. All the rice genes identified here contain the PICEs in their promoter regions, they, therefore, can be potentially targeted by both bZIP and ERF transcription factors. It has been shown that bZIP and ERF transcription factors are implicated in SA, JA, and ET signaling pathways. So, the identified rice genes are likely to be activated through the hormone signaling pathways after pathogen infections. It seems that cross-coupling occurs between bZIP and ERF transcription factors and may, therefore, be of great importance in the modulation of the expression levels of the genes during the rice response to pathogen infections.

Identification and investigation of rice PIPs are valuable and useful in many aspects. Genetic engineering is an economical and convenient approach to improving rice disease resistance. Accurate regulation of exogenous gene expression is paramount to the engineering of rice plants with increased disease resistance. Many previous attempts to increase disease resistance employed constitutive over-expression of transgenes but again and again this caused poor quality or yield plants. It is now known that the poor quality or yield is attributed to the extensive cellular reprogramming connected with defence once unregulated defence reactions are triggered in uninfected cells. Therefore, compared with the constitutive expression of exogenous genes, gene expression under the control of PIPs is preferred in generating transgenic rice plants with resistance to pathogen attack, for this avoids the additional cost of resistance through restricting the expression to infected cells when needed. Therefore, it is useful to search promoters inducible by pathogens for rice disease resistance engineering. Our results would be a benefit for developing rice disease resistance engineering. The defining of a set of rice genes that are putatively regulated by PIPs will help elucidate the mechanisms by which rice plants respond to pathogen infections. It provides markers useful for subsequent study and will enable the elucidation of crosstalk among diverse signaling networks in rice disease resistance/susceptibility.


A set of rice genes under the control of PIPs that contain the PICEs, AS-1, G-box, GCC-box, and H-box, were identified. More than half of the genes were activated or repressed after pathogen infection. The identified rice genes may be implicated in diverse disease resistance mechanisms. The PICEs in the promoters are important for rice response to pathogen infections. They are also valuable tags for identification of rice genes in response to pathogen infections.


Retrieval of upstream sequences of O. sativa genes

For identification of genes with PICEs in the promoter regions, upstream sequences of 2 kb of 44,609 genes of O. sativa were downloaded from the Rice Annotation Project database (Os-Nipponbare-Reference-IRGSP-1.0) (

Identification of PIPs with PICEs and genes controlled by PIPs

The putative PIPs with AS-1, G-box, GCC-box and H-box cis-regulatory elements were screened out from the upstream sequences of 2 kb of 44,609 rice genes by custom Perl scripts. On the basis of identification of the putative PIPs from the rice upstream sequences, we further screened out the genes (or transcripts) controlled by these PIPs with PICEs from the whole rice genes.

Functional annotation for genes controlled by PIPs

To infer the molecular functions of these transcripts, which genes were controlled by PIPs, we performed homology searches against the protein databases (Panther-12.0 (Mi et al. 2017), PfamA-31.0, PRINTS-42.0, ProDom-2006.1, SuperFamily-17.5, and TIGRFAM15.0) using the InterProScan ver. 5.25 program (Jones et al. 2014; Finn et al. 2017). To assign further functional information to these transcripts, we conducted a search using the Rice Annotation Project database, and then gene ontology terms were obtained and assigned to each identified transcript.

Expression analysis of genes controlled by PIPs

The rice microarray data on pathogen inoculation vs mock were extracted from GEO (Gene Expression Omnibus) database of NCBI ( GEO2R, an interactive GEO-based web tool was used for the identification of significantly regulated genes across pathogen vs mock (GEO accession numbers: GSE7256, GSE18361, GSE21772, GSE21919, GSE30941, GSE33204, GSE33411, GSE39635, and GSE41798). We entered these GEO accession numbers and then defined groups as inoculation and mock with given number of replicates discovered in GEO. The false discovery rate method (Benjamini and Hochberg 1995) was applied to correct for multiple testing.

mRNA-Seq data (GSE81906: GSM2177582, GSM2177583, GSM2177584, GSM2177585, GSM2177586, and GSM2177587) on pathogen inoculation vs mock in SRA format were downloaded from the GEO Database and processed as follows using a combination of publicly available tools and custom Perl scripts. Quality was assessed using FastQC. Sequences were aligned onto the latest O. sativa genome assembly (released 37) using TopHat version 2.1.1(Langmead et al. 2009). Reads that map to a unique location in the O. sativa genome were adopted for differential expression analysis. The number of single mapping reads that locate each annotated gene in the O. sativa gene model annotations release-37 was counted by HTSeq-Count (version 0.9.1). Then, these data were provided as inputs to DESeq (R package, version 1.26.0) for statistical analysis of differential gene expression (Anders and Huber 2010).

Three studies presented complete information on differential gene expression. Therefore, some differentially expressed genes were obtained from these published data (Bagnaresi et al. 2012; Kawahara et al. 2012; Zhang et al. 2017).



activation sequence-1


the basic/Leu zipper transcription factors


disease resistance-related


disease susceptibility-related


ethylene-responsive element binding proteins




effector-triggered immunity


Gene Expression Omnibus


glutathione S-transferases


hypersensitive response


jasmonic acid


mitogen-activated protein kinase


nucleotide-binding and leucine-rich repeat


pathogen-associated molecular pattern


pathogen-inducible cis regulatory elements




pattern recognition receptors


PAMP-triggered immunity

R protein:

resistance protein


reactive oxygen species


salicylic acid


  1. Alves MS, Dadalto SP, Goncalves AB, De Souza GB, Barros VA, Fietto LG (2013) Plant bZIP transcription factors responsive to pathogens: a review. Int J Mol Sci 14(4):7815–7828.

  2. Anders S, Huber W (2010) Differential expression analysis for sequence count data. Genome Biol 11(10):R106.

  3. Bagnaresi P, Biselli C, Orru L, Urso S, Crispino L, Abbruscato P, Piffanelli P, Lupotto E, Cattivelli L, Vale G (2012) Comparative transcriptome profiling of the early response to Magnaporthe oryzae in durable resistant vs susceptible rice (Oryza sativa L.) genotypes. PLoS One 7(12):e51609.

  4. Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol 57(1):289–300

  5. Boter M, Ruiz-Rivero O, Abdeen A, Prat S (2004) Conserved MYC transcription factors play a key role in jasmonate signaling both in tomato and Arabidopsis. Genes Dev 18(13):1577–1591.

  6. Bouchez D, Tokuhisa JG, Llewellyn DJ, Dennis ES, Ellis JG (1989) The ocs-element is a component of the promoters of several T-DNA and plant viral genes. EMBO J 8(13):4197–4204

  7. Brown RL, Kazan K, McGrath KC, Maclean DJ, Manners JM (2003) A role for the GCC-box in jasmonate-mediated activation of the PDF1.2 gene of Arabidopsis. Plant Physiol 132(2):1020–1032.

  8. Buttner M, Singh KB (1997) Arabidopsis thaliana ethylene-responsive element binding protein (AtEBP), an ethylene-inducible, GCC box DNA-binding protein interacts with an ocs element binding protein. Proc Natl Acad Sci U S A 94(11):5961–5966

  9. Chen W, Singh KB (1999) The auxin, hydrogen peroxide and salicylic acid induced expression of the Arabidopsis GST6 promoter is mediated in part by an ocs element. Plant J 19(6):667–677.

  10. Chisholm ST, Coaker G, Day B, Staskawicz BJ (2006) Host-microbe interactions: shaping the evolution of the plant immune response. Cell 124(4):803–814.

  11. Cho MH, Lee SW (2015) Phenolic phytoalexins in rice: biological functions and biosynthesis. Int J Mol Sci 16(12):29120–29133.

  12. Després C, DeLong C, Glaze S, Liu E, Fobert PR (2000) The Arabidopsis NPR1/NIM1 protein enhances the DNA binding activity of a subgroup of the TGA family of bZIP transcription factors. Plant Cell 12(2):279–290.

  13. Droge-Laser W, Kaiser A, Lindsay WP, Halkier BA, Loake GJ, Doerner P, Dixon RA, Lamb C (1997) Rapid stimulation of a soybean protein-serine kinase that phosphorylates a novel bZIP DNA-binding protein, G/HBF-1, during the induction of early transcription-dependent defenses. EMBO J 16(4):726–738.

  14. Dudler R, Hertig C, Rebmann G, Bull J, Mauch F (1991) A pathogen-induced wheat gene encodes a protein homologous to glutathione-S-transferases. Mol Plant-Microbe Interact 4(1):14–18

  15. E ZG, Zhang YP, Zhou JH, Wang L (2014) Mini review roles of the bZIP gene family in rice. Genet Mol Res 13(2):3025–3036.

  16. Ellis JG, Tokuhisa JG, Llewellyn DJ, Bouchez D, Singh K, Dennis ES, Peacock WJ (1993) Does the ocs-element occur as a functional component of the promoters of plant genes? Plant J 4(3):433–443.

  17. Finn RD, Attwood TK, Babbitt PC, Bateman A, Bork P, Bridge AJ, Chang H-Y, Dosztányi Z, El-Gebali S, Fraser M, Gough J, Haft D, Holliday GL, Huang H, Huang X, Letunic I, Lopez R, Lu S, Marchler-Bauer A, Mi H, Mistry J, Natale DA, Necci M, Nuka G, Orengo CA, Park Y, Pesseat S, Piovesan D, Potter SC, Rawlings ND, Redaschi N, Richardson L, Rivoire C, Sangrador-Vegas A, Sigrist C, Sillitoe I, Smithers B, Squizzato S, Sutton G, Thanki N, Thomas PD, Tosatto Silvio CE, Wu CH, Xenarios I, Yeh L-S, Young S-Y, Mitchell AL (2017) InterPro in 2017—beyond protein family and domain annotations. Nucleic Acids Res 45(D1):D190–D199.

  18. Fukuda Y, Shinshi H (1994) Characterization of a novel cis-acting element that is responsive to a fungal elicitor in the promoter of a tobacco class I chitinase gene. Plant Mol Biol 24(3):485–493

  19. Garretón V, Carpinelli J, Jordana X, Holuigue L (2002) The as-1 promoter element is an oxidative stress-responsive element and salicylic acid activates it via oxidative species. Plant Physiol 130(3):1516–1526.

  20. Giuliano G, Pichersky E, Malik VS, Timko MP, Scolnik PA, Cashmore AR (1988) An evolutionarily conserved protein binding sequence upstream of a plant light-regulated gene. Proc Natl Acad Sci U S A 85(19):7089–7093

  21. Gurr SJ, Rushton PJ (2005) Engineering plants with increased disease resistance: how are we going to express it? Trends Biotechnol 6(23):283–290

  22. Heise A, Lippok B, Kirsch C, Hahlbrock K (2002) Two immediate-early pathogen-responsive members of the AtCMPG gene family in Arabidopsis thaliana and the W-box-containing elicitor-response element of AtCMPG1. Proc Natl Acad Sci U S A 99(13):9049–9054.

  23. Johnson C, Boden E, Desai M, Pascuzzi P, Arias J (2001) In vivo target promoter-binding activities of a xenobiotic stress-activated TGA factor. Plant J 28(2):237–243

  24. Jones JD, Dangl JL (2006) The plant immune system. Nature 444(7117):323–329.

  25. Jones P, Binns D, Chang H-Y, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn AF, Sangrador-Vegas A, Scheremetjew M, Yong S-Y, Lopez R, Hunter S (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics 30(9):1236–1240.

  26. Kawahara Y, Oono Y, Kanamori H, Matsumoto T, Itoh T, Minami E (2012) Simultaneous RNA-seq analysis of a mixed transcriptome of rice and blast fungus interaction. PLoS One 7(11):e49423.

  27. Kim HS, Delaney TP (2002) Over-expression of TGA5, which encodes a bZIP transcription factor that interacts with NIM1/NPR1, confers SAR-independent resistance in Arabidopsis thaliana to Peronospora parasitica. Plant J 32(2):151–163

  28. Kim S, Kim Y, An G (1993) Identification of methyl jasmonate and salicylic acid response elements from the nopaline synthase (nos) promoter. Plant Physiol 103(1):97–103

  29. Lam E, Benfey PN, Gilmartin PM, Fang RX, Chua NH (1989) Site-specific mutations alter in vitro factor binding and change promoter expression pattern in transgenic plants. Proc Natl Acad Sci U S A 86(20):7890–7894

  30. Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10(3):R25.

  31. Lebel E, Heifetz P, Thorne L, Uknes S, Ryals J, Ward E (1998) Functional analysis of regulatory sequences controlling PR-1 gene expression in Arabidopsis. Plant J 16(2):223–233.

  32. Li B, Meng X, Shan L, He P (2016) Transcriptional regulation of pattern-triggered immunity in plants. Cell Host Microbe 19(5):641–650.

  33. Loake GJ, Faktor O, Lamb CJ, Dixon RA (1992) Combination of H-box [CCTACC(N)7CT] and G-box (CACGTG) cis elements is necessary for feed-forward stimulation of a chalcone synthase promoter by the phenylpropanoid-pathway intermediate p-coumaric acid. Proc Natl Acad Sci U S A 89(19):9230–9234.

  34. Macho Alberto P, Zipfel C (2014) Plant PRRs and the activation of innate immune signaling. Mol Cell 54(2):263–272.

  35. Maekawa T, Kufer TA, Schulze-Lefert P (2011) NLR functions in plant and animal immune systems: so far and yet so close. Nat Immunol 12(9):818–826.

  36. Manners JM, Penninckx IA, Vermaere K, Kazan K, Brown RL, Morgan A, Maclean DJ, Curtis MD, Cammue BP, Broekaert WF (1998) The promoter of the plant defensin gene PDF1.2 from Arabidopsis is systemically activated by fungal pathogens and responds to methyl jasmonate but not to salicylic acid. Plant Mol Biol 38(6):1071–1080

  37. Meng X, Xu J, He Y, Yang KY, Mordorski B, Liu Y, Zhang S (2013) Phosphorylation of an ERF transcription factor by Arabidopsis MPK3/MPK6 regulates plant defense gene induction and fungal resistance. Plant Cell 25(3):1126–1142.

  38. Menkens AE, Schindler U, Cashmore AR (1995) The G-box: a ubiquitous regulatory DNA element in plants bound by the GBF family of bZIP proteins. Trends Biochem Sci 20(12):506–510.

  39. Mi H, Huang X, Muruganujan A, Tang H, Mills C, Kang D, Thomas PD (2017) PANTHER version 11: expanded annotation data from gene ontology and Reactome pathways, and data analysis tool enhancements. Nucleic Acids Res 45(D1):D183–D189.

  40. Nasir F, Tian L, Chang C, Li X, Gao Y, Tran LP, Tian C (2017) Current understanding of pattern-triggered immunity and hormone-mediated defense in rice (Oryza sativa) in response to Magnaporthe oryzae infection. Semin Cell Dev Biol.

  41. Ohme-Takagi M, Shinshi H (1995) Ethylene-inducible DNA binding proteins that interact with an ethylene-responsive element. Plant Cell 7(2):173–182

  42. Ohme-Takagi M, Suzuki K, Shinshi H (2000) Regulation of ethylene-induced transcription of defense genes. Plant Cell Physiol 41(11):1187–1192

  43. Park HC, Kim ML, Kang YH, Jeon JM, Yoo JH, Kim MC, Park CY, Jeong JC, Moon BC, Lee JH, Yoon HW, Lee SH, Chung WS, Lim CO, Lee SY, Hong JC, Cho MJ (2004) Pathogen- and NaCl-induced expression of the SCaM-4 promoter is mediated in part by a GT-1 box that interacts with a GT-1-like transcription factor. Plant Physiol 135(4):2150–2161.

  44. Robert-Seilaniantz A, Grant M, Jones JD (2011) Hormone crosstalk in plant disease and defense: more than just jasmonate-salicylate antagonism. Annu Rev Phytopathol 49:317–343.

  45. Rushton PJ, Reinstadler A, Lipka V, Lippok B, Somssich IE (2002) Synthetic plant promoters containing defined regulatory elements provide novel insights into pathogen- and wound-induced signaling. Plant Cell 14(4):749–762

  46. Rushton PJ, Somssich IE (1998) Transcriptional control of plant genes responsive to pathogens. Curr Opin Plant Biol 1(4):311–315

  47. Sarkar S, Das A, Khandagale P, Maiti IB, Chattopadhyay S, Dey N (2017) Interaction of Arabidopsis TGA3 and WRKY53 transcription factors on Cestrum yellow leaf curling virus (CmYLCV) promoter mediates salicylic acid-dependent gene expression in planta. Planta.

  48. Sato F, Kitajima S, Koyama T, Yamada Y (1996) Ethylene-induced gene expression of osmotin-like protein, a neutral isoform of tobacco PR-5, is mediated by the AGCCGCC cis-sequence. Plant Cell Physiol 37(3):249–255

  49. Shigenaga AM, Argueso CT (2016) No hormone to rule them all: interactions of plant hormones during the responses of plants to pathogens. Semin Cell Dev Biol 56:174–189.

  50. Shimono M, Sugano S, Nakayama A, Jiang CJ, Ono K, Toki S, Takatsuji H (2007) Rice WRKY45 plays a crucial role in benzothiadiazole-inducible blast resistance. Plant Cell 19(6):2064–2076.

  51. Sohn KH, Lee SC, Jung HW, Hong JK, Hwang BK (2006) Expression and functional roles of the pepper pathogen-induced transcription factor RAV1 in bacterial disease resistance, and drought and salt stress tolerance. Plant Mol Biol 61(6):897–915.

  52. Strompen G, Gruner R, Pfitzner UM (1998) An as-1-like motif controls the level of expression of the gene for the pathogenesis-related protein 1a from tobacco. Plant Mol Biol 37(5):871–883

  53. Toledo-Ortiz G, Huq E, Quail PH (2003) The Arabidopsis basic/helix-loop-helix transcription factor family. Plant Cell 15(8):1749–1770

  54. Ulmasov T, Hagen G, Guilfoyle TJ (1994) The ocs element in the soybean GH2/4 promoter is activated by both active and inactive auxin and salicylic acid analogues. Plant Mol Biol 26(4):1055–1064

  55. van Loon LC, Rep M, Pieterse CM (2006) Significance of inducible defense-related proteins in infected plants. Annu Rev Phytopathol 44:135–162.

  56. Vlot AC, Dempsey DA, Klessig DF (2009) Salicylic acid, a multifaceted hormone to combat disease. Annu Rev Phytopathol 47:177–206.

  57. Wang Y, Kwon SJ, Wu J, Choi J, Lee YH, Agrawal GK, Tamogami S, Rakwal R, Park SR, Kim BG, Jung KH, Kang KY, Kim SG, Kim ST (2014) Transcriptome analysis of early responsive genes in rice during Magnaporthe oryzae infection. Plant Pathol J 30(4):343–354.

  58. Xiang C, Miao Z, Lam E (1996) Coordinated activation of as-1-type elements and a tobacco glutathione S-transferase gene by auxins, salicylic acid, methyl-jasmonate and hydrogen peroxide. Plant Mol Biol 32(3):415–426

  59. Xiang C, Miao Z, Lam E (1997) DNA-binding properties, genomic organization and expression pattern of TGA6, a new member of the TGA family of bZIP transcription factors in Arabidopsis thaliana. Plant Mol Biol 34(3):403–415

  60. Yang DL, Yang Y, He Z (2013) Roles of plant hormones and their interplay in rice immunity. Mol Plant 6(3):675–685.

  61. Yang Y, Shah J, Klessig DF (1997) Signal perception and transduction in plant defense responses. Genes Dev 11(13):1621–1639

  62. Yin S, Mei L, Newman J, Back K, Chappell J (1997) Regulation of sesquiterpene cyclase gene expression. Characterization of an elicitor- and pathogen-inducible promoter. Plant Physiol 115(2):437–451

  63. Yu LM, Lamb CJ, Dixon RA (1993) Purification and biochemical characterization of proteins which bind to the H-box cis-elementimplicated in transcriptional activation of plant defense genes. Plant J 3(6):805–816

  64. Zander M, Thurow C, Gatz C (2014) TGA transcription factors activate the salicylic acid-suppressible branch of the ethylene-induced defense program by regulating ORA59 expression. Plant Physiol 165(4):1671–1683.

  65. Zhang J, Chen L, Fu C, Wang L, Liu H, Cheng Y, Li S, Deng Q, Wang S, Zhu J, Liang Y, Li P, Zheng A (2017) Comparative transcriptome analyses of gene expression changes triggered by Rhizoctonia solani AG1 IA infection in resistant and susceptible rice varieties. Front Plant Sci 8:1422.

  66. Zhou J, Tang X, Martin GB (1997) The Pto kinase conferring resistance to tomato bacterial speck disease interacts with proteins that bind a cis-element of pathogenesis-related genes. EMBO J 16(11):3207–3218.

Download references


This work was supported by a grant from the National Natural Science Foundation of China (30300207).

Availability of data and materials

The datasets supporting the conclusions of this article are included within the article and its additional files.

Author information

WK conceived and supervised the design of this study. LD, JC and BW analyzed the data. WK, LD and JC discussed the data. WK wrote the manuscript. All authors read and approved the final version of the manuscript.

Correspondence to Weiwen Kong.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Table S1. Chromosomal distribution of the rice genes controlled by putative PIPs. (XLSX 21 kb)

Additional file 2

Figure S1. Annotation of the identified rice transcripts driven by the PIPs. (A) Annotation of Go molecular function of the identified rice transcripts driven by the PIPs. (B) Annotation of Go cellular component of the identified rice transcripts driven by the PIPs. (DOCX 1355 kb)

Additional file 3:

Table S2. InterPro annotation for 190 putative disease resistance/susceptibility-related protein-encoding transcripts (119 groups) encoded by the rice genes under the control of PIPs. (XLSX 58 kb)

Additional file 4:

Table S3. The putative transcription factors encoded by the rice genes under the control of PIPs. (XLSX 12 kb)

Additional file 5:

Table S4. Pathogen-infected rice microarray and RNA-Seq data used in this study. (XLSX 13 kb)

Additional file 6:

Table S5. Differentially expressed transcripts driven by PIPs with the PICEs identified from the rice microarray data analysis. (XLSX 20 kb)

Additional file 7:

Table S6. Differentially expressed transcripts driven by PIPs with the PICEs identified from the rice RNA-Seq data analysis. (XLSX 20 kb)

Additional file 8:

Table S7. Differentially expressed transcripts in common from the rice microarray and RNA-Seq data analyses. (XLSX 21 kb)

Additional file 9:

Figure S2. The upstream sequences and putative pathogen-inducible cis-elements (PICEs) in rice. This document-like figure contains 212 upstream sequences (2000 bp) that correspond to the 212 differentially expressed transcripts obtained from the microarray and mRNA-Seq data analyses. The PICEs (AS-1, G-box, GCC-box and H-box) are highlighted in yellow. W-box, GT1, ERE (elicitor-responsive element) and MRE cis-elements are highlighted in green, cyan, magenta and red, respectively. (DOCX 165 kb)

Additional file 10:

Table S8. Differentially expressed genes without the PICEs in the promoters identified from the rice microarray data analysis. (XLSX 16 kb)

Additional file 11:

Table S9. Differentially expressed genes without the PICEs in the promoters identified from the rice RNA-Seq data analysis. (XLSX 13 kb)

Additional file 12:

Table S10. The proportion of the differentially expressed genes in the total rice genes. (XLSX 12 kb)

Additional file 13:

Table S11. The enrichment analysis of the differentially expressed rice genes driven by PIPs through Fisher’s exact test comparing each pair of the proportions of the differentially expressed genes in the total genes from Additional file 8: Table S10. (XLSX 11 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark


  • Rice
  • Pathogen-inducible cis-regulatory elements (PICEs)
  • Disease resistance
  • Disease susceptibility