1. Introduction
Gene transcription is activated or repressed by transcription factors, which bind to specific cis-elements of the promoter [
1]. The RWP-RK protein family, with a conserved RWP-RK domain, is a class of plant-specific transcription factors that is expressed in green algae, mosses, and vascular plants [
2]. It is divided into two subfamilies: NLPs (NIN-like proteins) and RKDs (RWP-RK domain proteins) [
3]. Unlike RKDs, all members of the NLP subfamily possess the PB1 (Phox and Bem 1) domain at their C-termini [
4].
Two subfamilies of RWP-RK have been to participate in the modulation of various processes in plants. Members of the RKD subfamily are primarily involved in embryonic development [
5]. For example,
AtRKD4 deficiency may lead to inhibition of zygote elongation and disrupt early cell division patterns [
6]. Overexpression of
AtRKD1 and
AtRKD2 was found to induce the formation of egg-like structures [
7]. Members of the NLP subfamily are also linked to the nitrogen response [
8]. The AtNLP7 receptor directly binds nitrate in intracellular environments [
9]. Loss of
NLP2 might trigger a decreased in nitrogen fixation and nitrogen content in plants [
10]. Moreover, evidence suggests that RWP-RK proteins modulate plant responses to abiotic stress, such as heat stress [
11].
Zanthoxylum armatum DC., also known as ‘Tengjiao’ or ‘Qinghuajiao’, belongs to the rutaceae family and is widely distributed in southwest China [
12,
13]. The fruit of
Z. armatum can be utilized to prepare food spice, medicine, and oil, highlighting its significant economic value [
14]. In addition,
Z. armatum, like citrus, exhibit apomixis traits [
15]. CitRWP, a gene that encodes a protein with a RWP-PK domain, was associated with apomixis in citrus [
16]. To date, analysis of the RWP-RK family has only been conducted for a few species such as soybean, tea, and
Brassica napus [
5,
17,
18]. However, and no family analysis of RWP-RK proteins in
Z. armatum has been reported. Recently, the genome of the
Z. armatum was published allowing researchers to analyze various gene families [
15,
19].
In this study, 36 ZaRWP-RK transcription factors were identified from the Z. armatum genome. Chromosomal localization, phylogenetic relationships, physicochemical properties, gene structure, and cis-element of the ZaRWP-RK transcription factors were explored. Furthermore, the expression levels of ZaRWP-RK in different tissues and protein-protein interaction predictions were conducted which showed that some ZaRWP-RK transcription factors may regulate apomixis process. Therefore, this study provides a theoretical basis for further investigations on RWP-RK genes and their regulatory mechanisms in the apomixis process in Z. armatum.
4. Discussion
The RWP-RK transcription factor is widely expressed in many plants and regulates various physiological processes such as nitrogen response, gametophyte development, and abiotic stress regulation [
28]. In many plants, including
Arabidopsis, rice, elephant grass and soybean, members of the RWP-RK transcription factor family have been reported [
5,
25,
29,
30]. However, no study has analyzed this family in
Z. armatum. In this study, we performed the genome-wide identification of the RWP-RK family members in
Z. armatum and analyzed their expression during reproductive processes, based on the recently published genome information of
Z. armatum [
19].
Using the conserved motif (RWPXRK) of RWP-RK transcription factor, 36 members of the RWP-RK family in Chili pepper genome were identified, which surpassed that reported in other plants, for example, 14 in
Arabidopsis, 13 in rice, and 8 in pummelo (
Figure 1 and
Figure 4).
Z. armatum, having undergone a whole-genome duplication, has higher number of chromosomes (2n = 4x = 132) [
15,
19]. This indicates that the
ZaRWP-RK genes have extensively expanded throughout the evolutionary process, probably due to the diversity of their functions. Gene duplication is a significant driving force for the diversification of gene functions and species evolution [
31]. In eukaryotes, there are two main types of duplication events: tandem and segmental [
32]. Previous studies have shown that the NLP subfamily in
Arabidopsis evolved through segmental duplication rather than tandem amplification [
25]. Similarly, we found that the ZaRWP-RK family in
Z. armatum evolved primarily through segmental duplication (
Figure 2). Moreover, we observed that the number of orthologous gene pairs between
Z. armatum and dicots (
Arabidopsis and pummelo) exceeded that between
Z. armatum and monocots (rice), indicating that after the divergence of monocots and dicots, the
ZaRWP-RK transcription factors have undergone extensive evolution and duplication (
Figure 3).
The RWP-RK transcription factor family is generally divided into two subfamilies: the RKD subfamily and the NLP subfamily [
25]. The available studies indicate that in Arabidopsis, the RKD subfamily has 5 members and the NLP subfamily has 7 members [
29,
33]. In rice, there are 7 members in the RKD subfamily and 6 members in the NLP subfamily [
25]. In this study, we analyzed 36 ZaRWP-RK transcription factors from the
Z. armatum genome. Through a phylogenetic analysis, we classified 15 of these members into the RKD subfamily and 21 members into the NLP subfamily (
Figure 4,
Table S1). Notably, the transcriptional activity, DNA-binding activity, and protein-protein interactions of transcription factors are often controlled by their motifs [
34]. Among the 10 conserved motifs identified in ZaRWP-RK family, motifs 1 and 7 were expressed in all ZaRWP-RK members, indicating that they might constitute the most conserved part of ZaRWP-RK (
Figure 5B). Motifs 2-6 and 8-10 were only identified in members of the NLP subfamily, suggesting that these motifs might plat important role in unique biological functions of the NLP subfamily. Analysis of the conserved domains of ZaRWP-RK showed that the PB1 domain was present in all NLP subfamily members, which is the main distinguishing point between RKD and NLP subfamily (
Figure 5C). Interestingly, our analyzes showed that ZaNLP2 contained a conserved GAF_2 domain. A recent study reported that nitrate-triggered ROS signaling and the detection of nitrate deficiency are dependent upon the GAF domain of AtNLP7 [
35]. Therefore, we speculated that ZaNLP2 might be involved in the regulation of nitrogen response. In addition, the PoIC superfamily domain was detected in ZaRKD10. PoIC is a major DNA polymerase involved in genome duplication, suggesting that ZaRKD10 might participate in rapid cell division stage, such as embryonic development [
36].
The activation of gene expression is regulated by Cis-elements present in the promoter region [
37]. Therefore, the physiological functions of cis-elements on the promoter can be used to predict the biological function of the genes [
38]. In ZaRWP-RK family, 21 types of cis-elements were identified, categorized into abiotic stress (6), response to phytohormone (9), and plant growth and development (6) (
Figure 6,
Table S2). Most of them were hormone response elements, with the ABRE (ABA response element) accounting for the largest proportion. ABA are key hormones that control plant growth, embryogenesis, seed physiology reactions, and stress tolerance [
39]. Pretreatment with ABA in embryo maturation medium increased cassava somatic embryo conversion into plants [
40]. In addition, a significant number of auxin response elements and gibberellin response elements were identified. Auxin and gibberellin have been shown to regulate plant embryo development [
41,
42]. Furthermore, numerous elements related to plant growth and development were identified in the promoters of ZaRWP-RK members, suggesting that ZaRWP-RK transcription factors participate in embryogenesis. Consistent with this view, numerous studies have shown that ZaRWP-RK modulates plant reproduction processes [
7]. Among the five RKD members in Arabidopsis,
AtRKD1 and
AtRKD4 are highly expressed during early embryo development, and the
AtRKD4 mutant leads to abnormal embryo development [
6,
29]. The MpRKD controls the development of gametophytes and contributes to the formation of gemma cups [
43]. Therefore, our results indicate that ZaRWP-RK transcription factors may participate in plant reproductive processes by responding to and regulating hormones such as ABA, auxin, and gibberellin.
Apomixis, found in over 300 genera and 40 families throughout the plant kingdom, displays a wide taxonomic distribution [
44]. In this mechanism, the fertilization step is skipped, allowing cells from nucellus or integument tissues to develop into somatic embryos. Previous studies have highlighted the crucial role of RWP-RK transcription factors in both somatic embryogenesis and apomixis processes [
15,
26,
45]. Therefore, investigating the expression pattern of the
ZaRWP-RK transcription factors during the apomictic reproduction process in
Z. armatum is of great importance. In this study, we found that the expression levels of most
ZaRWP-RK genes increase from flowers to young fruits significantly (
Figure 7). Among them, the expression levels of
ZaNLP6,
ZaNLP10,
ZaNLP18, and
ZaRKD3 and
ZaRKD5 show the most significant differences, indicating their potential functions in apomixis. In contrast,
ZaNLP7 expression was significantly lower in the female flower and young fruit than in the stems and leaf, suggesting it might negatively regulate apomixis.
Autophagy proteins are required for multiple functions during embryogenesis [
46]. Our predictions based on the protein-protein interaction network suggested that Cs_ont_4g023450 (homolog of ZaRKD2, ZaRKD10 and ZaRKD14) and Cs_ont_5g040080 (homolog of ZaRKD8 and ZaRKD9) may interact with ATG5 and ATG12, suggesting that ZaRKDs may be involved in autophagy-regulated embryonic development (
Figure 8). In addition, our results showed that Cs_ont_4g023450 and Cs_ont_5g040080 may interact with AP2A1, AP2S1, and CHC1 (
Figure 8). The adaptor protein complex 2 (AP-2) proteins participate in various biological processes, such as plant growth and development, and stresses response [
47]. AP2σ and AP1/2β modulate embryogenesis and plant growth in
Arabidopsis [
48,
49]. AP-2 has been shown to interact with CHC and regulate flower organ development. Therefore, we speculate that ZaRKDs may interact with AP2A1, AP2S1, and CHC1, thereby influencing the reproductive process in
Z. armatum. Moreover, we also found that Cs_ont_4g023450 and Cs_ont_5g040080 interacted with MIS1 and Alba3. The mitochondrial fission complex has been implicated in
Arabidopsis heat tolerance [
50]. ALBA proteins modulate various stress responses [
51]. Recent studies also have shown that overexpression of one
RWP-RK gene in pearl millet can significantly enhance plant heat tolerance [
11]. Therefore, it can be hypothesized that ZaRKDs may regulate heat tolerance in
Z. armatum.