1. Introduction
Small peptides (SPs), consisting of 5 to 100 amino acids, are widely distributed in plants and animals and function as critical signaling molecules regulating growth, development, and stress responses (Hellens et al., 2016; Ong et al., 2022). Most SPs are encoded by precursor genes, with typical peptide precursors having conserved family-specific sequences at the C-terminus (Olsson et al., 2019). These precursors undergo proteolytic cleavage and post-translational modifications to yield mature, functional peptides (Yu et al., 2019b). Mature SPs are recognized by nearby cell membrane surface receptors, such as receptor kinases and receptor-like kinase families. Alternatively, SPs may undergo long-distance transport to bind to the extracellular domains of corresponding receptors (Takahashi et al., 2018; Datta et al., 2024). This binding event activates the intracellular protein kinase domains of the receptors, triggering downstream signaling cascades, such as MAPK signaling and transcription factors (Fletcher, 2020; Zhou et al., 2022; Yang et al., 2024).
SPs are essential regulators of plant growth and development, influencing cell proliferation in stems, roots, leaves, flowers, fruits, and seeds (Song et al., 2016; Zhang et al., 2018; Ogawa-Ohnishi et al., 2022). They also mediate plant resistance to environmental stresses such as drought, salt, pests, and diseases (Reichardt et al., 2020; Zhou et al., 2022; Xiao and Zhou, 2023). Numerous SPs have been identified in various crops beyond model plants like Arabidopsis and rice, with their homologous genes and structures showing high conservation across plant species (Olsson et al., 2019; Jeon et al., 2021).
This review provides an overview of the latest methods for identifying peptides, which involve screening all amino acid sequences and small open reading frames (sORFs) of a species to pinpoint candidate peptides. We then delve into SPs intricately linked to plant growth and development, focusing on peptide families such as phytosulfokine (PSK), plant peptide-containing sulfated tyrosine (PSY), casparian strip integrity factors (CIF), root growth factor (RGF), clavata/embryo surrounding region (CLE), epidermal patterning factor/epidermal patterning factor-like (EPF/EPFL), LURE, rapid alkalinization factor (RALF), C-terminally encoded peptide (CEP), and Inflorescence deficient in abscission (IDA). Using these small peptides as exemplars, the review explores their origins and characteristics, summarizing how they fine-tune plant growth and development by activating various downstream signal transduction pathways. Additionally, it offers a forward-looking perspective on future research directions in peptide signaling pathways, aiming to inspire further exploration and discovery in this dynamic field.
1.1. Classification & Identification of SPs
SPs, known as small molecule signaling peptides, exhibit remarkably conserved homologous gene structures and domains among plants (Ong et al., 2022). The first small plant peptide, systemin, was identified in Solanum lycopersicum and regulates the resistance response to wounds caused by pests and diseases (Ryan and Pearce, 1998). Since then, various plant SPs have been identified and annotated using classical biochemical and molecular methods. Additionally, many predictive tools and bioinformatics approaches for identifying SPs are emerging.
1.2. Classification Based on Origin
Most SPs typically have a molecular weight of around 25 amino acids. They can be categorized into two main groups based on their origin: precursor-derived and non-precursor-derived peptides (
Figure 1; Datta et al., 2024). Precursor-derived peptides are processed from precursor proteins, often with a signal peptide at the N-terminus guiding their maturation into functional peptides (Ong et al., 2022; Datta et al., 2024). These peptides can be further classified into functional and non-functional precursors. For example, CAPE (CAP-derived peptide 1) and SUBPEP (Subtilase peptide) are functional precursors involved in immune defense mechanisms, while non-functional precursors lack a defined biological role, typically arising from extended preproproteins (Pearce et al., 2010; Chien et al., 2015; Xu et al., 2024). Non-functional precursors can be categorized into three groups based on their mature peptide characteristics (
Figure 1).
1. Peptides incorporating posttranslational modifications (PTMs), such as proline hydroxylation, hydroxyproline arabinosylation, and tyrosine sulfation, which confer biological activity and chemical stability (Royek S et al., 2022). Examples include PAMP-induced Secreted Peptides (PIP), CEP, IDA, and CLE peptides.
2. Cysteine-rich peptides (CRPs) containing structural domains with 2-16 Cys residues influence peptide structure and activity via intramolecular disulfide bond formation (Ong et al., 2022; Tian et al., 2022). The EPF/EPFL peptide family is a notable example of regulating plant stomata development (Kosentka et al., 2019).
3. Non-cysteine-rich/non-PTM peptides play roles in plant defense responses. Examples include systemin (SYS), plant elicitor peptide (PEP), and plant natriuretic peptides (PNP) (Lee et al., 2020).
Nonprecursor-derived peptides are directly translated from sORFs without intermediate propeptides or additional processing steps (Datta et al., 2024). These peptides can be classified into three classes based on the genomic location of the sORF genes (
Figure 1): (1) peptides encoded by upstream ORFs on the mRNA 5’ end, (2) peptides encoded within longer non-coding proteins in other transcript, and (3) peptides encoded by sORFs within primary transcripts of microRNAs (pri-miRNAs), In summary, SPs display diverse characteristics and functions in plants. Understanding the origin and classification of SPs provides insights into their regulatory roles in plant growth, development, and stress responses.
1.3. Based on the N-Terminal Sequence
Based on the variation in the N-terminal sequence of the peptide precursor protein, plant SPs can be categorized as non-secretory and secretory peptides (Xu et al., 2023; Datta et al., 2024). Non-secretory peptides can regulate intracellular processes and are released into the extracellular matrix from injured cells to activate plant defense responses directly (Jing et al., 2024). For example, PEP is a typical non-secretory peptide that triggers plant defense responses against pathogen (Zelman et al., 2023). Recent research has also revealed that PEP/REF1 functions as a systemin-independent local wound signal in tomato plants, predominantly orchestrating localized defense mechanisms and regenerative processes in response to injury (Yang et al., 2024). Additionally, it enhances the regenerative capacity of callus tissue (Lu et al., 2018; Yang et al., 2024).
In contrast, secretory peptides are synthesized intracellularly and transported to the extracellular space (Shen et al., 2022; Datta et al., 2024). They exert their effects by traversing the epidermis or xylem, facilitating intercellular signal transduction, and regulating the activities of adjacent cells in the extracellular milieu (Olsson et al., 2019; Shen et al., 2022). Peptides incorporating post-translational modifications (PTMs) and cysteine-rich peptides (CRPs) are classified as secreted peptides. Importantly, PTMs represent the most prevalent type of plant peptides (Xu et al., 2023). The precursor protein transforms into a mature peptide through electrostatic charge, hydrophilicity, and conformation modifications. The mature peptide then binds to receptors, activating downstream signal transduction pathways (Tian et al., 2022; Xu et al., 2023).
1.4. Identification Methods of SPs
Various functional plant SPs have been identified and annotated to regulate multiple aspects of plant growth (Hsiao and Yamada, 2020; Datta et al., 2024). Traditional biochemical and genetic approaches have been used to validate the biological functions of plant peptides. However, these methods are not ideal for high-efficiency peptide identification, often yielding only a few identified peptides (Molisso et al., 2022).
Currently, researchers have integrated novel peptidomics, bioinformatics screening, and biochemical and genetic validation methods for peptide identification, overcoming the high costs and technical difficulties associated with traditional biochemical analysis and circumventing obstacles arising from low gene abundance and redundancy (Wang et al., 2020; Ong et al., 2022). For instance, high-throughput analytical approaches that couple selective enrichment, fractionation, phenotype screening, and mass spectrometry identification provide an established framework for screening plant tissues for biologically relevant small secreted peptides (SSPs) (Wang et al., 2020; Hu et al., 2021). This method begins with selective enrichment strategies to isolate molecules of interest from complex crude extracts, commonly using techniques such as size exclusion ultrafiltration, gel-based separations, solvent extractions, and size exclusion chromatography (
Figure 2A; Feng et al., 2023).
Next, further fractionation is performed based on physicochemical properties (e.g., polarity, hydrophobicity, stability, solubility). SSPs can be screened for bioactivity in cell-based or cell-free systems to evaluate functions such as antimicrobial activity or free radical scavenging (Feng et al., 2023). After detecting fractions with relevant bioactivity, high-throughput LC-MS/MS is used to sequence unknown SSPs (
Figure 2A; Hu et al., 2021). However, mass spectrometry identification of SSPs faces challenges due to insufficient SSP representation in protein databases and unclear mechanisms of SSP maturation.
A recent bioinformatics approach for peptide identification involves screening for SSPs within the entire proteome of a given plant species. Initially, the focus is on the amino acid sequences of proteins. Proteins containing fewer than 250 amino acids are selected as potential secretory peptides, as they typically possess N-terminal signal peptides. Subsequently, signal peptide prediction is performed on these small peptides, eliminating those lacking N-terminal signal peptides. Since secretory peptides lack transmembrane domains, further prediction of transmembrane domains is employed on the remaining peptides, removing those exhibiting such structures. Finally, peptides with C-terminal K/HDLE sequences, which serve as endoplasmic reticulum docking proteins, are discarded, yielding the secretory peptide sequences for the given species (
Figure 2B; Tian et al., 2022; Xu et al., 2023).
Notably, genes encoding SSPs are often overlooked during genome annotation due to their diminutive size (Ong et al., 2022). Therefore, a more comprehensive approach involves screening sORFs to identify SSPs (
Figure 2B). This method first identifies non-coding sequences (NCDS) from the chromosomes using software tools like sORF finder. The subsequent steps are similar to the methods described earlier, wherein the presence or absence of an N-terminal signal peptide, the presence of transmembrane domains, and whether the sORF encodes an ER-docking protein are employed as distinguishing SSP characteristics for selecting potential sORFs within the NCDS (
Figure 2B). It is worth noting that only a tiny fraction of sORFs can be categorized into known gene families, while most remain unclassifiable. This issue may be attributed to undiscovered peptide families and the lack of comprehensive sORF annotation, which could lead to false positives. Together, this represents a novel approach to plant peptide identification, highlighting the potential of bioinformatics in unraveling SSPs. This method offers a promising direction for future research, utilizing bioinformatics to identify and characterize secretory peptides, thereby advancing our understanding of their roles in plant biology.
1.5. Tyrosine-Sulfated Peptides
Sulfated peptides are plant hormone-like molecules known for their potent activity even at nanomolar concentrations. Four distinct sulfated peptides have been characterized in plants—PSK, PSY, CIF and RGF. These peptides mediate growth and development processes by activating specific signal transduction pathways.
1.6. PSK Peptides
PSK peptides, originating from 80 to 120 amino acid preproproteins, were first identified in asparagus suspension cell cultures as growth promoters (Matsubayashi and Sakagami, 1996; Kutschmar et al., 2008; Ding et al., 2023). They are widely distributed across plants, including angiosperms and gymnosperms, with several members in various species sharing significant sequence similarity and a conserved C-terminal domain (Sauter, 2015; Zhang et al., 2018; Geng et al., 2020; Kou et al., 2020). PSK maturation involves tyrosine sulfation and proteolytic cleavage, facilitated by plant-specific enzymes such as tyrosylprotein sulfotransferases (TPSTs) and subtilisin serine proteases (Komori et al., 2009; Stührwohldt et al., 2021).
PSK peptides play crucial roles in various aspects of plant growth (Kutschmar et al., 2008; Reichardt et al., 2020; Geng et al., 2020). Synthetic PSK-α peptide treatment significantly enhances root and hypocotyl elongation in Arabidopsis, mediated by the PSK RECEPTOR (PSKR) (Bendahmane et al., 2011; Wang et al., 2015). Loss-of-function mutants of PSKR and AtTPST, exhibit defects in root growth and cell size reduction, confirming the importance of PSKs in promoting growth (Matsubayashi et al., 2006; Bendahmane et al., 2011; Fang et al., 2024). BRI1-ASSOCIATED RECEPTOR KINASE 1 (BAK1)/SOMATIC EMBRYOGENESIS RECEPTOR-LIKE KINASEs (SERKs) act as co-receptors in the PSK signaling pathway, they are not directly involved in the binding of PSK (Li et al., 2002; Wang et al., 2015). Furthermore, PSK regulates plant reproductive development, promoting quorum sensing during pollen germination and pollen tube growth (Chen et al., 2000; Huang et al., 2014; Kou et al., 2020). Specific PSK expression patterns in soybean seeds and overexpression of PSK genes in wheat enhance seed and grain growth, respectively (Yu et al., 2019a; Geng et al., 2020).
PSK also modulates fruit ripening and nutrient accumulation, with exogenous PSK application promoting tomato fruit ripening via PSKR-dependent pathways (
Figure 3A; Fang et al., 2024). Exogenous application of SlPSK5 enhances tomato fruit ripening, but this effect is lost in
pskr1 mutants, indicating PSKR dependency (Fang et al., 2024). PSK also impedes the ubiquitination alteration of its receptor by PUB12/13, amplifying the stability of PSKR1 in tomatoes (
Figure 3A; Hu et al., 2023). Activation of the SlPSK5-SlPSKR module phosphorylates the transcription factor DREBF2, boosting the expression of ripening-related genes (
Figure 3A; Fang et al., 2024). A recent investigation elucidated that PSK orchestrates the equilibrium between growth and defense mechanisms in tomatoes by phosphorylating distinct amino acid residues of GLUTAMINE SYNTHETASE 2 (GS2) via the action of CALCIUM-DEPENDENT PROTEIN KINASE 28 (CPK28) (
Figure 3A; Ding et al., 2023). In summary, PSK peptides are critical in root development, pollen germination, and fruit ripening processes.
1.7. PSY Peptides
PSY peptides were initially discovered in Arabidopsis suspension culture cells. In Arabidopsis, nine
PSY genes encoding PSY precursor proteins with lengths ranging from 71 to 104 amino acids have been identified (Amano et al., 2007; Ogawa-Ohnishi et al., 2022). The mature PSY1 peptide comprises 18 amino acids, featuring sulfation modification on Tyr2 and hydroxylation modifications on Pro16 and Pro17 (Tost et al., 2021). Recent studies have revealed that PSY–PSYR signaling is a switch to mediate the trade-off between plant growth and stress response (Ogawa-Ohnishi et al., 2022). PSYR1, PSYR2, and PSYR3 act as redundant negative regulators of plant growth in the absence of the PSY peptide, while PSY peptides suppress PSYR signaling, promoting root growth (
Figure 3B; Ogawa-Ohnishi et al., 2022). PSYR1 can intricately interact with and enhance the activity of AHA2 by phosphorylating its T881 residue. Activation of the plasma membrane H
+-ATPase AHA2 by PSY1 occurs in a PSYR1-dependent manner, leading to extracellular acidification and promoting hypocotyl elongation (
Figure 3B; Fuglsang et al., 2014; Pei et al., 2022). Thus, the growth-promoting function of PSY family peptides may represent a delicate balance between growth and stress resilience (Amano et al., 2007; Ogawa-Ohnishi et al., 2022). Overall, the interaction between PSY and PSYR is crucial in plant growth and development.
1.8. CIFs and TWS1
CIFs play a crucial role in regulating the integrity of the plant Casparian strip (Nakayama et al., 2017; Fujita, 2021)). In Arabidopsis, five
CIF genes—
CIF1,
CIF2,
CIF3,
CIF4, and
TWS1—encode post-translationally modified peptides. Their receptors, GASSHO 1 (GSO1) and GSO2, belong to the leucine-rich repeat receptor-like kinase family (Tsuwamoto et al., 2008). CIF1 and CIF2 share 21 conserved amino acids at the C-terminus and contain sulfation and hydroxylation sites crucial for binding to GSO1 (Doll et al., 2020; Okuda et al., 2020; Fujita, 2021). The structure of the GSO1-CIF2 complex reveals an interaction that stabilizes the receptor/peptide binding, forming the GSO pathway (
Figure 3C; Doll et al., 2020; Royek et al., 2022). After sulfation by tyrosylprotein sulfotransferases (TPSTs) in the vascular bundle, CIF1 and CIF2 are transported to endodermal cells, where they bind to the receptor complex GSO1 and SERK family proteins (
Figure 3C; Zhang et al., 2022). CIF transport halts upon Casparian strip completion, terminating the GSO pathway and lignin synthesis. Additionally, the GSO pathway regulates CASP structural domain formation and suberin synthesis in cork tissue (Doll et al., 2020; Hohmann et al., 2020; Okuda et al., 2020).
CIF3 and CIF4 are crucial in regulating pollen development; CIF3 and CIF4 precursor proteins require cleavage by the protease SBT5.4 (Truskina et al., 2022). SBT5.4 is expressed in pollen. Mature CIF3 or CIF4 molecules are detectable by GSO1/GSO2 receptors, thereby initiating the process of pollen wall development (
Figure 3C; Truskina et al., 2022). TWS1, expressed in the embryo, exhibits weaker sequence similarity to the core region of CIF1 (Royek et al., 2022). Its precursor is directly processed at the C-terminus by the ALE1 peptidase, allowing it to cross the cuticle barrier and re-enter the embryo (Royek et al., 2022). It interacts with GSO1 and GSO2 receptors to establish cell-cell connections during embryonic epidermis formation (
Figure 3C; Moussu et al., 2017). However, few studies have examined the processing and modification of CIF peptides in plants, indicating a need for further comprehensive research. However, few studies have examined the processing and modification of CIF peptides in plants, indicating a need for further comprehensive research.
1.9. RGF/GLV/CLEL Peptides
The RGF peptides, also known as GOLVEN (GLV) or CLE-like (CLEL) peptide family, consist of 13 amino acids and originate from precursor peptides of approximately 100 amino acids (Meng et al., 2012; Shinohara et al., 2021). These sulfated peptides typically feature the characteristic DY-motif, except for GLV9, and a highly conserved hydroxylated proline residue. Across plant species, RGFs exhibit a combination of conserved and non-conserved amino acids (Komori et al., 2009; Shinohara et al., 2021). Biologically active RGF peptides undergo tyrosine sulfation and proteolytic cleavage (Shinohara et al., 2021).
RGFs play a crucial role in root development by regulating the apical meristem. Recent studies have shown that in vitro treatments with GLVs and RGFs can significantly inhibit lateral root formation by impeding auxin accumulation during lateral root initiation (Jourquin et al., 2023). Specifically, RGF peptides modulate root apical meristem activity by regulating the expression of PLETHORA (PLT) proteins, which serve as master regulators of root formation (
Figure 3D; Aida et al., 2004; Hsiao et al., 2024). Most
RGF genes are expressed in quiescent center cells and adjacent cells in roots, leading to the anticipation of peptide diffusion in the meristematic region with a gradient (Matsuzaki et al., 2010; Ou et al., 2016). This gradient of RGF peptides delineates the gradient of PLT protein, modulating protein stability to ensure resilient root growth and development in dynamic environments (Shinohara et al., 2021). Researchers have found that the RGF and their receptor pair (RGF-RGFR) orchestrate the concentration gradient of the PLT protein to maintain root apical meristem activity (
Figure 3D; Aida et al., 2004; Galinha et al., 2007).
In Arabidopsis, three RGF receptors (RGFRs), namely RGFR1, RGFR2, and RGFR3, have been identified in proximal meristematic tissues (Shinohara et al., 2016; Yamada et al., 2020). It has been documented that the receptor kinases RGI4/SKM2 and RGI5 are involved in RGF perception in conjunction with RGI1/RGFR1, RGI2/RGFR2, and RGI3/RGFR3 (Ou et al., 2016; Shinohara et al., 2016; Yamada et al., 2020). The quintuple mutant
rgi1,2,3,4,5 shows a significant decrease in root apical meristem activity and complete insensitivity to RGF (Ou et al., 2016). Moreover, it has been confirmed that
RGI4/SKM2, a paralog of RGI3/RGFR3, directly binds to RGF (
Figure 3D; Song et al., 2016). Subsequent studies revealed that BAK1 and its paralogs, the SERKs, act as coreceptors of RGI1 for sensing RGF1 (
Figure 3D; Song et al., 2016; Ou et al., 2022). Upon recognition by the RGI1-SERKS complex at the cellular periphery, the RGF1 signal is relayed to PLTs through a YODA–MKK4/5–MPK3/6 signaling cascade (
Figure 3D; Lu et al., 2020; Shao et al., 2020). However, the direct interaction between RGI5 and RGF has not yet been demonstrated. These findings underscore the pivotal role of the RGF-RGFR module in root development by regulating the apical meristem.
1.10. CLE Peptides
CLE peptides, crucial plant signaling molecules, interact with membrane-bound receptors, modulating transcription factors and phytohormone pathways (Goad et al., 2016; Hake et al., 2019). Notably, specific CLE peptides like CLV3, TDIF, CLE40, and CLE45 play vital roles in plant development and response to stimuli (Willoughby and Nimchuk, 2021; Takahashi et al., 2018). Subsequent sections of this review will delve deeper into their regulatory roles and signaling mechanisms in plant growth and development.
1.11. CLV3
The CLAVATA3 (CLV3) peptide, crucial for cell fate determination in Arabidopsis’ shoot apical meristem, has two mature forms identified via mass spectrometry: a 12-amino acid peptide with hydroxyproline (Hyp) residues at positions 4 and 7, and a 13-amino acid arabinosylated form with a histidine at position 13 (Zeng et al., 2017; Okamoto et al., 2022). These mature CLE peptides are recognized by the CLV1/CLV2 receptors, which consist of a leucine-rich repeat receptor-like kinase (LRR-RLK) and a leucine-rich repeat receptor-like protein (LRR-RLP) lacking a kinase domain (Je et al., 2018; Lee et al., 2019). Additionally, receptors CORYNE (CRN) and RECEPTOR-LIKE PROTEIN KINASE 2 (RPK2) are involved in CLV3 signaling (Somssich et al., 2015). Recent studies show that CLV1, CLV2, and CRN form a receptor complex for CLV3 (
Figure 4; Somssich et al., 2015).
RPK2, also known as TOADSTOOL 2 (TOAD2), mediates the CLV3 signaling cascade by associating with BAM1 (Han et al., 2020; Wang et al., 2022). The three BARELY ANY MERISTEM (BAM) receptor kinases, clustered within the same clade as CLV1, play a pivotal role in maintaining stem cell preservation along the peripheries of the shoot apical meristem (Somssich et al., 2015; Willoughby and Nimchuk 2021). BAM1 and BAM2 can bind to CLV3, functioning synergistically with CLV1 as supplementary receptors for CLV3. This interaction initiates a downstream phosphorylation cascade via mitogen-activated protein kinase (MAPK) pathways, with MPK3 and MPK6 phosphorylation ultimately regulating stem cell equilibrium (
Figure 4; Li et al., 2019). The CLV3 signal is perceived by multiple receptor complexes—CLV1/CLV1 homomeric, CLV2/CRN heteromeric, and possibly RPK2 homomeric—negatively regulating WUSCHEL, a master regulator in plant growth signaling, thereby controlling the shoot apical meristem (
Figure 4; Somssich et al., 2015; Wang et al., 2022).
1.12. TDIF
TDIF, initially derived from
Zinnia elegans, regulates plant vascular meristem development (Ito et al., 2006). In Arabidopsis, the
CLE41 and
CLE44 genes share a 12-amino-acid sequence within their CLE domains, with CLE44 resembling TDIF but lacking glycosylation (Ohyama et al., 2008). Treatment with TDIF inhibits xylem cell differentiation while promoting procambial cell proliferation, highlighting its role in plant development (Hirakawa et al., 2008). The receptor for TDIF, TDR, belongs to the LRR-RLK XI subfamily and plays a pivotal role in maintaining vascular meristem (Morita et al., 2016; Smit et al., 2020; Yang et al., 2020). In a manner parallel to the CLV3 signaling pathways, the WUS family protein WUS-RELATED HOMEOBOX 4 (WOX4) acts as a mediator of TDIF signaling by facilitating the proliferation of procambial cells (
Figure 4; Hirakawa et al., 2010). Downstream effectors of the TDIF pathway, glycogen synthase kinase 3 proteins (GSK3s), suppress xylem differentiation via the WOX4 pathway (
Figure 4; Hirakawa et al., 2010). TDR activates GSK3s at the plasma membrane in a TDIF-dependent manner (Hirakawa et al., 2010). Inhibition of GSK3s induces xylem cell differentiation through the transcription factor BRI1-EMS SUPPRESSOR 1 (BES1) (Hirakawa et al., 2010; Zhang et al., 2016). Additionally, a NAC domain transcription factor, XVP, localizes on the plasma membrane, interacting with the TDIF co-receptor PXY-BAK1 to form a complex (
Figure 4; Yang et al., 2020). XVP expression is localized in the cambium and is critical in regulating xylem differentiation and vascular patterning (Yang et al., 2020). Subsequent investigations elucidated that XVP governs xylem differentiation via the pivotal factor VASCULAR-RELATED NAC-DOMAIN6 (VDN6). Notably, the conservation of the CLE41/TDIF-TDR-WOX4 signaling pathway in wood-forming Populus species suggests its role as a universal regulator of vascular development (
Figure 4; Yamaguchi et al., 2016).
1.13. CLE40
CLAVATA3/ESR-RELATED40 (CLE40), the closest homolog of CLV3 in Arabidopsis, regulates columella stem cells (CSCs) in the root meristem (Hobe et al., 2003; Zhu et al., 2021). CSCs, located at the root apices, perceive gravitational stimuli and have a unique cell wall composition that protects the root meristem (Hobe et al., 2003). CLE40 expression in CSCs controls cell division, as evidenced by
cle40 mutants displaying excessive CSC layers (Stahl et al., 2013). Genetic studies indicate that CLE40 functions through the receptor kinases ARABIDOPSIS CRINKLY4 (ACR4) and CLAVATA1 (CLV1) to regulate WOX5 expression (
Figure 4; Berckmans et al., 2020). ACR4, but not CLV1, is transcriptionally upregulated following ectopic CLE40 peptide treatment (Sarkar et al., 2007; Qin et al., 2019). ACR4 is also implicated in forming the epidermal cell layer and initiating lateral roots (
Figure 4; De Smet et al., 2008). CLE40’s action is limited to cortex cells, whereas WOX5 expression in the quiescent center positively regulates stem cell production, suggesting a balance between CLE40 and WOX5 in regulating the stem cell population of the root meristem (
Figure 4; Stahl et al., 2013; Richards et al., 2015; Berckmans et al., 2020).
1.14. CLE45
The interaction between CLE45 and BAM3 establishes a signaling module that regulates vascular development independently of the TDIF-TDR pathway (Yuan and Wang 2021). Synthetic CLE45 peptide inhibits root growth and protophloem differentiation in Arabidopsis (Kang and Hardtke, 2016). BAM3 acts as a receptor for CLE45, as the impact of synthetic CLE45 is mitigated in a
bam3 mutant (Hazak et al., 2017). The class II LRR-RLK, known as CLE-RESISTANT RECEPTOR KINASE (CLERK), also termed CIK2, is indispensable for the complete perception of CLE45 and other CLE peptides active in early proto-phloem development (Anne et al., 2018). However, CLERK operates within a genetically distinct pathway from BAM3 and CLV2/CRN. Notably, MEMBRANE-ASSOCIATED KINASE REGULATOR 5 (MAKR5) emerges as a pivotal facilitator of CLE45 signaling, operating downstream of BAM3. CLE45 signaling recruits MAKR5 to the plasma membrane and promotes its accumulation in developing sieve elements (
Figure 4; Kang and Hardtke, 2016). This signaling module interacts genetically with plasma membrane-localized proteins BREVIS RADIX (BRX) and OCTOPUS (OPS), which are critical for sieve-element (SE) cell fate control (Rodriguez et al., 2014; Anne et al., 2015). Deletion of BRX or OPS leads to gap cell development in the protophloem due to interference with interactions between CLE45 signaling receptors BAM3 and the CLV2/CRN complex.
brx and
ops mutants also exhibit abbreviated root characteristics and increased lateral root production from primary root branching zones (Anne et al., 2015; Rodriguez-Villalon et al., 2014). The CLE45-SKM1/SKM2 pathway facilitates intercellular communication between male and female cells, aiding pollen tube growth under heat stress and fertilization (Kang and Hardtke, 2016). While the regulatory role of CLE in root and pollen tube development is documented, the detailed molecular framework and specific signaling components governing these processes remain unclear.
In addition to the previously mentioned CLE peptides, various others exhibit diverse functionalities. For instance, peptides CLE1, 3, 4, and 7 suppress lateral root formation via the CLV1 receptor under nitrate deficiency (Araya et al., 2014). CLE18 and CLE26 are associated with root growth (Meng et al., 2012b; Czyzewicz et al., 2015). The CLE5 and CLE6 complexes influence B-type CLE and shoot development (Whitford et al., 2008). CLE8 regulates embryo and endosperm development (Xie et al., 2022), while CLE10 influences protoxylem vessel formation (Soyano et al., 2014). Overall, CLE peptides are crucial in various aspects of plant growth and development.
1.15. Other SPs
1.15.1. EPF/EPFL Peptides
The EPF/EPFL peptides are characterized by abundant cysteine residues and an N-terminal signal peptide (Kosentka et al., 2019). Upon proteolytic cleavage, mature EPF/EPFL peptides feature a conserved C-terminal region, typically containing six or eight cysteine residues that facilitate intramolecular disulfide bond formation (Tavormina et al., 2015; Bessho et al., 2016). EPF1 and EPF2, which share significant homology, interact with the ERL1 and ERL2 receptors, respectively (Hara et al., 2007). EPF/EPFL peptides are known to negatively regulate stomatal density negatively, thereby inhibiting stomatal development (Tavormina et al., 2015; Kosentka et al., 2019). Additionally, several Arabidopsis EPF members, including EPF1, EPF2, and EPFL9, regulate stomatal development by binding to a complex formed by TMM and ER receptor kinases (
Figure 5A; Lin et al., 2017). EPFL9 (STOMAGEN) competes with EPF2 for ER receptor binding and fine-tuning stomatal development (
Figure 5A; Yin et al., 2017; Lin et al., 2017). In contrast, CHALLAH (CHAL)/EPFL6 exhibits low expression in leaves and the stomatal lineage (Uchida et al., 2012). Although single
chal mutants and triple mutants do not display any discernible phenotype in stomatal development, CLL2/EPFL4 and CHAL/EPFL6 are predominantly expressed in the endodermis of the inflorescence stem, where they regulate proper stem architecture (
Figure 5A; Uchida et al., 2012; Kosentka et al., 2019). Consequently, CLL2/EPFL4 and CHAL/EPFL6 primarily act on inflorescence stem growth regulation (Kosentka et al., 2019). Recent research has elucidated that EPFL4-6 interacts with the ER-SERK receptor complex, activating the downstream MKK5/6-MPK6 cascade, thereby dynamically regulating cellular proliferation in stamen filaments (
Figure 5A; He et al., 2023). Despite the incomplete understanding of EPF/EPFL-associated signaling pathways, research suggests that many EPF peptides exert a negative regulatory effect on stomatal density, thereby inhibiting stomatal development (Kosentka et al., 2019; He et al., 2023).
1.15.2. LURE Peptides
LUREs, the first identified pollen-tube attractants found in Torenia fournieri’s synergid cells, belong to the defensin-like cysteine-rich peptide family (Okuda et al., 2009). Among the sixteen cysteine-rich peptides isolated from these cells, TfCRP1 and TfCRP3, known as LURE1 and LURE2, act as pollen tube attractants (Maruyama and Higashiyama, 2016). Despite variations in their sequences, all LUREs share a conserved pattern of six cysteine residues for disulfide bond formation. However, comparisons across species, such as Torenia and Arabidopsis, reveal that the alignment of cysteine residues is the only conserved feature (Okuda et al., 2009; Maruyama and Higashiyama, 2016).
A recent study identified pollen-specific receptor kinase 6 (PRK6) as a crucial receptor for sensing the attractant peptide AtLURE1 (
Figure 5B; Takeuchi and Higashiyama, 2016). It was observed that pollen tubes from
prk loss-of-function mutants could not respond to AtLURE1 (Takeuchi and Higashiyama, 2016). PRK6, consisting of six leucine-rich repeats (LRRs), belongs to a subclade containing eight PRK family receptors (PRK1–8) in Arabidopsis (Takeuchi, 2021). Interestingly, studies on PRK family receptors in tomatoes and Arabidopsis have indicated their functional roles in regulating the efficiency of pollen tube growth (Yang et al., 2022). The results suggest that
prk mutants (
prk3 prk6 prk8 and
prk1 prk3 prk6) demonstrate a dramatic decrease in pollen tube growth and fertility, while individual
prk mutants do not exhibit such pronounced effects (Yang et al., 2022). Although the exact relationship with PRKs remains unclear, a separate group of pollen-expressed RLKs, including MALE DISCOVERER1 (MDIS1), MDIS2, MDIS1-INTERACTING RECEPTOR LIKE KINASE1 (MIK1), and MIK2, have also been implicated in the sensing of AtLURE1 (
Figure 5B; Takeuchi, 2021; Yang et al., 2022). Even though the downstream signaling pathway of LUREs remains incompletely elucidated, we can still infer that PRK6, in conjunction with the multiple receptor components mentioned earlier, may serve as a pivotal player in regulating pollen tube growth and attraction by detecting extracellular ligands, such as AtLURE1.
1.15.3. RALF Peptides
RALF was initially discovered in tobacco plants for its ability to raise the pH of the culture medium quickly (Pearce et al., 2001). RALFs are widespread in various plant species, including dicots, monocots, and gymnosperms. In Arabidopsis, 40 RALF and RALF-like (RALFL) genes have been identified, showing diverse expression patterns in different organs and tissues (Pearce et al., 2001; Germain et al., 2005). RALF belongs to the cysteine-rich peptide family in plants and is characterized by four conserved cysteine residues forming two disulfide bonds, crucial for proper folding (Abarca et al., 2021). Upstream of the mature RALFs N-terminus are a pair of arginine residues, serving as cleavage sites for peptidases and coordinating immune responses (Datta et al., 2024). Downstream, the YISY motif is crucial for receptor binding (Matos et al., 2008). The C-terminal sequences GASYY and RCRR(S) of RALFs might stabilize the peptide structure or mediate interactions between peptides (Campbell and Turner, 2017).
Catharanthus roseus receptor-like kinase 1-like (CrRLK1L) proteins recognize RALFs, influencing plant growth (Xiao et al., 2019). For instance, the binding of RALF1 to FERONIA (FER), a member of the CrRLK1L family, suppresses lateral root growth, where BAK1 forms a receptor complex with FER (
Figure 5C; Haruta et al., 2014; Stegmann et al., 2017). Additionally, RALFs regulate plant reproductive development. RALF4 and RALF19, expressed in pollen, interact with the ANXUR1/2 (ANX1/2)-BUPS1/2 (Buddha’s paper seal 1/2) complex, influencing pollen tube growth (
Figure 5C; Ge et al., 2017). As the pollen tube approaches the embryo sac, RALF34 competes with RALF4 and RALF19 for binding to the ANX1/2-BUPS1/2 complex, leading to pollen tube rupture and sperm release, thereby completing double fertilization (
Figure 5C; Franck et al., 2018; Galindo-Trigo et al., 2020). Moreover, LORELEI-like-GPI anchored proteins 2 and 3 (LLG2/3) serve as co-receptors alongside ANX1/2 and BUPS1/2, facilitating their function (
Figure 5C; Ge et al., 2019). When the pollen tube grows towards the embryo sac, it was documented that RALF4/19 instigated the ANX1/2-BUPS1/2-LLG2/3-MARIS cascade, subsequently activating AtMLO1, 5, 9, and 15 (
Figure 5C; Gao et al., 2023). This activation facilitated tip-focused Ca
2+ influx, upholding pollen tube integrity (
Figure 5C; Gao et al., 2023). Together, RALFs play crucial regulatory roles mediated through receptor interactions, impacting processes ranging from root development to reproductive success.
1.15.4. CEP Peptides
CEP peptides originate from proteins featuring an N-terminal secretion signal, a variable domain, and one or more CEP domains. These peptides, each spanning 15 amino acids, are reportedly essential in plant growth and development (Taleski et al., 2018; Ota et al., 2020;Taleski et al., 2023). For instance, AtCEP3 negatively regulates lateral root number in Arabidopsis, as demonstrated by increased lateral root density in
cep3 mutant plants compared to wild-type under various nutrient-deficient and abiotic stress conditions (Imin et al., 2013). Additionally, CEPs are crucial in negatively regulating lateral root emergence in
Medicago truncatula. Overexpression of
MtCEP1 reduces lateral root number, while simultaneous knockdown of
MtCEP1 and
MtCEP2 increases lateral root number (
Figure 5D; Imin et al., 2013). CEP5 regulates the auxin response in lateral root formation by enhancing AUX/indole-3-acetic acid (IAA) stability, thereby suppressing the expression of auxin-responsive genes (
Figure 5D; Roberts et al., 2016).
In recent studies, overexpression of
GhCEP46-D05 in cotton and Arabidopsis resulted in reduced plant height, fiber length, root length, and the length and width of mature seeds in transgenic lines (
Figure 5D; Mei et al., 2024). Notably, CEP peptides impact plant development by specifically binding to CEP RECEPTORS 1/2 (CEPR1/CEPR2) kinases (Bellegarde et al., 2017; Chapman et al., 2019). The CEP-CEPR1 pathways inhibit lateral root growth in the presence of sucrose, other metabolizable sugars, and increased light intensity by reducing lateral root meristem size and the length of mature lateral root cells (
Figure 5D; Chapman et al., 2019). Recent studies have revealed that CEP and cytokinin signaling mediate the action of CEP DOWNSTREAM (CEPD) glutaredoxins, thereby inhibiting primary root growth (
Figure 5D; Taleski et al., 2023). Despite some literature outlining the functional roles of the CEP-CEPR signaling pathway in plant lateral root development, further elucidation is needed to understand the comprehensive regulatory network of this pathway in plant development (Chapman et al., 2019; Taleski et al., 2023).
1.15.5. IDA Peptides
IDA peptides, a class of small signaling proteins, are generated via the proteolytic cleavage of their precursor protein catalyzed by subtilases (Katharina et al., 2016; Stintzi and Schaller, 2022). They possess an N-terminal secretory signal peptide, guiding the protein to the extracellular milieu, while the C-terminal proline-rich extended motifs, known as the EPIP motif, comprised of 20 amino acids, play a pivotal role in inducing the abscission of floral organs post-pollination (Stenvik et al., 2008; Vie et al., 2017). Reportedly, IDA peptides regulate the shedding of cauline leaves in response to dehydration stress (Taylor et al., 2019). Under water deficit conditions, when leaves show signs of wilting, the bioactive IDA peptide ligand is perceived by a receptor complex composed of either the receptor kinases HAESA or HAESA-like 2 (HSL2) along with SERK coreceptors (
Figure 5E; Stenvik et al., 2008; Taylor et al., 2019). This recognition event initiates a downstream signaling cascade involving MAPKs, ultimately governing the expression of cell-wall modifying and hydrolytic enzymes, particularly polygalacturonases and xyloglucan endotransglucosylase/hydrolases (
Figure 5E; Stenvik et al., 2008; Taylor et al., 2019). These enzymes degrade the middle lamella, thereby inducing cell separation.
Furthermore, corroborating evidence confirms that IDA expression occurs shortly after abscission zone cells attain receptivity to abscission signals, signifying early loosening of cell walls (Vie et al., 2015). Moreover, overexpression of the IDA protein leads to the abscission of floral organs and the enlargement of abscission zone cells precisely at the sites where organs detach from the plant(Ying et al., 2016). These discoveries suggest that the IDA peptide likely orchestrates the initial relaxation of cell walls and subsequent detachment of abscission zone cells, possibly by regulating cell wall remodeling enzymes (Ying et al., 2016). The initial cell wall relaxation primarily occurs in the root cap and inflorescence, ultimately leading to their detachment (Geng et al., 2022; Wang et al., 2023). Furthermore, it has been observed that the overexpression of the IDA phenotype depends on the presence of the protein’s N-terminal secretion signal (Dong et al., 2022). No overexpression phenotypes are evident when IDA is overexpressed without a secretion signal (Dong et al., 2022). This finding emphasizes the vital role of IDA translocation to the apoplastic space in its functionality. In summary, IDA-mediated signaling pathways play essential roles in promoting plant organ abscission.
2. Conclusions and Perspectives
Small peptides, consisting of 5-100 amino acids, are crucial signaling molecules in plants, exhibiting diverse processing, folding, and modifications (Zhang and Zhang, 2022). Advances in sequencing, informatics, molecular biology, genetics, and biochemistry have made it possible to identify and understand these peptides’ functions (Feng et al., 2023). These peptides regulate key plant processes such as cell proliferation, tissue differentiation, organ formation, reproductive development, maturation, and senescence, and they are vital for responding to environmental stresses. Despite progress, challenges remain due to their short length and low expression levels. Recent advancements in genomics and transcriptomics have expanded research from model plants like Arabidopsis and rice to economically important crops, leading to the identification and characterization of more functional peptides (Holzwart et al., 2018; Huang et al., 2016).
In conclusion, small plant peptides are a promising emerging research area. Simplified, accurate, and rapid methods for identifying and verifying their functions are urgently needed. Future developments could enable the quantitative production of plant peptide hormones to regulate growth, enhance stress resistance, improve yields, and address challenges in eco-agriculture. Continued innovation in this field could revolutionize agriculture and significantly contribute to sustainable food production.
Figure 1.
Classification of Small Peptides in Plants. Small peptides (SPs) in plants fall into two categories: precursor-derived and nonprecursor-derived. Precursor-derived peptides can be functional or nonfunctional; functional ones exhibit their role post-transport, while nonfunctional ones may undergo posttranslational modifications and are categorized as Cys-rich or non-Cys-rich/non-PTM. Nonprecursor-derived peptides are encoded by small open reading frames (sORFs) containing fewer than 100 amino acids, typically found in (a) the 5′ upstream region of the main open reading frame (ORF), (b) transcripts not exceeding 100 amino acids, or (c) primary transcripts of microRNAs (pri-miRNAs).
Figure 1.
Classification of Small Peptides in Plants. Small peptides (SPs) in plants fall into two categories: precursor-derived and nonprecursor-derived. Precursor-derived peptides can be functional or nonfunctional; functional ones exhibit their role post-transport, while nonfunctional ones may undergo posttranslational modifications and are categorized as Cys-rich or non-Cys-rich/non-PTM. Nonprecursor-derived peptides are encoded by small open reading frames (sORFs) containing fewer than 100 amino acids, typically found in (a) the 5′ upstream region of the main open reading frame (ORF), (b) transcripts not exceeding 100 amino acids, or (c) primary transcripts of microRNAs (pri-miRNAs).
Figure 2.
Identification of Small Peptides in Plants. (A) The workflow includes four steps: extracting and enriching the low molecular weight (MW) fraction of the secreted proteome, fractionating or isolating low MW fractions, testing SSP bioactivity, and analyzing SSP fraction libraries using high-resolution/high-mass accuracy LC-MS/MS with de novo search strategies. (B) Recent bioinformatics methods for peptide identification follow a comprehensive approach. Initially, proteins are predicted based on size criteria, with SignalP predicting N-terminal signal sequences (NSS) and TMMHMM identifying and excluding transmembrane domains. ER domains are targeted to eliminate proteins associated with endoplasmic reticulum docking. This process identifies Putative SPs for further validation. Similarly, A similar strategy is applied to identify Noncoding Small Secreted Peptides (NCDS), initially identifying potential SSP-encoding small open reading frames (sORFs) and validating them using similar predictive tools.
Figure 2.
Identification of Small Peptides in Plants. (A) The workflow includes four steps: extracting and enriching the low molecular weight (MW) fraction of the secreted proteome, fractionating or isolating low MW fractions, testing SSP bioactivity, and analyzing SSP fraction libraries using high-resolution/high-mass accuracy LC-MS/MS with de novo search strategies. (B) Recent bioinformatics methods for peptide identification follow a comprehensive approach. Initially, proteins are predicted based on size criteria, with SignalP predicting N-terminal signal sequences (NSS) and TMMHMM identifying and excluding transmembrane domains. ER domains are targeted to eliminate proteins associated with endoplasmic reticulum docking. This process identifies Putative SPs for further validation. Similarly, A similar strategy is applied to identify Noncoding Small Secreted Peptides (NCDS), initially identifying potential SSP-encoding small open reading frames (sORFs) and validating them using similar predictive tools.
Figure 3.
The signaling pathways of four sulfated peptides. (A) The PSK signaling pathway regulates plant growth, development, and fruit ripening. (B) The interaction between PSYs and PSYR governs the regulation of root elongation. (C) CIFs interact with the CSOs-BAK1 co-receptor to modulate the formation of the plant’s Casparian strip, pollen wall development, and embryonic cuticle formation. (D) The RGF signaling pathway modulates root meristem development and elongation.
Figure 3.
The signaling pathways of four sulfated peptides. (A) The PSK signaling pathway regulates plant growth, development, and fruit ripening. (B) The interaction between PSYs and PSYR governs the regulation of root elongation. (C) CIFs interact with the CSOs-BAK1 co-receptor to modulate the formation of the plant’s Casparian strip, pollen wall development, and embryonic cuticle formation. (D) The RGF signaling pathway modulates root meristem development and elongation.
Figure 4.
The processes modulated by CLE signaling pathways and their impact on cells. CLEs bind to receptors, including CLV2, CLV3, BAMs, TDR, CIK2, and ACR4, triggering recruitment of co-receptors CRN, RPK2, and BAK1 to the complex. These initial interactions initiate downstream signaling cascades regulating SAM homeostasis, xylem differentiation, cambial cell division, protophloem differentiation, columella stem cells, and lateral root development. Arrows and bars represent positive and negative regulation, while solid and dashed lines denote direct and indirect regulation.
Figure 4.
The processes modulated by CLE signaling pathways and their impact on cells. CLEs bind to receptors, including CLV2, CLV3, BAMs, TDR, CIK2, and ACR4, triggering recruitment of co-receptors CRN, RPK2, and BAK1 to the complex. These initial interactions initiate downstream signaling cascades regulating SAM homeostasis, xylem differentiation, cambial cell division, protophloem differentiation, columella stem cells, and lateral root development. Arrows and bars represent positive and negative regulation, while solid and dashed lines denote direct and indirect regulation.
Figure 5.
The signaling transduction pathways and functional implications of EPF/EPFL, CEP, RALF, LURE, and IDA/IDL. (A) EPFs modulate stomatal development by binding to the ER-TMM co-receptor and regulate stamen filament proliferation by interacting downstream with the ER-SERK co-receptor. (B) The binding of LURE1 to its receptor promotes the growth of pollen tubes. (C) Activation of downstream signaling pathways by the binding of RALFs to their receptors regulates the development of pollen tubes and the elongation of root cells. (D) AtCEP3 and AtCEP5 interact with CEPRs to initiate signaling cascades, thereby modulating lateral root growth. GhCEP46 binds to CEPRs to inhibit fiber length and plant height. (E) IDA/IDL’s binding with the HSL2-SERK co-receptor triggers downstream signal transduction, facilitating root cap and inflorescence detachment.
Figure 5.
The signaling transduction pathways and functional implications of EPF/EPFL, CEP, RALF, LURE, and IDA/IDL. (A) EPFs modulate stomatal development by binding to the ER-TMM co-receptor and regulate stamen filament proliferation by interacting downstream with the ER-SERK co-receptor. (B) The binding of LURE1 to its receptor promotes the growth of pollen tubes. (C) Activation of downstream signaling pathways by the binding of RALFs to their receptors regulates the development of pollen tubes and the elongation of root cells. (D) AtCEP3 and AtCEP5 interact with CEPRs to initiate signaling cascades, thereby modulating lateral root growth. GhCEP46 binds to CEPRs to inhibit fiber length and plant height. (E) IDA/IDL’s binding with the HSL2-SERK co-receptor triggers downstream signal transduction, facilitating root cap and inflorescence detachment.