Glossary
Initiating ribosome: a ribosome which integrates the interaction of initiation factors, mRNA and initiator tRNA (Formyl-Met-tRNA) for optimal start codon selection, and concomitantly reading frame determination for translation of an mRNA sequence into a protein product [
1].
N-terminal proteomics (N-terminomics): the discipline of mapping protein N-termini of a given proteome sample, also known as the N-terminome. Next to representing the start of proteolytically processed proteins, N-termini may also serve as proxies of translation initiation, thereby greatly contributing to the process of genome annotation.
Proteoforms: multiple molecular forms of proteins that describe the biological protein variability at the level of protein primary structure and thus protein isoforms that are expressed from a single gene. Therefore, proteoforms generally contribute to the increased complexity of proteomes [
2]. While post-translational modifications can give rise to expressed proteoforms, N-terminal proteoforms specifically point to proteoforms generated co-translationally by alternative translation initiation and/or differential co-translational initiator methionine (iMet) excision.
Retapamulin (RET): a member of the pleuromutilins, a class of antibiotics known for their ability to act as bacterial protein synthesis inhibitors by their specific interaction with the 50S subunit of bacterial initiating ribosomes. Retapamulin is known to obstruct the first steps of translation elongation thereby arresting initiating ribosomes at start codons. By complementing ribosome profiling (Ribo-seq) with retapamulin treatment (Ribo-RET), the signal-to-noise ratio can be improved specifically for calling of (alternative) translation start sites [
3]. However, Gram-negative bacteria, including the important model organisms
Escherichia coli (
E. coli) and
Salmonella Typhimurium (
S. Typhimurium), are partially protected against retapamulin action thanks to their TolC multidrug efflux pump, requiring
tolC deletion strains for optimal Ribo-RET performance [
4]. Lefamulin, however, is a newer pleuromutilin which might eventually outcompete retapamulin for Ribo-RET purposes as for this drug higher activity has been reported in Gram-negative bacteria [
5].
Riboproteogenomics: a term used to refer to the combination of systematic complementary ribosome profiling, proteomics (
e.g. N-terminal proteomics) and genomics for studying translation (initiation) landscapes [
6].
Ribosome profiling (by sequencing) (Ribo-seq): the identification of open reading frames (ORFs) by deep-sequencing of ribosome-protected mRNA fragments [
7]. By relying on ribosome-mRNA binding, Ribo-seq offers a genome-wide view on the translational landscape of an organism (translatome). The standard ribosome profiling procedure in prokaryotes employs chloramphenicol treatment to specifically stall elongating ribosomes [
8], but this step can be omitted by performing flash-freezing of the samples [
5].
Shotgun proteomics: composition analysis of complex peptide samples (obtained either through digestion or small polypeptide-enrichment) through the complementary use of high-performance liquid chromatography (HPLC) and mass spectrometry (MS), for peptide separation and peptide/protein identification, respectively.
sORFeome: the small ORF collection of the ORFeome, which refers to the totality of ORFs harboured by a species’ genome. In this review, the sORFeome considers all ORFs shorter in length than 300 base pairs (bp).
sORFs and SEPs: small in size but not in importance
In the continuous effort to improve bacterial genome annotations, the development of
ribosome profiling by next-generation sequencing [
7],
Ribo-seq (see Glossary) in short, allowed the recent discovery of a plethora of small open reading frames (sORFs). Classified as open reading frames built of no more than 300 nucleotides (nt.), these newly discovered genes potentially give rise to their encoded small proteins; referred to as sORF encoded polypeptides (SEPs). By providing direct evidence of many sORFs harbouring ribosomal activity [
7,
9,
10], Ribo-seq freed these specific ORFs from their status as ‘noise’ during the process of gene prediction and genome annotation. Currently, multiple Ribo-seq datasets have been published for model bacterial species like the Gram-negative
Escherichia coli (
E. coli) [
3,
11,
12,
13] and the Gram-positive
Bacillus subtilis (
B. subtilis) [
14]. Similar efforts were also reported for specific bacterial human pathogens including the model species
Salmonella enterica subspecies
enterica serovar Typhimurium (
S. Typhimurium) [
15,
16,
17] and more recently for
Streptococcus pneumoniae (
S. pneumoniae) [
18],
Mycobacterium tuberculosis (
M. tuberculosis) [
19],
Staphylococcus aureus (
S. aureus) [
20] and
Campylobacter jejuni (
C. jejuni) [
21]. A comprehensive overview of available prokaryotic ribosome profiling studies has been compiled by Vazquez-Laslop
et al [
5] and ribosome profiling traces corresponding to (some of) these and other studies can be consulted via the online genome browser GWIPS-viz [
22].
Since the aforementioned studies report on the discovery of novel, putative sORFs, these recent efforts all contributed to a now exhaustive list of hypothetical bacterial SEPs. In aid of gene annotation, Ribo-seq studies provide translational evidence for only a small subset of
in silico predicted sORFs, making their consideration in genome (re-)annotation efforts more straightforward. However, functional characterization has only been reported for a small portion of putative sORFs and their encoding SEPs, leaving an enormous world of the
sORFeome (see Glossary) uncharted. With documented bacterial SEP functions falling within diverse categories of basic and essential bacterial physiology [
23,
24,
25,
26,
27,
28] as well as infection biology [
29,
30,
31,
32], the need for more large-scale validation and functional characterization efforts is high. In this context, it is noteworthy that difficulties in biochemical detection and therefore validation of SEPs are known and have been extensively documented [
33,
34,
35], but that also recent bacterial SEP validation studies fail to fully address many of the challenges in small protein detection [
35], such as their proposed low expression or low stability [
36]. Nonetheless, as expression detection is a prerequisite for functional investigations, further improvement in SEP detection might turn out to be of great value to expand our current understanding of bacterial (infection) biology. Fundamentally, because of the lack of standard work-flows to go from computationally predicted sORFs to functionally annotated SEPs, a whole piece might be missing out of the puzzle the bacterial life is known to be.
sORFs: the weak spots of automated bacterial genome annotation
The annotation bias is especially challenging when it comes to the detection of sORFs. These coding sequences (CDS) of arbitrarily no more than 150-300 nt. encode small proteins or SEPs, with the interpretation for ‘small’ ranging from smaller than (or equal to) 50 [
53,
54] to 100 amino acids (AA) [
55]. SEPs distinguish themselves from canonical small peptides in the fact that their origin is translational and no proteolytic processing step is required to make them this small. In this way, the typical length cut-offs for gene prediction and annotation, chosen based on the strong belief that genes should be of sufficient length to be functional, turned out to be too short-sighted thereby obstructing sORF identification. Within existing genome annotations, a grant majority of sORFs encode ribosomal SEPs, which are significantly more conserved than newly discovered sORFs [
33]. One of the explanations for their generally lower conservation can be found in their origin, being constant genomic evolution events from larger genes [
33].
In line with previous ORF annotations, newly discovered sORFs can be classified as intergenic sORFs, upstream (overlapping) (regulatory) sORFs (u(o)sORFs), internal (out-of-frame) sORFs (intsORFs) or downstream (out-of-frame) (overlapping) (regulatory) sORFs (d(o)sORFs); a classification based on the relation between the genome-orientation of the newly discovered sORF and existing gene annotations (
Figure 1A) [
33,
36,
56]. For bacterial genomes, the genomic positioning of annotations is especially informative in case of polycistronic mRNAs, frequently encoding gene product(s) with a strong interplay.
Figure 1.
Small ORF (sORF) re-annotations and categorization of unannotated sORFs discovered by riboproteogenomics. A. The annotation of newly discovered sORFs is based on the relation between the genomic location of the novel sORF and existing (s)ORF annotations. Especially for the typical bacterial polycistronic gene organization, positional ORF annotations in the context of transcripts are meaningful as the interaction of the resulting gene products can be regulatory in nature. B. The implementation of riboproteogenomics for genome annotation can also result in the (conditional) re-annotation of previously annotated genes. Under given circumstances, ORFs can appear as 3’ (3’tr(sORF)) or 5’ (5’tr(sORF)) truncated variants, meaning that ORFs can turn into sORF annotations, or sORFs into shorter sORF annotations (resulting in N-terminally truncated SEP translation products). For sORFs, 5’ extensions (5’ext(sORF)) can exist still resulting in N-terminally extended SEP translation products.
Figure 1.
Small ORF (sORF) re-annotations and categorization of unannotated sORFs discovered by riboproteogenomics. A. The annotation of newly discovered sORFs is based on the relation between the genomic location of the novel sORF and existing (s)ORF annotations. Especially for the typical bacterial polycistronic gene organization, positional ORF annotations in the context of transcripts are meaningful as the interaction of the resulting gene products can be regulatory in nature. B. The implementation of riboproteogenomics for genome annotation can also result in the (conditional) re-annotation of previously annotated genes. Under given circumstances, ORFs can appear as 3’ (3’tr(sORF)) or 5’ (5’tr(sORF)) truncated variants, meaning that ORFs can turn into sORF annotations, or sORFs into shorter sORF annotations (resulting in N-terminally truncated SEP translation products). For sORFs, 5’ extensions (5’ext(sORF)) can exist still resulting in N-terminally extended SEP translation products.
Figure 2.
Riboproteogenomics-supported novel and re-annotated Salmonella sORFs. Ribo-seq/Ribo-RET profiles of the Salmonella Typhimurium strain SL1344 are shown.
A. MgtL was delineated as a new sORF in the SL1344 genome, but was found to have matching annotations in related genomes [
15]. Moreover, peptide evidence is available for this MgtA regulatory leader peptide [
15].
B. VapB and VapC, an upstream and downstream osORF respectively, take part in the plasmid-encoded Vap toxin-antitoxin system and were also only recently annotated in the SL1344 genome [
57].
C. For the pseudogene glpR, a 3’ truncated version of GlpR (GlpR*) was predicted with the same start site [
57].
D. CspA was found to have an in-frame upstream alternative start encoding a 5’ extended proteoform (CspA*), supported by peptide evidence [
15]. * is indicative of an ORF re-annotation.
Figure 2.
Riboproteogenomics-supported novel and re-annotated Salmonella sORFs. Ribo-seq/Ribo-RET profiles of the Salmonella Typhimurium strain SL1344 are shown.
A. MgtL was delineated as a new sORF in the SL1344 genome, but was found to have matching annotations in related genomes [
15]. Moreover, peptide evidence is available for this MgtA regulatory leader peptide [
15].
B. VapB and VapC, an upstream and downstream osORF respectively, take part in the plasmid-encoded Vap toxin-antitoxin system and were also only recently annotated in the SL1344 genome [
57].
C. For the pseudogene glpR, a 3’ truncated version of GlpR (GlpR*) was predicted with the same start site [
57].
D. CspA was found to have an in-frame upstream alternative start encoding a 5’ extended proteoform (CspA*), supported by peptide evidence [
15]. * is indicative of an ORF re-annotation.
sORFs were generally overlooked until increasingly more SEPs were identified – rather by chance – across all domains of life as well as viruses [
58]. Moreover, sORFs and their encoding SEPs turned out to be of considerable biological importance for the respective organisms, further strengthened by Lluch-Senar
et al who identified the genomic class of sORFs as being the most frequently essential one in case of the genome-reduced bacterium
Mycoplasma pneumoniae [
59]. Bacterial SEPs are, among other functions, known to be involved in basic (essential) processes underlying bacterial functioning, including cell division (
e.g. MciZ [
23]), transport of molecules (
e.g. KdpF [
25]) and signal transduction (
e.g. SafA [
27]) and to act as chaperones (
e.g. MntS [
60]). The discovery of the unexpected coding potential of bacterial sRNAs – not surprisingly – took place through mining the
E. coli genome [
53,
61,
62]. In this regard, the bacterial operon gene structure surely deserves some credit for the initial, unintended discovery of the functional potential of small proteins. In 1999, Gaβel
et al discovered, at that time, the smallest
E. coli protein KdpF (29 AA) through extensive examination of the K
+-transporter complex encoding
KdpABC operon, and co-purified KdpF with the complex emphasizing the possibility for small proteins to take active roles in protein complexation and bacterial functioning [
63].
Ribo-seq: a game changer for genome (re)annotation
The intriguing bacterial SEP functions reported highlight the need for new advances to ‘enrich’ bacterial genome annotations for sORFs. Ribo-seq revolutionized the study of translation by deep sequencing of ribosome-protected mRNA fragments [
75]. Ribosomes cover approximately 30 nt. when bound onto mRNA, causing these ‘ribosome protected parts’ to be resistant towards nuclease degradation (Figure B). Sequencing of these footprints gives clues on the whereabouts of ribosomes along translated mRNAs while additionally enabling to demarcate boundaries of translation, and thus delineation of translated ORFs. Recently, Ribo-seq was tailored towards identification of prokaryotic translation initiation sites (TISs) by stalling
initiating ribosomes (see Glossary) [
3] through the action of
retapamulin (RET) (see Glossary) (Ribo-RET) [
7] or alternatively, the newer pleuromutilin lefamulin, as especially in Gram-negative bacteria lefamulin exceeds retapamulin activity [
5].
The more precise TIS delineation further enabled the discovery of overlapping (
Figure 2B) (s)ORFs besides the discovery of ORFs translated as distinct protein isoforms or
N-terminal proteoforms (see Glossary) (
Figure 2C-D) [
3,
47], features that challenge standard annotation algorithms [
76,
77] and that are widespread among sORFs. Ribo-RET together with Ribo-seq data are at the basis of (conditional) gene reannotations [
15,
35,
57]. Besides revealing differential expression, conditional Ribo-seq and -RET profiles (
e.g. when comparing diverse bacterial growth conditions) can further disclose the existence of (conditional) gene extensions and truncations by showing differential Ribo-seq coverage patterns (3’ truncations and 5’ extensions) or alternative translations starts (5’ truncations and 5’ extensions) across the tested conditions (
Figure 2D). More recently, Ribo-seq protocols were developed to search the genome for ribosomal activity at stop codons (
Figure 3C) using the terminating ribosome bound release factor sequestrator apidaecin (API) (Ribo-API) in combination with puromycin, the latter to remedy the obstacles of stop codon read through and ribosome queuing inherent to the use of API [
5,
21,
36].
Since ribosomal protection does not necessarily point to translation, combining Ribo-RET with Ribo-API data (
Figure 3) may prove valuable for the finding of truly translated ORFs [
36], while additionally enabling the discovery of translational particularities such as ribosomal frameshifting events (
e.g. intsORF in
E. coli sfsA [
4]). Ribo-seq data in turn can fuel
de novo machine learning algorithms, like ribosome profiling assisted (re)annotation (REPARATION) [
15] and the modular algorithm smORFer [
20] for the delineation of translated prokaryotic ORFs. In particular for sORFs, that are so difficult to find in genomes through standard annotation tools, Ribo-seq has been proven instrumental to uncover their translation potential [
5,
15,
16,
18,
35]. SmProt offers a dedicated platform for the structured storage of SEPs from diverse model organisms, including
E. coli SEPs, which have been experimentally or computationally identified (by Ribo-seq) [
78]. As befit every technical application, Ribo-seq has its difficulties and shortcomings [
79] and SEP detection by means of (immuno)blotting, most commonly following epitope tagging, often serves as a sole means of experimental, biochemical validation of Ribo-seq-derived newly discovered SEPs and can therefore be used to filter out likely false positive SEP candidates [
80].
The ultimate aim: functional characterization of validated bacterial SEPs
As it are typically only few SEP validations that corroborate genome-wide bacterial SEP discoveries, there is definitely a need for more general, unbiased sORFeome-wide validation efforts. For example, an extensive study of the translational landscape of
S. pneumoniae connected the SEP of only one of their newly discovered sORFs,
rio3, to bacterial host colonization through targeted endogenous mutagenesis (
Table 1) [
18], as also done in the pathogenic bacterium
Y. pestis and in the extremophilic bacterium
Deinococcus radiodurans (
D. radiodurans) for the functional characterization of SEP-yp1 and SEP-yp2 [
48] and SEP068184 [
101], respectively implicating these SEPs in regulation of antiphagocytic capability and regulation of oxidative resistance.
Prior to functional studies, motif or domain prediction might provide a first hint towards the biological implication of the newly discovered SEPs (
Table 1). Bioinformatics prediction of hydrophobic transmembrane motifs is relatively straightforward and widely exploited for the exploratory study of novel SEPs [
21,
35,
48,
101]. The short primary SEP structures are however no ideal subjects for functional domain searches [
3,
36,
62], which is explained by the average size of protein domains coinciding with the upper length threshold of SEPs (100 AA) [
102]. When contrasting bacterial domain annotations of SEPs versus non-SEPs for the SL1344 and LT2
S. Typhimurium strains as well as the K12
E. coli strain (
Figure 4), the annotation ranged from 7% to 18% for SEPs, while for non-SEPs these percentages varied from 28% to 62%. While remarkably big discrepancies in the percentages of domain annotations between these related species/strains could be observed, SEP versus non-SEP domain annotations were in each case shown to be 3- to 4-fold lower. Large-scale SEP studies reporting Pfam domain predictions for high-confidence, novel SEPs are in line with these lower percentages of domain annotation [
48,
101]. Conservation analysis [
12,
36] of (the genomic surrounding of) predicted sORFs (
e.g. gene co-occurrence in case of polycistrons) might also help to prioritize functional SEPs (
Table 1) [
33]. For the genomic context of the putative sORF start codons, higher RNA secondary structure predictions (as compared to the start codon region itself) may further also serve as indicators of translation initiation [
36].
Specifically for sORF discovery in pathogenic bacteria, exploring putative SEP sequences for transmembrane domains and signal peptides is commonly exerted [
18,
48] as cell contact and secreted molecules are the major interaction routes between bacterial pathogens and their host cells [
48]. These predictions are also used for the discovery of novel quorum sensing systems and players in Gram-negatives, for which peptides are known to fulfil the role of quorum sensing pheromones [
18]. Cao
et al performed differential expression analysis of a set of putative sORFs interrogating different host environment-mimicking conditions [
48]. The same principle can be applied for all kinds of alternative stress conditions to find SEPs involved in different bacterial defence mechanisms and responses [
33,
101], but also expression differences over standard growth conditions may suggest a biological impact [
12]. A recent study on SEP profiling specifically focussed on identification of stress response SEPs through the choice for the extremophilic bacterium
D. radiodurans. Being a model organism for studying bacterial extreme stress responses, bacteria were subjected to ionizing radiation and oxidative stress resulting in the identification of 19 and 11 out of 109 newly identified SEPs as being upregulated under the respective stress condition [
101].
Concluding Remarks and Future Perspectives
In recent years, well-recognized bacterial SEP examples have been drawing the attention of microbiologists to sORFs as new study subjects in the context of general microbiology as well as host-pathogen interactions. As sORFs are generally under annotated in genomes, Ribo-seq efforts should be appreciated for the contributions they made to the wealth of (putative) SEPs uncovered over the past years. However, experimental SEP validation is deemed required prior to including them in existing bacterial genome annotations, with epitope protein tagging and MS as the most commonly used methods. Besides the inherent small size and often hydrophobic nature of SEPs as confounding features hindering their detection, the – sometimes highly specific – expression conditions of SEPs might further complicate the validation process [
16]. Conditional gene expression programmes of
Salmonella are clearly illustrated in the continuous cataloguing effort – the SalCom repository [
103]. Whether the common assumption of lower stability and abundance of SEPs serves a general additional hindering factor however, remains to be firmly established. Of note for SEP validation is that the rapid development of more sensitive high-throughput MS developments (
e.g. data independent acquisition (DIA) and ion mobility MS) is expected to further aid bottom-up as well as top-down SEP detection [
104].
While nonetheless both go hand-in-hand, overcoming the obstacles of SEP validation is one thing, but framing these small proteins within the host’s biology is another. Studies hunting for SEPs in diverse bacterial proteomes often focus on functional investigation of few individual SEPs through targeted endogenous mutagenesis and no efforts to collectively address the functionality of the small proteome have been undertaken [
18,
48,
101]. There is, consequently, no doubt that the largest part of the sORFeome remains to be functionally explored. Although when discussing biologically meaningful SEPs, not too much emphasis must be placed on the word ‘function’, a term used for proteins that contribute to cell fitness and that are under purifying selection [
105], as even merely the act of translating a sORF can influence the expression of its genomic context, much like operon leader peptides (
e.g. the threonine operon leader peptide thrL) [
3,
62]. Lately, peptide evidence for newly annotated leader peptides, like for the
MgtA leader peptide MgtL, has been changing the perception on this type of genomic elements as being exclusively regulatory (
Figure 1A) [
15].
With the functional knowledge at hand, and irrespective of ribosomal SEPs, bacterial SEPs can be concluded to be extensively engaged in protein complexation with an important fraction represented by (trans)membrane complexes [
62]. Many of the known small proteins function through binding and resulting regulation of standard-sized proteins [
3]. From this viewpoint, the current missing factor in bacterial SEP characterization is an interactomics-oriented approach. As protein-protein interactions (PPIs) might in particular play an important role for SEPs containing transmembrane domains, the underrepresentation of members of the
e.g. (transmembrane) transporter protein class (Figure ) and domain annotations in general for UniProt SEP annotations (Figure 44) might again indicate that the biological occupations of (transmembrane) SEPs rather goes through binding and regulation of other proteins or protein complexes, as also evident from the overrepresentation of transfer/carrier proteins among SEPs (
Figure 5), while the same analysis also revealed that unclassified proteins were about 2-fold overrepresented in the category of SEPs, providing an interesting niche for future functional SEP discoveries. Also, established
E. coli SEP interactomes show the SEP players to locate on the periphery of complexes, suggesting SEPs to take part in transient and differing interactions in multiple complexes, again providing some evidence for SEPs as regulators [
62].
The interactome-associated SEP aspects ‘hydrophobicity’ and ‘transientness’ eventually bring the concept of
in vivo proximity-dependent biotinylation (PDB) to the forefront. Unlike affinity purification (coupled to MS, AP-MS), PDB approaches have the power to catch weak and transient protein-protein interactions and equally important for the SEP protein class shown to be enriched for transmembrane proteins, are capable of handling less soluble proteins viewing its compatibility with the solubilization of membrane-(associated) proteins [
106,
107]. APEX, a PDB method exploiting the enzymatic activity of peroxidases for the biotinylation of proteins [
107], was already successfully applied in bacteria for the elucidation of the type VI secretion biogenesis process in
E. coli [
108]. BioID, standing for proximity-dependent biotin identification, is another implementation of the PDB principle and requires the translational fusion of the protein of interest to a promiscuous biotin ligase (PBL) for the biotinylation of proximal and interacting proteins. As the overlap between APEX and BioID interactomes has been claimed to be limited, BioID could thus potentially offer interesting complementary perspectives into bacterial SEP biology [
107].
High-throughput phenotypic screening is also emerging as an initiative to characterize gene products through the use of phenotypic microarrays interrogating the metabolization of compounds deterministic for unique molecular pathways [
109]. The construction of knock-out libraries by means of CRISPR/Cas9 or alternative recombineering strategies could offer a valuable approach to enable sORF/SEP phenotyping at larger scales [
42,
110]. Combining data gathered through a diverse range of omics techniques like PBL and phenotyping should enable small protein research to finally piece together the functionalities of bacterial SEPs encoded by newly discovered sORFs.
Table 1.
Available toolkit for sORF prediction, sORF annotation and SEP expression validation and functional characterization. MW; molecular weight, MS; mass spectrometry.
Table 1.
Available toolkit for sORF prediction, sORF annotation and SEP expression validation and functional characterization. MW; molecular weight, MS; mass spectrometry.
|
State-of-the-art |
Pros |
Cons |
Suggested improvements |
sORF prediction and annotation |
Ribo-seq |
- Genome-wide - Independent from existing annotations - Indicative of ribosomal activity - Broadly applicable - Improved resolution for detection of start (Ribo-RET) and stop codons (Ribo-API) |
- Requirement for experimental SEP validation - Computationally intensive and complex - Poor data resolution inherent to bacterial Ribo-seq |
Refinement of bacterial Ribo-seq protocols and data-analysis |
SEP expression validation |
MS |
- SEP abundance data - Proteome-wide technique suited for empirical SEP discovery |
- Limited number of tryptic SEP peptides - Hydrophobic character of peptides - Sensitivity of detection |
- Use of alternative proteases (e.g. chymotrypsin) - High-MW protein depletion or low-MW protein enrichment |
(Immuno) blotting |
- Information on MW and thus SEP integrity - Quantification of SEP expression |
- Tag interference on SEP function/localization - Small SEP size - Sensitivity not adequate for low SEP abundances |
- Use of smaller, charge-neutral tags (e.g. HiBiT) - SEP specific customization (e.g., blotting membrane (type, pore size), blotting buffer and method) - In solution detection of SEPs |
SEP functional characterization |
Conservation analysis |
- first impression of SEP functioning - High-throughput screening |
Lower conservation of SEPs |
- Interrogation of gene co-occurrence - RNA secondary structure analysis |
Domain prediction |
First impression of SEP functioning and localization |
Too short primary SEP sequences for domain prediction |
Motif prediction (e.g. transmembrane motifs) |
Mutation analysis |
Targeted and multiplex approach |
Laboursome |
|
Expression analysis |
|
Conditional impact of expression unknown |
Conditional expression maps |