1. Introduction
Serine endopeptidases of the chymotrypsin S1A subfamily are a large group of enzymes widely distributed in nature. In insects, they play an important role in various physiological processes such as digestion, development, and innate immunity regulation [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10]. Activity of SPs depends on a catalytic triad of amino acid residues: histidine H57, aspartic acid D102, and serine S195 (hereinafter bovine chymotrypsinogen A numbering, XP_003587247). The substrate specificity of SPs is largely determined by the structure of the S1 substrate binding subsite, where residues 189, 216 and 226 play the major role [
11]. According to the S1 pocket organization, various types of SPs are distinguished, including trypsins (D189, G216, G226), chymotrypsins (S189, G216, G226; S189, G216, A226, and others), and elastases (S189, V216, T226; S189, V216, D226, and others).
Development of high throughput sequencing technologies lead to the appearance of high-quality genome assemblies for the whole-genome investigation of SP/SPH genes, performed for model insects, as well as species of great agricultural and medical importance. Among Hemiptera, 90 SP/SPH genes were found in
Nilaparvata lugens (Delphacidae) [
12] (
Figure 1). In dipterans 257 genes were identified in
Drosophila melanogaster (family Drosophilidae) [
13,
14] and even more in mosquitoes (family Culicidae)
Anopheles gambiae – 337 [
15,
16], and
Aedes aegypti – 369 [
17,
18]. For the order Lepidoptera, data on several representatives are known: 242 genes were found in
Manduca sexta (Sphingidae) [
19,
20], 169 genes in
Bombyx mori (Bombycidae) [
21,
22], 221 genes in
Plutella xylostella (Plutellidae) [
23] and 109 genes in
Spodoptera frugiperda (Noctuidae) [
24]. A reduced set of only 57 SP/SPH genes was found in
Apis mellifera (Hymenoptera: Apidae) [
14,
25]. The gene repertoire was larger in parasitic hymenoptarans with 74 genes described in
Microplitis demolitor (Braconidae), 143 genes in the parasitic wasps
Nasonia vitripennis, and 183 genes in
Pteromalus puparum (Pteromalidae) [
26].
Genome-wide analyses in beetles (Coleoptera) identified 125 SP/SPH genes in
Rhyzopertha dominica (Bostrichidae) [
27]. From the first coleopteran sequenced genome of the red flour beetle
Tribolium castaneum (Tenebrionidae) 177 genes coding for SPs/SPHs were identified [
14,
28]. For another tenebrionid, the yellow mealworm
Tenebrio molitor, it was previously identified in the larval gut 38 SP/SPH [
29] transcripts, two of which corresponded to the major digestive trypsin and chymotrypsin studied using biochemical approaches [
30,
31]. Later, 48 SPs/SPHs transcripts were identified in larval gut during the study of Cry3A intoxication in
T. molitor [
32]. Analyzing trypsin-like SPs/SPHs in transcriptome datasets from different stages of
T. molitor life cycle, we have previously
de novo assembled 54 trypsins and five trypsin-like SPHs [
33]. We also characterized recombinant preparations of SP, SerP38, and SPH, SerPH122, expressed in the
Komagataella kurtzmanii system [
34,
35]. Recent work by Wu and coauthors [
36] provided information on 200
T. molitor genes including 112 SPs and 88 SPHs, and transcriptome datasets together with RT-PCR analysis were used for SP-related genes expression profiling at various developmental stages and tissues.
Here, we present the extended and corrected dataset of putative
T. molitor SP/SPH cDNAs obtained from genome and transcriptome datasets. We have identified several groups of deduced proteins based on the composition of their active site and predicted specificity, analyzed evolutionary relationships and evaluated differential expression along the life cycle. Finally, sets of SP-related genes involved in digestion, embryonic development, metamorphosis and innate immunity was predicted providing a valuable information for further physiological, biochemical, and phylogenetic studies of tenebrionid pests. These data are of particular interest due to the fact that
T. molitor is the first insect approved by the European Food Safety Authority as a novel food in specific conditions and uses, testifying its growing relevance and potential [
37].
3. Discussion
SP-related proteins of S1A family identified in
T. molitor transcriptome include 269 sequences of which 137 were identified as active SPs with classical catalytic residues, and 125 were annotated as putative non-active SPHs that possess one or more substitutions in the catalytic triad. Seven deduced sequences containing several SP/SPH domains were putative polypeptidases, which physiological role remains generally unknown.
T.
molitor SPs/SPHs of the S1A chymotrypsin family occupy an intermediate position among insects in terms of number of identified sequences. Comparable number of SP-related sequences (257) was described for
D. melanogaster (Diptera: Brachycera) [
14], whereas in mosquitoes
A. aegypti and
A. gambiae (Diptera: Nematocera) genome-wide analysis identified 369 and 337 SP-related sequences, respectively [
14,
17]. Significantly less number with only 44 identified sequences of putative SPs/SPHs was described in
A. mellifera (Hymenoptera) [
25].
In
T. molitor, 84 SPs and 102 SPHs without regulatory domains constitute the largest group of SP-related proteins. Transcripts of 61 SPs were expressed only in the feeding life stages; 24 of them were highly expressed in the larval gut and presumably play an important role in digestion. Similar quantitative data were previously obtained for other insects including larvae of
D. melanogaster (53 gut peptidases of which 35 were highly expressed) [
14]
, A. gambiae (63 and 27, respectively) [
16] and
M. sexta (61 and 35, respectively) [
20]. But even closely related insects have functional differences in the general set of digestive SPs; for example, the most highly expressed SP in
T. molitor is trypsin SerP1, and in
T. castaneum it is chymotrypsin XP_970603.1, although their major digestive cysteine peptidases are orthologs with 74% identity [
70]. At the same time, there is a close link between the primary structure of the certain digestive SPs and their functions. Accordingly, a comparison of two orthologous pairs of
T. molitor and
T. castaneum chymotrypsin-like digestive SPs, SerP38 and CBC01177 (pair I, respectively), and SerP88 and CBC01166 (pair II, respectively), shows, that pair I was expressed at larval and adult stages, while pair II was expressed only in the larval gut [
71].
The remaining 23 transcripts of SPs without regulatory domains showed constitutive or specific expression at certain stages of
T. molitor development. The physiological role of most of these SPs requires further study, but it can be assumed that SPs showing high expression at the egg stage participate in the hydrolysis of storage proteins, as was previously shown for
B. mori [
1], while the SPs expressed at the pupal stages of
T. molitor can be involved in the breakdown of the larval structures during metamorphosis.
In addition to SPs, the largest group of 95
T. molitor SPH sequences lacking regulatory regions were also expressed predominantly during feeding stages. The physiological role of SPHs is still poorly understood; however, some of them highly expressed (9 out of 95) during the feeding stages, may play a certain regulatory role that may be related with digestive peptidase activation or their interaction with substrates or inhibitors in the midgut lumen. It was shown that some of the homologs are able to bind with the substrates and even provide a low-rate hydrolysis [
35,
52].
Another group of SP-related proteins identified, included 53 sequences of SPs and 23 SPHs with regulatory domains, such as different clips, LDL, SRCR, TSP, and others. While having a significantly lower expression levels than that of the gut digestive peptidases, most of them demonstrated constitutive expression throughout the entire life cycle, while specific SPs and SPHs with various regulatory domains demonstrated increased expression at eggs or pupae stages.
Among these sequences clip SPs/SPHs were the most numerous. Of the 60 SPs/SPHs with a clip domain that we identified in
T. molitor, 16 belonged to the clip-A type (all SPHs), 16 to clip-B (13 SPs and 3 SPHs), 17 to clip-C (15 SPs and 2 SPHs) and 11 to clip-D (all SPs). A total number of 60 clip SPs/SPHs is close to 54 sequences identified in the closely related
T. castaneum [
14,
36], and about twice amount of clip SPs/SPHs, including a distinct subtype clip-E SPs, was identified in mosquitoes
A. aegypti and
A. gambiae [
5,
16]. According to the available data, SPs/SPHs with clip domains are non-digestive and are present in hemolymph of insects and other arthropods. They play an important role in regulation of various physiological processes in insects like innate immune responses leading to activation of pPO necessary for melanization, activation of Toll-dependent signaling pathway leading to synthesis of antimicrobial peptides [
43] or regulation of dorsal–ventral pattern in
D. melanogaster embryos [
72], as well as regulate coagulation cascade during hemolymph clotting in crabs [
73].
The majority of
T. molitor clip-containing transcripts were expressed at all or most stages of the ontogeny, but three of them were specific to the egg stage (SerP116 and SerP166 and SerPH203), while at the pupae stage only increased expression of constitutively expressed clip transcripts was observed. The only experimental data on the specific roles of clip SPs/SPHs in
T. molitor came from B.L. Lee's laboratory, where the extracellular larval activation cascade of the Toll receptor and pPO was characterized in detail [
6,
57,
60,
64]. The proteolytic part of the cascade starts with SerP449 with multiple regulatory domains (MSP), which activates the downstream proSerP228 with clip-C domain (proSAE), which in turn activates proSerP183 (proSPE) with clip-B involved in proSpätzle or pPO activation, but processing of pPO requires additionally activation of clip-A homolog proSerPH415.
The remaining smaller part of
T. molitor SPs/SPHs had different regulatory domains. Transcripts of SPs with a GD domain were expressed constitutively throughout the entire
T. molitor life cycle including eggs and pupae, and all of them were from non-annotated group of SPs. Similar peptidases with GD domain were well studied in
D. melanogaster, but for the egg stage only [
51,
65,
72]. Stable constitutive mRNA expression of these peptidases in
T. molitor transcriptomes indicates their possible participation in a wide range of physiological processes in addition to the expected involvement in the cascades forming embryonic polarity during egg development. Another transcript of a large SP Tequila (SerP55) with a variety of regulatory domains was upregulated during
T. molitor pupal and adult stages, and in
D. melanogaster this SP was found throughout development participating in immunity response [
74].
One of the most interesting groups in
T. molitor were polypeptidases, mainly expressed at the pupal and adult stages. Six of them comprise two or three SP/SPH domains and several Sushi domains (Sushi(2)-SP(H)-Sushi(2)-SPH(-Sushi-SPH)). A similar domain architecture, including several peptidase domains and several Sushi domains, has a peptidase SP14 in
T. castaneum [
14]. In
A. gambiae several polypeptidases with a little different structure were identified (SP(H)-SPH-clipE-SPH) [
16]. In addition, a polypeptidase Nudel (pSerP1050) was also found in
T. molitor, which contained two peptidase domains – trypsin and a SPH domain with LDL domains. Similar Nudel (LDL(2)-SP-LDL(2)-SPH-LDL(3)) peptidases were identified in many insects [
14,
16,
23]. In
D. melanogaster embryo Nudel initiates the peptidase cascade related with dorsal-ventral patterning [
72]. Thus, complex polypeptidases were found in insects, but this issue requires further study in order to accurately identify the structure and functions of such proteins.
The great diversity and abundance of serine peptidases of the chymotrypsin S1A family in various insects provide great opportunities for a more detailed study of insects important for agriculture and/or medicine, and for a fundamental understanding of their physiology. We hope that our study will allow scientists to move in this direction.
5. Conclusions
Serine peptidases (SPs) and homologs (SPHs) of S1A family constitute a very diverse family of mostly secreted proteins involved in a variety of processes including digestion as well as development and innate immunity regulation. A thorough analysis of several transcriptomes and two newly sequenced genomes of T. molitor allowed us to update available information and identify 269 SPs and SPHs in this insect, performing sequence analysis and annotation, constructing phylogenetic relationships, and evaluating expression pattern across the entire life cycle. For 122 SPs their putative trypsin-, chymotrypsin- and elastase-like specificity was predicted from the S1 binding subsite sequence analysis, and for 15 non-annotated SPs specificity remains obscure, due to peculiarities of their S1 subsite structure. All studied SP-related sequences of T. molitor were grouped according to organization of their propeptide region. The largest group of 84 SPs and 102 SPHs had no regulatory domains, while the remaining 53 SPs and 23 SPHs had different regulatory domains in the propeptide. Transcripts of 61 SPs without regulatory domains were expressed only in the feeding life stages likely being involved in digestion. The remaining 23 transcripts of SPs without regulatory domains showed mostly constitutive expression while those upregulated at the egg and pupa stages may be involved in the hydrolysis of storage proteins and in the breakdown of the larval structures during metamorphosis, respectively. In addition to SPs, the largest group of 95 T. molitor SPH sequences lacking regulatory regions were also expressed predominantly during feeding stages and their physiological role is presumably related to the digestive process, in particular it may be an interaction with substrates or inhibitors in the midgut lumen.
The group of SPs and SPHs with regulatory domains contained in the propeptide four types of clips (A-D), GD, Sushi, LDL, SEA, PAN, FZ, TSP, EGF, CUB, SRCR, CBM domains. Transcripts from the majority of these proteins were expressed constitutively throughout the entire life cycle of T. molitor, while some of them were specific to the egg stage or/and upregulated at the pupal stage. For most of this regulatory SP/SPH transcripts significantly lower expression level was documented than for the above-described transcripts associated with digestive functions. One of the most interesting groups in T. molitor were seven polypeptidases, mainly expressed at the pupal and adult stages. Most of them comprise two or three SP/SPH domains and several Sushi domains. Similar complex polypeptidases were identified in few insect species, but this group of proteins requires further study in order to accurately identify their structure and functions. The data obtained provide valuable information for the further studies on biological functions in insects of diverse S1A peptidase family.
Figure 1.
Total number of SP and SPH genes found in sequenced genomes of insects from different orders. Data on SP are shaded in blue, data on SPH are in yellow, and undifferentiated data on the sum of SP/SPH genes are shaded in green.
Figure 1.
Total number of SP and SPH genes found in sequenced genomes of insects from different orders. Data on SP are shaded in blue, data on SPH are in yellow, and undifferentiated data on the sum of SP/SPH genes are shaded in green.
Figure 2.
Domain organization of 64 trypsins and 10 trypsin-like SPs of T. molitor. Regulatory domains are marked with different shapes and colors. Description for domains: SignalP – signal peptide; TM – transmembrane domain; Clip (B/C/D) – Clip domain; CUB – C1r/C1s, Uegf, Bmp1 domain; TSP – thrombospondin domain; LDL – Low-Density Lipoprotein receptor class A repeat; CBM – Chitin-Binding Domain; SRCR – Scavenger Receptor Cysteine-Rich domain; Fz – Frizzled domain; PAN – Plasminogen-Apple-Nematode domain.
Figure 2.
Domain organization of 64 trypsins and 10 trypsin-like SPs of T. molitor. Regulatory domains are marked with different shapes and colors. Description for domains: SignalP – signal peptide; TM – transmembrane domain; Clip (B/C/D) – Clip domain; CUB – C1r/C1s, Uegf, Bmp1 domain; TSP – thrombospondin domain; LDL – Low-Density Lipoprotein receptor class A repeat; CBM – Chitin-Binding Domain; SRCR – Scavenger Receptor Cysteine-Rich domain; Fz – Frizzled domain; PAN – Plasminogen-Apple-Nematode domain.
Figure 3.
Domain organization of 30 chymotrypsin-like peptidases, 18 elastase-like peptidases and 15 non-annotated peptidases of T. molitor. Regulatory domains are marked with different shapes and colors. Description for domains: SignalP – signal peptide; TM – transmembrane domain; LDL – Low-Density Lipoprotein receptor class A repeat; Sushi – Sushi domain; GD – Gastrulation Defective domain.
Figure 3.
Domain organization of 30 chymotrypsin-like peptidases, 18 elastase-like peptidases and 15 non-annotated peptidases of T. molitor. Regulatory domains are marked with different shapes and colors. Description for domains: SignalP – signal peptide; TM – transmembrane domain; LDL – Low-Density Lipoprotein receptor class A repeat; Sushi – Sushi domain; GD – Gastrulation Defective domain.
Figure 4.
Domain organization of 125 SPHs and 7 polypeptidases of T. molitor. Regulatory domains are marked with different shapes and colors. Description for domains: SignalP – signal peptide; TM – transmembrane domain; Clip – Clip domain; SEA – Sperm protein, Enterokinase and Agrin domain; LDL – Low-Density Lipoprotein receptor class A repeat; SRCR – Scavenger Receptor Cysteine-Rich domain; Fz – Frizzled domain; Sushi – Sushi domain; EGF – laminin/Epidermal Growth Factor-like domain.
Figure 4.
Domain organization of 125 SPHs and 7 polypeptidases of T. molitor. Regulatory domains are marked with different shapes and colors. Description for domains: SignalP – signal peptide; TM – transmembrane domain; Clip – Clip domain; SEA – Sperm protein, Enterokinase and Agrin domain; LDL – Low-Density Lipoprotein receptor class A repeat; SRCR – Scavenger Receptor Cysteine-Rich domain; Fz – Frizzled domain; Sushi – Sushi domain; EGF – laminin/Epidermal Growth Factor-like domain.
Figure 5.
Phylogenetic analysis of 269 SPs and SPHs of T. molitor. Complete protein sequences were aligned using MAFFT. The phylogenetic tree was built in the IQTREE service. Peptidases in the tree are divided into two groups: group A (red) – SP and SPH without regulatory domains; group B (blue) – SP and SPH with regulatory domains (incl uding polypeptidases). For the interpretation of the colors of the identifiers, see the legend above.
Figure 5.
Phylogenetic analysis of 269 SPs and SPHs of T. molitor. Complete protein sequences were aligned using MAFFT. The phylogenetic tree was built in the IQTREE service. Peptidases in the tree are divided into two groups: group A (red) – SP and SPH without regulatory domains; group B (blue) – SP and SPH with regulatory domains (incl uding polypeptidases). For the interpretation of the colors of the identifiers, see the legend above.
Figure 6.
Heatmaps of stage-specific expression pattern of 269 SP/SPH transcripts of
T. molitor. The hierarchical clustering of RPKM values was used to compare the relative expression levels of transcripts from different
T. molitor life stages transcriptomes, differentiated into 6 distinct groups. Groups 1-4 – SP/SPH without regulatory domains in the propeptide, groups 5-6 have regulatory domains. Group 1 (red) – SPs expressed in feeding stages, group 2 (purple) – SPs expressed at the stages of development, metamorphosis or also at other stages of the life cycle, group 3 (orange) – SPHs expressed at feeding stages; group 4 (yellow) – SPHs expressed at the stages of development, metamorphosis or also at other stages of the life cycle, group 5 – SPs and SPHs containing сlip domains; group 6 – SPs, SPHs and polypeptidases containing other than clip regulatory domains. The level of mRNA expression is presented as a heatmap from blue to red (log
2(RPKM+1)). The resulting p-values were adjusted using the Benjamini and Hochberg approach [
66]. Values p<0.05 are colored green, indicating the significance of differences in the expression at different stages of
T. molitor development, values from 0.05 to 0.1 are colored yellow, values greater than 0.1 are colored red, showing the unreliability of differences in the expression values at different stages of
T. molitor development. The colors of SP/SPH names indicate the types of SPs: trypsins (TRY) – blue, trypsin-like (TRY-like) – light blue, chymotrypsin-like (CHYM) – purple, elastase-like (ELA) – orange, non-annotated (NA) – grey; pSerp – polypeptidases, TM – transmembrane domain. Designations for regulatory domains: Clip-A – brown; Clip-B – blue; Clip-C – light blue, Clip-D – grey-blue; Sushi – green; GD – red; MSP – blue-green; peptidases with several regulatory domains – dark blue. Life cycle stages: E – egg, LII – second instar larvae, LIV – four instar larvae, EP – early pupa, LP – late pupa, M – male, F - female.
Figure 6.
Heatmaps of stage-specific expression pattern of 269 SP/SPH transcripts of
T. molitor. The hierarchical clustering of RPKM values was used to compare the relative expression levels of transcripts from different
T. molitor life stages transcriptomes, differentiated into 6 distinct groups. Groups 1-4 – SP/SPH without regulatory domains in the propeptide, groups 5-6 have regulatory domains. Group 1 (red) – SPs expressed in feeding stages, group 2 (purple) – SPs expressed at the stages of development, metamorphosis or also at other stages of the life cycle, group 3 (orange) – SPHs expressed at feeding stages; group 4 (yellow) – SPHs expressed at the stages of development, metamorphosis or also at other stages of the life cycle, group 5 – SPs and SPHs containing сlip domains; group 6 – SPs, SPHs and polypeptidases containing other than clip regulatory domains. The level of mRNA expression is presented as a heatmap from blue to red (log
2(RPKM+1)). The resulting p-values were adjusted using the Benjamini and Hochberg approach [
66]. Values p<0.05 are colored green, indicating the significance of differences in the expression at different stages of
T. molitor development, values from 0.05 to 0.1 are colored yellow, values greater than 0.1 are colored red, showing the unreliability of differences in the expression values at different stages of
T. molitor development. The colors of SP/SPH names indicate the types of SPs: trypsins (TRY) – blue, trypsin-like (TRY-like) – light blue, chymotrypsin-like (CHYM) – purple, elastase-like (ELA) – orange, non-annotated (NA) – grey; pSerp – polypeptidases, TM – transmembrane domain. Designations for regulatory domains: Clip-A – brown; Clip-B – blue; Clip-C – light blue, Clip-D – grey-blue; Sushi – green; GD – red; MSP – blue-green; peptidases with several regulatory domains – dark blue. Life cycle stages: E – egg, LII – second instar larvae, LIV – four instar larvae, EP – early pupa, LP – late pupa, M – male, F - female.
Table 1.
Domain organization and key structure features of 64 trypsins and 10 trypsin-like SPs of T. molitor.
Table 1.
Domain organization and key structure features of 64 trypsins and 10 trypsin-like SPs of T. molitor.
№ |
Name |
NCBI ID (protein) |
Preproenzyme/Mature Enzyme (aa) |
SignalP#break#(aa) |
Regulatory domain |
Propeptide#break#cleavage site |
Active site |
|
S1 subsite |
Enzyme #break#specificity |
Mm mature, Da |
pI |
TM#break#(position) |
1 |
SerP1 |
ABC88729 |
258 |
227 |
16 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
22742 |
6.9 |
- |
2 |
SerP2 |
QWS65012 |
252 |
227 |
16 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
23618 |
4.3 |
- |
3 |
SerP3 |
QWS65044 |
259 |
228 |
16 |
- |
K|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
24386 |
5.0 |
- |
4 |
SerP4 |
QWS65013 |
250 |
225 |
15 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
24140 |
5.2 |
- |
5 |
SerP5 |
QWS65045 |
333 |
236 |
24 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26035 |
9.2 |
- |
6 |
SerP6 |
QWS65014 |
258 |
226 |
17 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
23414 |
3.8 |
- |
7 |
SerP20 |
QWS65048 |
361 |
238 |
17 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26395 |
9.0 |
- |
8 |
SerP21 |
QWS65049 |
276 |
228 |
22 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
24732 |
4.5 |
- |
9 |
SerP22 |
QWS65050 |
290 |
242 |
17 |
- |
R|VVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
25975 |
6.2 |
- |
10 |
SerP26 |
QWS65055 |
254 |
227 |
23 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
24214 |
5.8 |
- |
11 |
SerP28 |
QWS65056 |
310 |
241 |
26 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27033 |
7.6 |
- |
12 |
SerP30 |
QWS65015 |
249 |
226 |
16 |
- |
K|IIGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
24862 |
8.9 |
- |
13 |
SerP35 |
QWS65057 |
260 |
231 |
21 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
24884 |
5.6 |
- |
14 |
SerP37 |
QWS65058 |
298 |
251 |
19 |
- |
R|VVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27327 |
6.2 |
- |
15 |
SerP48 |
QWS65017 |
321 |
295 |
22 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
32018 |
6.7 |
- |
16 |
SerP76 |
QWS65019 |
387 |
362 |
18 |
- |
K|IIGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
39417 |
5.7 |
367-386 |
17 |
SerP77 |
QWS65060 |
288 |
252 |
17 |
- |
K|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27164 |
8.3 |
- |
18 |
SerP84 |
QWS65020 |
332 |
308 |
20 |
- |
K|VVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
33286 |
5.0 |
313-330 |
19 |
SerP104 |
QWS65061 |
323 |
300 |
18 |
- |
K|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
32646 |
4.2 |
300-323 |
20 |
SerP125 |
QWS65024 |
278 |
254 |
19 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27535 |
4.8 |
257-275 |
21 |
SerP135 |
QWS65027 |
292 |
246 |
22 |
- |
G|IIGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26850 |
9.5 |
- |
22 |
SerP209 |
QWS65033 |
258 |
227 |
16 |
- |
R|IIGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
22943 |
4.8 |
- |
23 |
SerP266 |
QWS65037 |
281 |
256 |
18 |
- |
K|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27895 |
8.8 |
- |
24 |
SerP360 |
CAH1374004 |
286 |
249 |
19 |
- |
K|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27480 |
4.7 |
- |
25 |
SerP635 |
WJL97986 |
249 |
224 |
19 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
24044 |
4.1 |
- |
26 |
SerP100 |
WJL97987 |
293 |
263 |
23 |
- |
R|IIGG |
H |
D |
S |
|
D |
G |
A |
Trypsin -like |
28605 |
8.8 |
- |
27 |
SerP105 |
CAH1374591 |
305 |
243 |
23 |
- |
L|IIGG |
H |
D |
S |
|
D |
G |
A |
Trypsin -like |
26155 |
5.9 |
- |
28 |
SerP188 |
KAJ3637256 |
303 |
271 |
20 |
- |
R|IVGG |
H |
D |
S |
|
D |
G |
A |
Trypsin -like |
29751 |
8.3 |
- |
29 |
SerP278 |
CAH1363947 |
298 |
256 |
18 |
- |
R|IIGG |
H |
D |
S |
|
D |
G |
A |
Trypsin -like |
27716 |
6.8 |
- |
30 |
SerP86 |
QWS65021 |
458 |
258 |
22 |
Clip-B |
R|ILDG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28226 |
8.4 |
- |
31 |
SerP113 |
QWS65022 |
386 |
255 |
23 |
Clip-B |
R|IING |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28255 |
7.7 |
- |
32 |
SerP116 |
QWS65063 |
381 |
257 |
16 |
Clip-B |
K|IVNG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28382 |
6.4 |
- |
33 |
SerP141 |
QWS65028 |
435 |
259 |
21 |
Clip-B |
R|IFGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28844 |
9.2 |
- |
34 |
SerP161 |
WJL97988 |
278 |
254 |
20 |
Clip-B |
R|ITSG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27807 |
7.7 |
- |
35 |
SerP166 |
QWS65064 |
376 |
259 |
15 |
Clip-B |
K|LVND |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28449 |
4.8 |
- |
36 |
SerP183 SPE |
BAG14262 |
383 |
265 |
18 |
Clip-B |
R|IYGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
29203 |
7.6 |
- |
37 |
SerP193 |
QWS65032 |
375 |
247 |
22 |
Clip-B |
R|ILGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27564 |
6.2 |
- |
38 |
SerP272 |
QWS65038 |
404 |
297 |
17 |
Clip-B |
K|IYGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
32710 |
8.0 |
- |
39 |
SerP275 |
QWS65065 |
430 |
257 |
23 |
Clip-B (2) |
K|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28969 |
8.5 |
- |
40 |
SerP370 |
QWS65041 |
407 |
257 |
21 |
Clip-B |
K|ISNG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28048 |
6.4 |
- |
41 |
SerP409 |
QWS65042 |
447 |
234 |
22 |
Clip-B |
K|IGKG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26142 |
8.8 |
- |
42 |
SerP218 |
CAH1363991 |
356 |
263 |
22 |
Clip-B |
K|VSGG |
H |
D |
S |
|
D |
A |
T |
Trypsin -like |
29129 |
6.3 |
- |
43 |
SerP119 |
QWS65023 |
387 |
253 |
19 |
Clip-C |
L|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28333 |
8.1 |
- |
44 |
SerP145 |
QWS65029 |
370 |
241 |
22 |
Clip-C |
H|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26781 |
7.7 |
- |
45 |
SerP163 |
QWS65030 |
354 |
254 |
21 |
Clip-C |
V|IAFG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28041 |
5.7 |
- |
46 |
SerP173 |
QWS65031 |
362 |
249 |
21 |
Clip-C |
F|VFGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27495 |
4.9 |
- |
47 |
SerP227 |
QWS65034 |
376 |
251 |
23 |
Clip-C |
L|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27969 |
5.8 |
- |
48 |
SerP228 SAE |
QWS65035 |
374 |
250 |
20 |
Clip-C |
L|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27849 |
6.2 |
- |
49 |
SerP247 |
QWS65036 |
379 |
257 |
18 |
Clip-C |
T|IISM |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28343 |
6.1 |
- |
50 |
SerP282 |
QWS65039 |
349 |
270 |
17 |
Clip-C |
G|ITGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
29212 |
6.0 |
- |
51 |
SerP297 |
QWS65066 |
350 |
255 |
18 |
Clip-C |
V|EYEE |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28238 |
5.7 |
- |
52 |
SerP345 |
QWS65040 |
359 |
234 |
22 |
Clip-C |
L|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26360 |
6.5 |
- |
53 |
SerP347 |
QWS65067 |
367 |
256 |
25 |
Clip-C |
G|IAIG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28001 |
5.8 |
- |
54 |
SerP398 |
CAH1365893 |
385 |
253 |
19 |
Clip-C |
L|IIGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
28360 |
8.9 |
- |
55 |
SerP61 |
CAH1377522 |
422 |
246 |
26 |
Clip-C |
L|IVGG |
H |
D |
S |
|
D |
G |
A |
Trypsin -like |
27368 |
8.7 |
- |
56 |
SerP124 |
CAH1383174 |
371 |
250 |
20 |
Clip-C |
L|IVGG |
H |
D |
S |
|
D |
S |
G |
Trypsin -like |
27690 |
6.0 |
- |
57 |
SerP291 |
WJL97989 |
357 |
251 |
20 |
Clip-C |
Q|IWGG |
H |
D |
S |
|
D |
G |
T |
Trypsin -like |
28108 |
7.1 |
- |
58 |
SerP15 |
QWS65047 |
516 |
235 |
23 |
Clip-D |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
25699 |
9.2 |
- |
59 |
SerP24 |
QWS65051 |
810 |
243 |
19 |
Clip-D |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27555 |
5.4 |
- |
60 |
SerP27 |
QWS65052 |
369 |
242 |
19 |
Clip-D |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26704 |
9.0 |
- |
61 |
SerP31 |
CAH1379474 |
557 |
244 |
15 |
Clip-D |
K|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26888 |
6.5 |
- |
62 |
SerP40 |
QWS65016 |
392 |
241 |
21 |
Clip-D |
G|NPGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26535 |
5.5 |
- |
63 |
SerP65 |
QWS65053 |
619 |
240 |
20 |
Clip-D |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26001 |
9.2 |
- |
64 |
SerP66 |
QWS65059 |
523 |
245 |
29 |
Clip-D |
R|VVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27560 |
9.1 |
- |
65 |
SerP109 |
QWS65062 |
964 |
247 |
17 |
Clip-D |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26987 |
7.8 |
- |
66 |
SerP127 |
QWS65025 |
376 |
247 |
22 |
Clip-D |
R|IVNG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27075 |
7.0 |
- |
67 |
SerP131 |
QWS65026 |
375 |
247 |
22 |
Clip-D |
R|VVNG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26799 |
8.4 |
- |
68 |
SerP317 |
QWS65054 |
389 |
246 |
16 |
Clip-D |
R|IIGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27195 |
6.2 |
- |
69 |
SerP178 |
KAJ3638924 |
409 |
242 |
27 |
CUB |
R|IVGG |
H |
D |
S |
|
D |
G |
A |
Trypsin -like |
26019 |
5.0 |
- |
70 |
SerP725 |
KAJ3638922 |
405 |
246 |
23 |
CUB |
K|IVGG |
H |
D |
S |
|
D |
G |
A |
Trypsin -like |
26660 |
4.9 |
- |
71 |
SerP285 Corin |
CAH1378270 |
965 |
247 |
- |
Fz, LDL (2), SRCR |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
27268 |
5.9 |
338-359 |
72 |
SerP14 |
QWS65046 |
1289 |
286 |
- |
LDL (3) |
R|IVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
31448 |
5.9 |
68-94 |
73 |
SerP11 TSP |
QWS65043 |
447 |
231 |
19 |
TSP (2) |
K|IIGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26306 |
9.5 |
- |
74 |
SerP55 Tequila |
QWS65018 |
1672 |
245 |
23 |
CBM (3), LDL (3), SRCR (2) PAN |
R|VVRG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
26947 |
5.9 |
- |
Table 2.
Domain organization and key structure features of 30 chymotrypsin-like SPs of T. molitor.
Table 2.
Domain organization and key structure features of 30 chymotrypsin-like SPs of T. molitor.
№ |
Name |
NCBI ID (protein) |
Preproenzyme/Mature Enzyme (aa) |
SignalP#break#(aa) |
Regulatory domain |
Propeptide#break#cleavage site |
Active site |
|
S1 subsite |
Enzyme specificity |
Mm mature, Da |
pI |
1 |
SerP16 |
CAH1383061 |
275 |
246 |
16 |
- |
H|ITNG |
H |
D |
S |
|
S |
G |
S |
Chymotrypsin-like |
25749 |
3.9 |
2 |
SerP69 |
ABC88746 |
275 |
230 |
16 |
- |
R|IISG |
H |
D |
S |
|
S |
G |
S |
Chymotrypsin-like |
22899 |
8.8 |
3 |
SerP71 |
CAG9035017 |
271 |
235 |
21 |
- |
R|IING |
H |
D |
S |
|
S |
G |
A |
Chymotrypsin-like |
24308 |
4.1 |
4 |
SerP303 |
CAG9018553 |
281 |
237 |
18 |
- |
R|ITGG |
H |
D |
S |
|
S |
G |
A |
Chymotrypsin-like |
25047 |
4.2 |
5 |
SerP129 |
CAH1365737 |
265 |
230 |
18 |
- |
R|IISG |
H |
D |
S |
|
G |
A |
S |
Chymotrypsin-like |
24439 |
4.0 |
6 |
SerP314 |
ABC88747 |
266 |
232 |
16 |
- |
R|IVGG |
H |
D |
S |
|
G |
A |
S |
Chymotrypsin-like |
24475 |
4.2 |
7 |
SerP7 |
CAG9037665 |
279 |
246 |
16 |
- |
R|IING |
H |
D |
S |
|
G |
G |
S |
Chymotrypsin-like |
25707 |
3.9 |
8 |
SerP39 |
CAG9029806 |
267 |
225 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
G |
S |
Chymotrypsin-like |
23838 |
4.3 |
9 |
SerP54 |
CAH1375188 |
276 |
233 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
G |
S |
Chymotrypsin-like |
24736 |
4.0 |
10 |
SerP107 |
WJL97990 |
277 |
234 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
G |
S |
Chymotrypsin-like |
25428 |
4.3 |
11 |
SerP108 |
CAH1375189 |
276 |
233 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
G |
S |
Chymotrypsin-like |
24987 |
3.8 |
12 |
SerP246 |
CAH1375190 |
275 |
233 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
G |
S |
Chymotrypsin-like |
24822 |
3.9 |
13 |
SerP253 |
CAH1367742 |
277 |
241 |
21 |
- |
R|IIGG |
H |
D |
S |
|
G |
G |
S |
Chymotrypsin-like |
25998 |
4.1 |
14 |
SerP479 |
WJL97991 |
276 |
233 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
G |
S |
Chymotrypsin-like |
25012 |
4.2 |
15 |
SerP33 |
WJL97992 |
256 |
217 |
24 |
- |
R|IVGG |
H |
D |
S |
|
G |
S |
G |
Chymotrypsin-like |
22618 |
4.2 |
16 |
SerP251 |
CAH1372320 |
255 |
232 |
17 |
- |
R|IIVG |
H |
D |
S |
|
G |
S |
G |
Chymotrypsin-like |
24576 |
5.2 |
17 |
SerP101 |
ABC88734 |
258 |
235 |
17 |
- |
R|IVNG |
H |
D |
S |
|
G |
S |
G |
Chymotrypsin-like |
25014 |
6.6 |
18 |
SerP19 |
WJL97993 |
252 |
227 |
16 |
- |
R|IVGG |
H |
D |
S |
|
S |
S |
G |
Chymotrypsin-like |
23900 |
4.5 |
19 |
SerP38 |
QRE01764 |
258 |
229 |
16 |
- |
R|VVGG |
H |
D |
S |
|
G |
G |
D |
Chymotrypsin-like |
24410 |
5.3 |
20 |
SerP88 |
ABC88737 |
258 |
229 |
18 |
- |
R|VVGG |
H |
D |
S |
|
G |
G |
D |
Chymotrypsin-like |
24896 |
5.3 |
21 |
SerP226 |
WJL97994 |
258 |
221 |
22 |
- |
R|LIGG |
H |
D |
S |
|
G |
G |
D |
Chymotrypsin-like |
23606 |
4.2 |
22 |
SerP146 |
CAH1383003 |
262 |
222 |
18 |
- |
R|IVGG |
H |
D |
S |
|
G |
G |
D |
Chymotrypsin-like |
23993 |
4.5 |
23 |
SerP276 |
KAJ3628034 |
284 |
247 |
15 |
- |
R|IIHG |
H |
D |
S |
|
G |
G |
D |
Chymotrypsin-like |
27432 |
6.9 |
24 |
SerP301 |
CAH1380401 |
244 |
221 |
17 |
- |
R|IFGG |
H |
D |
S |
|
G |
S |
D |
Chymotrypsin-like |
23620 |
4.1 |
25 |
SerP368 |
WJL97995 |
247 |
233 |
- |
- |
R|IFGG |
H |
D |
S |
|
A |
G |
D |
Chymotrypsin-like |
24560 |
4.2 |
26 |
SerP137 |
CAH1379909 |
248 |
218 |
19 |
- |
K|IVGG |
H |
D |
S |
|
A |
G |
D |
Chymotrypsin-like |
23683 |
5.4 |
27 |
SerP484 |
CAH1368908 |
247 |
226 |
16 |
- |
R|IVGG |
H |
D |
S |
|
A |
G |
D |
Chymotrypsin-like |
24734 |
5.0 |
28 |
SerP215 |
CAH1380399 |
254 |
231 |
17 |
- |
R|IFGG |
H |
D |
S |
|
G |
A |
D |
Chymotrypsin-like |
24822 |
4.4 |
29 |
SerP586 |
KAJ3636193 |
270 |
224 |
- |
- |
L|KDNG |
H |
D |
S |
|
T |
G |
S |
Chymotrypsin-like |
24961 |
5.0 |
30 |
SerP449 MSP |
BAG14264 |
632 |
258 |
23 |
LDL (4), Sushi |
L|IVNG |
H |
D |
S |
|
S |
S |
G |
Chymotrypsin-like |
28757 |
6.4 |
Table 3.
Domain organization and key structure features of 18 elastase-like SPs of T. molitor.
Table 3.
Domain organization and key structure features of 18 elastase-like SPs of T. molitor.
№ |
Name |
NCBI ID (protein) |
Preproenzyme/Mature Enzyme (aa) |
SignalP#break#(aa) |
Regulatory domain |
Propeptide#break#cleavage site |
Active site |
|
S1 subsite |
Enzyme specificity |
Mm mature, Da |
pI |
1 |
SerP41 |
ABC88760 |
266 |
233 |
19 |
- |
R|IVGG |
H |
D |
S |
|
G |
I |
S |
Elastase-like |
25006 |
4.4 |
2 |
SerP121 |
CAH1368236 |
274 |
236 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
I |
S |
Elastase-like |
26285 |
4.5 |
3 |
SerP144 |
WJL97996 |
268 |
234 |
19 |
- |
R|IIGG |
H |
D |
S |
|
G |
I |
S |
Elastase-like |
25448 |
4.4 |
4 |
SerP238 |
KAJ3632560 |
264 |
234 |
21 |
- |
R|IVGG |
H |
D |
S |
|
G |
I |
S |
Elastase-like |
25330 |
4.2 |
5 |
SerP441 |
KAJ3632561 |
267 |
234 |
22 |
- |
R|IIGG |
H |
D |
S |
|
G |
I |
S |
Elastase-like |
25072 |
4.3 |
6 |
SerP156 |
CAH1380384 |
267 |
236 |
19 |
- |
R|IING |
H |
D |
S |
|
A |
V |
S |
Elastase-like |
25326 |
4.6 |
7 |
SerP94 |
WJL97997 |
266 |
232 |
21 |
- |
H|IVAG |
H |
D |
S |
|
G |
V |
N |
Elastase-like |
24874 |
4.8 |
8 |
SerP120 |
WJL97998 |
268 |
232 |
19 |
- |
H|IILG |
H |
D |
S |
|
G |
V |
S |
Elastase-like |
24988 |
4.7 |
9 |
SerP288 |
CAH1375483 |
266 |
232 |
16 |
- |
R|IVGG |
H |
D |
S |
|
G |
V |
S |
Elastase-like |
24259 |
4.0 |
10 |
SerP472 |
CAH1380701 |
272 |
235 |
17 |
- |
R|IVNG |
H |
D |
S |
|
S |
V |
A |
Elastase-like |
25265 |
4.4 |
11 |
SerP73 |
KAJ3638657 |
267 |
232 |
16 |
- |
R|IING |
H |
D |
S |
|
S |
V |
S |
Elastase-like |
24485 |
4.1 |
12 |
SerP74 |
KAH0820461 |
261 |
229 |
16 |
- |
R|IING |
H |
D |
S |
|
S |
V |
S |
Elastase-like |
23423 |
8.6 |
13 |
SerP110 |
KAH0813654 |
266 |
231 |
16 |
- |
R|IING |
H |
D |
S |
|
S |
V |
S |
Elastase-like |
24831 |
4.2 |
14 |
SerP98 |
CAH1365740 |
267 |
232 |
16 |
- |
R|IING |
H |
D |
S |
|
S |
V |
S |
Elastase-like |
24969 |
4.2 |
15 |
SerP751 |
CAH1365741 |
267 |
232 |
16 |
- |
R|IING |
H |
D |
S |
|
S |
V |
S |
Elastase-like |
24360 |
4.0 |
16 |
SerP185 |
KAJ3620429 |
266 |
233 |
17 |
- |
R|IING |
H |
D |
S |
|
S |
T |
S |
Elastase-like |
24779 |
4.9 |
17 |
SerP155 |
KAJ3632649 |
265 |
235 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
F |
S |
Elastase-like |
24905 |
4.4 |
18 |
SerP85 |
ABC88761 |
267 |
237 |
16 |
- |
R|IIGG |
H |
D |
S |
|
G |
Y |
S |
Elastase-like |
25364 |
4.3 |
Table 4.
Domain organization and key structure features 15 non-annotated SPs of T. molitor.
Table 4.
Domain organization and key structure features 15 non-annotated SPs of T. molitor.
№ |
Name |
NCBI ID (protein) |
Preproenzyme/Mature Enzyme (aa) |
SignalP#break#(aa) |
Regulatory domain |
Propeptide#break#cleavage site |
Active site |
|
S1 subsite |
Enzyme specifi-city |
Mm mature, Da |
pI |
TM (position) |
1 |
SerP18 |
WJL97999 |
258 |
228 |
16 |
- |
K|IVWG |
H |
D |
S |
|
A |
A |
T |
NA |
24375 |
8.4 |
- |
2 |
SerP169 |
CAH1378761 |
257 |
226 |
16 |
- |
K|IVGG |
H |
D |
S |
|
G |
A |
T |
NA |
24300 |
9.9 |
- |
3 |
SerP423 |
CAH1372319 |
279 |
257 |
17 |
- |
R|IVNG |
H |
D |
S |
|
G |
G |
K |
NA |
28256 |
4.5 |
- |
4 |
SerP416 |
KAH0820967 |
300 |
- |
23 |
- |
- |
H |
D |
S |
|
Q |
G |
S |
NA |
- |
- |
277-299 |
5 |
SerP378 |
WJL98000 |
357 |
253 |
22 |
- |
K|ISGG |
H |
D |
S |
|
R |
G |
I |
NA |
28551 |
8.1 |
- |
6 |
SerP424 |
KAJ3633461 |
250 |
227 |
18 |
- |
R|IIGG |
H |
D |
S |
|
R |
G |
V |
NA |
25210 |
7.0 |
- |
7 |
SerP462 |
KAH0817404 |
257 |
- |
23 |
- |
- |
H |
D |
S |
|
T |
S |
F |
NA |
- |
- |
- |
8 |
SerP653 |
CAH1380361 |
252 |
- |
16 |
- |
- |
H |
D |
S |
|
V |
A |
D |
NA |
- |
- |
- |
9 |
SerP355 |
WJL98001 |
551 |
267 |
19 |
LDL (4), Sushi |
L|IVNG |
H |
D |
S |
|
G |
S |
T |
NA |
29980 |
5.1 |
- |
10 |
SerP1040 |
WJL98002 |
432 |
263 |
22 |
Sushi |
L|IING |
H |
D |
S |
|
S |
S |
S |
NA |
27132 |
7.7 |
- |
11 |
SerP454 |
CAH1384889 |
476 |
257 |
15 |
GD |
L|ITHG |
H |
D |
S |
|
S |
S |
V |
NA |
28642 |
7.8 |
- |
12 |
SerP442 |
CAH1384890 |
561 |
257 |
17 |
GD |
L|ISYG |
H |
D |
S |
|
T |
G |
I |
NA |
28750 |
7.7 |
- |
13 |
SerP466 |
KAJ3628554 |
427 |
247 |
23 |
GD |
K|PANE |
H |
D |
S |
|
S |
G |
V |
NA |
27618 |
7.3 |
- |
14 |
SerP550 |
CAH1380129 |
447 |
249 |
18 |
GD |
L|VLKG |
H |
D |
S |
|
G |
A |
I |
NA |
27949 |
8.9 |
- |
15 |
SerP1035 |
CAH1380127 |
568 |
249 |
25 |
GD |
L|VVNG |
H |
D |
S |
|
G |
S |
V |
NA |
27582 |
9.7 |
- |
Table 5.
Domain organization and key structure features of seven polypeptidases of T. molitor.
Table 5.
Domain organization and key structure features of seven polypeptidases of T. molitor.
№ |
Name |
NCBI ID (protein) |
Preproenzyme (aa) |
SignalP#break#(aa) |
Regulatory #break#domain |
Propeptide#break#cleavage site |
Active site |
|
S1 #break#subsite |
Enzyme specificity |
1 |
pSerP448 |
WKK29891 |
892 |
20 |
Sushi (2) |
L|IVGG |
H |
D |
S |
|
S |
S |
G |
Chymotrypsin-like |
Sushi (2) |
L|IVKG |
H |
D |
A |
|
S |
S |
A |
SPH |
2 |
pSerP900 |
CAH1380589 |
891 |
22 |
Sushi (2) |
L|IVGG |
H |
D |
S |
|
S |
S |
G |
Chymotrypsin-like |
Sushi (2) |
L|IVKG |
H |
D |
A |
|
S |
S |
A |
SPH |
3 |
pSerP333 |
CAH1382424 |
891 |
24 |
Sushi (2) |
L|IVSG |
H |
D |
S |
|
S |
S |
G |
Chymotrypsin-like |
Sushi (2) |
L|IVNG |
R |
N |
V |
|
F |
Q |
V |
SPH |
4 |
pSerP382 |
WKK29892 |
837 |
23 |
Sushi (2) |
L|IVGG |
H |
D |
S |
|
S |
A |
G |
Chymotrypsin-like |
Sushi (2) |
L|IIGG |
Q |
D |
R |
|
I |
S |
G |
SPH |
5 |
pSerPH608 |
WKK29893 |
895 |
23 |
Sushi (2) |
L|IVGG |
H |
D |
G |
|
S |
S |
G |
SPH |
Sushi (2) |
L|IIGG |
Y |
D |
G |
|
S |
F |
T |
SPH |
6 |
pSerP614 |
WKK29894 |
1347 |
24 |
Sushi (2) |
L|IVNG |
H |
D |
S |
|
S |
S |
A |
Chymotrypsin-like |
Sushi (2) |
L|IING |
H |
D |
G |
|
S |
S |
S |
SPH |
Sushi |
L|IVNG |
Q |
D |
S |
|
A |
S |
A |
SPH |
7 |
pSerP1050 Nudel |
CAH1374346 |
1830 |
TM (58-80) |
LDL (7) |
R|VVGG |
H |
D |
S |
|
D |
G |
G |
Trypsin |
N|ITSQ |
T |
E |
D |
|
D |
S |
A |
SPH |
Table 6.
T. molitor SP/SPH transcripts with the highest expression levels at the egg stage compared to other stages.
Table 6.
T. molitor SP/SPH transcripts with the highest expression levels at the egg stage compared to other stages.
Table 7.
T. molitor SP/SPH transcripts with the highest expression levels at the early and late pupal stages compared to other stages.
Table 7.
T. molitor SP/SPH transcripts with the highest expression levels at the early and late pupal stages compared to other stages.
Table 8.
T. molitor SP/SPH transcripts with the highest expression levels at the feeding stages compared to other stages and IV instar larvae gut.
Table 8.
T. molitor SP/SPH transcripts with the highest expression levels at the feeding stages compared to other stages and IV instar larvae gut.