9.1. Evolutionary Evidence Derived from Amino Acid Sequence Phylogenetic Analysis
A preliminary phylogenetic analysis in IQ-Tree found some connections between CSPs, CWP, Rho, SamkC, Sec31, SamkC, TIF, SamkC, WAS/WASL, DNA-binding proteins (DBPs), DNA-regulatory proteins (DRPs), and several RNA-binding proteins (RBPs) sequences molecular sequences (
Tables S4-S5; [
45]). Transcriptional/cell division repressor, helix-turn-helix, and DRP from the Xenobiotic Response Element (XRE) family of transcriptional regulators, branched with Bommo-CSP16, Bommo-CSP8, and Bommo-CSP9 and their bacterial counterparts, whereas another orthology group included Trica-CSP AAJJ0012J and Ruminococcus DRP WP_044998036 (nucleotide binding, see [
45]). This suggested that some CSPs have evolved to carry out tasks that may be associated with nucleotide binding, transcription, translation, DNA/RNA templates, DNA/RNA control, and/or intracellular gene expression mechanisms, at least in Bommo and Trica. This initial investigation encouraged us to compare deeper CSPs with intracellular components, such as transcription factors and gene expression regulators.
A phylogenetic examination of Bommo-CSPs, Trica-AAJJ0012J, BemtaCSP1, Myzpe-Mp10, its derivative sequences, Mucin-like sequences, XREs, and several RBPs sequences in PAUP (*10Altivec) provided further evidence for this (see
Tables S3-S5 and
Figure 5 &
Figures S6 and S7). A neighbour-joining tree (BioNJ study) showed that Mp10 did not join the Bommo-CSPs, but instead seemed to be much more closely linked to the Mucins group (G1;
Figure S6A), albeit not clearly forming an orthologous group (
Figure S6B). In G1, the “CSPs” from E. balteatus, EbalDDB_G0285119X1 and CSP3 (QIS77910), linked with Bommo-CSP10 and showed high similarity to Rho-activator isoform X3 (
Table S4;
Figure S6A). The Trica-CSP AAJJ0012J molecule clustered with Allergen Thap1, and associated RBPs instead of BommoCSPs. The marmalade hoverfly Epiba-CSP4 (QIS77191) clearly deviated from the CSPs groups (see G2,
Figure S6A). Only the G1 “EbalCSP”-group maintained a relatively high bootstrap value (94%) in the Jackknife analysis (bootstrapping calculation over 1000 repeats), strongly demonstrating the branching of “CSPs” with Rho GTPase enzyme. A common branch with a significant bootstrap value (57%) was formed by the sequences combining CSP, CWP, LRR, Mucin, SamkC, TIF, and WAS/WASL. Mp10 slipped off of this group, but AglaCWPX3 drew it in (
Figure S6B). The papilionid IpodCSP (CAH2042437), DBPs, RBPs, and TIFs were drawn at the bottom of the tree in this analysis using Bommo-CSP2 and bacterial counterparts as the reference outgroup (
Figure S6B). This strongly implied that there were significant and expected relationships between “CSPs” and all of these various intracellular proteins, mucins, Rho-activators, and translational regulatory factors.
Focusing on Mp10 (referred to as a very typical “CSP”, 153 amino acids, 17.2 kDa, “consensus residues”, four cysteines pattern, whole body expression), we found that this specific protein sequence (XP_022173691) constructs a relevant hierarchical clustering UPGMA analysis and a phylogenetic tree supported by a high bootstrap value when compared to Allergen, Mucin, TIF, NPCP, and ASRP amino acid sequences (
Figure 5). These sequences were not selected at random. The main components of these include mucins, allergens, and phylogenetically related protein CSP, which are widely present in various insect groups (see
Table S3). The blastp analysis of the Mp10 consensus sequence has extracted them all (see
Table S3). The whole “Mp10” group comprises “chemosensory proteins” in addition to mucins, which have emerged as allergens that are highly specific for edible insects, much like OBPs and hexamerins [
139]. However, it's interesting to note that Mp10 does not cluster with other “CSPs” on the UPGMA tree. UPGMA analysis refers to “aka” cladograms or, in very general terms, evolutionary trees. Using a distance-based approach, the UPGMA is constructed by joining the most similar pair of taxa, in this case, Mp10 and mucin proteins (
Figure 5A). Here, the ASRP group—which is interpreted as a dichotomy—is the closest neighbor of the Mp10/Mucin cluster. On the UPGMA, we find that the CSP/Allergen group is more distantly related, as neighbors construct the external nodes with the fewest mismatches. Therefore, our suggestion that CSP has no or minimal role in chemosensing is not so much based on the use of Mp10 as it is on the use of particular Mp10 counterparts. Specifically, TIF could be used to assert or imply that CSPs are not involved in smell —that is, their involvement in carrying the ligand for OR and GR—. The fact that CSP amino acid sequences are present in every group in UPGMA and Bootstrap further supports our contention that CSPs are not involved in smell (
Figure 5).
Mp10 belongs to a large group that includes all Mucin taxa and TIF. It shares a close relationship with Rho-activator (XM_056065396;
Figure 5A). The Rho families of small GTP-binding proteins (20-30 kDa) are not extracellular molecules. These are intracellular proteins that control the actin cytoskeleton-related Rho-GTPase signaling pathways. They act as molecular switches that control a variety of cellular processes, including gene transcription and cytoskeleton-related events. RhoGAPs, one of the main classes of Rho-regulators, are found in all eukaryotes (within the cell) and have been shown to regulate a variety of cellular functions, such as the organization of the cytoskeleton, growth, differentiation, neuronal development, and synaptic functions [
140]. They have nothing to do with smell; they don't trigger transmembrane chemosensory receptors.
As for the other “CSPs”, they belong to either the ASRP group (see the position of BAY56819, KAI5642933, and Bommo-CSP10, in the UPGMA analysis on
Figure 5A) or the Allergen group (which looked to be the most ancestral molecule): XP_002432595, ALC42649, KAG5343447, XP_002092928, EDW02527, OIC81003, and OIC85870 (bacterial proteins). Significant outer envelope proteins, RickA-like (Arp2/3), and particular nuclear nucleoside kinase (NNK) are found in the ASRP group, while IgE-BPs, pherokines (fly hemolymph CSP proteins), acid trehalase (AHX71992, involved in intracellular trehalose mobilization during postdiauxic growth and severe saline stress in yeasts), Cell Wall-Anchored (CWA-3, XP_018563025), and TIF sequence (XP_044745729) are found in the Allergen group (
Figure 5A). RickA is the protein from rickettsiae bacteria, which are carried by ticks and lice. It is involved in bacterial host cell binding and infection as well as the actin-based motility of bacterial cells. It also triggers host cell factors related to the cytoskeleton [
141]. CWA-3 molecule is a component of the cell wall integrity-signaling pathway, which is regulated by small proteins like GTP-BP Rho1. Controlling gene expression and coordinating periodic modifications to the cell wall are the primary functions of CWA molecule during the cell cycle and in response to various forms of cell stress [
142]. NNKs (or nucleoside diphosphate kinases, NDPKs) catalyze the transfer of the terminal phosphate (P) from a donor triphosphate (TP) to an acceptor diphosphate (DP). Arp2/3 is a ubiquitous and essential component of the actin skeleton found in eukaryotic cells. It nucleates actin filaments, caps their sharp ends, and cross-links them to form orthogonal networks. These molecules form a large group that is associated with Mp10, a protein classified as a “chemosensory protein”. The functions of these proteins are not as closely related to smell as they are to the cytoskeleton, actin filaments, transcription of genes, phosphate transfer reaction/exchange, phosphorylation, mitochondrial energy production, TP-DP conversion, and cell regulation (
Figure 5 and
Figure 6).
Mp10, “CSP”, Allergen, TIF, Mucin, NPCP, and ASRP have a very distant common ancestral origin according to the topology of the UPGMA tree, which is based on the assumptions of a common root and constant evolutionary rates for all lineages (i.e., it takes the "Molecular Clock Hypothesis" to account for mutation rates). They are the outcome of a series of duplication events that produced Allergen, which includes CSP and TIF proteins, prior to Mp10, Mucin, NPCP, and ASRP (
Figure 5A). Mp10 (and Rho activator) emerged later in a sequence of duplication events that produced a wide range of Mucin variants, particularly in the mosquito genera Aedes, Anopheles, Culex, Uranotearia, and Wyeomyia (
Figure 5A). The high level of duplication and variation observed in Culpi- and Aedae-CSPs makes this point noteworthy [see 16]. During its evolutionary history, the molecule protein gene family that includes Mp10, CSP, Rho, Allergen, TIF, MUCIN, NPCP, and ASRP appears to have undergone multiple duplications. Some of these duplications are specific to particular taxonomic lineages, like mosquitoes for the long Mucin precursor proteins needed for growth, development, digestion, oviposition, and control of viral infection [
144,
145]. Other gene duplications are more ancient and common to all lineages (see
Figure 5A). This is accurate for the Allergen group, which comprises Trica, ants, damselflies (Ischnura forktails) flies, garden whites (pierids), ladybirds, lice, neodiprions, parasitoid wasps, and tuberworm moths. This also holds true for the ASRP group, which comprises taxa from multiple families of moths and butterflies (swallowtails, speckled woods, Papilio, Pararge, etc; see
Figure 5A &
Table S3). The
Figure 5A’s tree makes it very evident how far apart the ASRPs/CSPs and the Allergens are. All of these molecules have a distant common origin that is estimated to be 324–440 Mya (the latest Mississippian–Silurian or Devonian [
146]), according to the UPGMA tree of amino acid sequences. This is significantly earlier than the emergence of the various flying insect species.
The relationships between the CSP, Mp10, Rho, Allergen, TIF, Mucin, NPCP, and ASRP proteins were further examined using maximum parsimony analysis (
Figure 5B). CSP, Mp10, Rho, Allergen, TIF, Mucin, NPCP, and ASRP molecules are all widely expressed in different tissues, which is already a striking feature. There have been reports of allergens from the abdomen and thorax clones, and Mucin, PAN-1, DAN4, NPCP, WASP, and the YLP motif are present in all body parts, larvae, and pupae (see
Tables S3-S4). There is no role for these proteins in the molecular underpinnings of chemosensing. The YLP motif is known to facilitate RNA binding activity, regulate telomere maintenance, and contribute to the decrease in telomerase activity that occurs during stem cell differentiation by attaching to the core promoter of Telomerase Reverse Transcriptase (TERT) and leading to its down-regulation [
146]. DAN4 determines cellular morphology and plays a crucial role in maintaining cell integrity during cell growth and division, under stress conditions and upon cell fusion. It is a component of the molecular interactions and enzymatic activities in the cell wall in response to different growth phases and toxic signals from the environment [
147]. This is remarkably similar to what it is known about CSP ontogeny, tissue distribution, and response to chemical stress (see
Section 6). Phylogenetic information and expression analysis are combined to link CSPs to intracellular proteins, cell walls, and gene promoter regions. The insect CSP, Mp10, Rho, Allergen, TIF, Mucin, NPCP, and ASRP molecules are closely related to each other; they formed groups with a high bootstrap value (89-100%), ranging from Pedhu-CSP (XP_002432595) to Uralo-Mucin (XP0055592470) and Eupco-Rho (XM_056065396). Together, the Mucin and NPCP molecules form a group with 99% bootstrap value. Additionally, they attach to ASRP and Rho groups, which may indicate a strong relationship between these molecule families based on their extremely high bootstrap values (93-99%;
Figure 5B). Aside protein molecules implicated in actin microtubule association, cytokinesis, Arp2/3, and NDPK, PAN and the ASRP group attaches to CSPs with significantly high bootstrap value (94-100%; Figure 5B,C).
Bacterial CSPs in MP analysis drew Rho but not Mp10. This simply indicates that, as opposed to Mp10, the bacterial CSPs (A. baumannii OIC81003 and OIC85870) and Rho share the fewest evolutionary steps, branches, and common ancestors in this PAUP tree feature. We agree that the evidence for an insect CSP’s potential non-chemosensation role is not very strong when it comes to Rho and bacterial CSPs. We assert that the MP does not exhibit chemo-sensation for CSPs because CSP sequences establish a robust orthology group with 100% bootstrap support for Allergens, Mucins, TIFs, NPCPs, and ASRPs (
Figure 5B,C). It follows that these protein families are not all that dissimilar from CSPs found in insects. The relationships between the major groups of CSPs, allergens, TIF, PAN, ASRP, NPCP, and mucins are clearly depicted in the circle-shaped NJ phylogenetic tree where the various branches interlock (
Figure 5C).
The relationship between the Allergen group—particularly TIF—, and Mp10 stayed closer, indicating that CSP and TIF have certain similarities in many parts (Figure 5B,C &
Figure S7). Coleopteran, dipteran, hemipteran, hymenopteran, lepidopteran, phthirapteran, and zygopteran (Odonata, blue-tailed damselfly) Allergens/CSPs and Mp10 protein are related, albeit they do not form a general orthology grouping, in contrast to Mucins, NPCPs, and ASRPs (
Figure 5B,C). This is where the UPGMA tree and the MP tree (Bootstrap Jackknife) in PAUP4.0b10 (Altivec) study diverge significantly (
Figure 5A,B). Following the evolutionary distances between insect species, Allergens/CSPs, including TIFs, segregated independently on the MP tree (see
Figure 5B). Additionally, as the ASRP group bootstraps on the MP or NJ tree, it approaches Mp10 (Figure 5B,C), suggesting that there are many specific homologies that CSPs, TIFs, and ASRPs have in common (
Figure S7). This may suggest that, despite their extreme age—the eukaryotic cell origin—the TIF-ASRP molecule family is evolving as a fast, intense, frequent, and high rate, much like CSPs [
149,
150,
151,
152]. While further experimental evidence is required to fully understand the functional relationship between CSPs and the TIF and ASRP molecular complexes, it is reasonable to assume that these complexes share a very ancient mechanism for regulating actin skeleton and/or ribosome gene expression (
Figure 5).
The Mp10 orthology of the aphid, beetle, dragonfly, fly, louse, mosquito, moth, and sawfly belongs to different groups, ranging from Mucins and NCPCs/ASRPs to Allergens and TIFs, according to our phylogenetic analysis (see
Figure 5 &
Figures S6 and S7), regardless of anything related to olfaction, taste, and/or chemosensing (see
Table S3). This suggests that, as a result of multiple local duplications, the Mucin, Allergen, TIF, and CSP families underwent common diversification before the emergence of insects (e.g. the Carboniferous Period of the Paleozoic era 299-359 Mya;
Figure 5). Mosquito Mucins were grouped together in accordance with the phylogenetic distances between their genomes. The evolutionary histories of CSP and Mucin proteins are then different. Mp10 orthologs in insects seem to have undergone one to five separate duplication events (see
Figure 5). Four consecutive duplication events may have given rise to Mucins: two early duplications (d1 and d2) produced Allergens/CSPs, TIFs, and ASRPs; two late duplications (d3 and d4) produced Mp10 and Mucins. If the protein amino acid tree is the right tree to explain duplication and evolution within these relatively similar molecule families, then all of the molecules in the CSP, Allergen, PAN, TIF, Mucin, NPCP, and ASRP groups have the same root (
Figure 5 &
Figures S6 and S7d Table S3). The following are linked: Mp10, CSPs, pherokines, ejaculatory buld-specific proteins (Ebsps), and a multitude of immune system, coat proteins, actin-related complex, nuclear complex, cell regulation, and cytoskeleton regulation proteins. The molecular sequences of gustatory receptors and ORs do not correlate in this instance. All of these Allergen-CSP and ASRP-NPCP molecules have a very old origin that dates back to the time before flying insects even existed and even further back to the time of microbes’ prokaryotes’ origin (about 3.5 Bya), even though a fifth duplication that produced DAN4 and NPCP may only be found in Culpi (see
Figure 5).
9.3. Cellular Evidence Derived from Location, Size, Structure, and Possible Expression in Viruses and Microbes
These striking similarities between CSP, Allergen, TIF, Mucin, NPCP, and ASRP molecules suggest that “CSP” molecules function in a multitude of diverse ways inside the intracellular systems of insects and hexapods (
Figure 7).
When it comes to insects and hexapods, CSP binds to LA (C18:2) which is made from arachidonic acid (AA, 20:46) and is required for the biosynthesis of many hormones. C18:2 and 20:46 are important modulators of intracellular processes. LA and AA phosphorylate a wide range of intracellular proteins (enzymes, pumps, receptors, and so forth), which in turn controls a multitude of signal transduction pathways and cellular processes (see [
39]). They are related to the hormones leukotrienes, thromboxanes, and prostaglandins. These three classes of hormones regulate many physiological processes, including innate immune response, ion transport, egg development, and reproduction, but not smell [
156,
157,
158]. In the nucleus, mitochondria, Golgi, ER, plasmic membrane, ion channels, ion pumps, ion transporters, lysosomes, and ribosomes, CSPs may participate in the lipoid and phosphorylation processes that affect the biosynthesis of stress responses (see +,
Figure 6). The fact that CSP binds to linoleic acid in whiteflies suggests this (see [
38]). In the whitefly, the LA-CSP is highly widely expressed throughout the entire body, not just the antennae (see [
38]). Because of its interaction with LA, which helps enable the phosphorylation of many distinct cellular proteins via a variety of metabolic pathways, the CSP is able to participate in this extraordinaly broad field of cellular biology (
Figure 7). In the animal model, LA activates multiple signaling pathways and receptors, regulates ion channels in metabolic cells, and induces calcium signaling and kinase phosphorylation in gustatory cells. Therefore, numerous targets for LA have been identified in a variety of cell types, including neural and pancreatic/liver cells [
159,
160,
161,
162]. Although this is primarily explained in terms of mammals, there is no reason to believe that LA, a lipid crucial to cell physiology in all living forms, will function differently in insects [
163,
164,
165,
166,
167,
168]. Mammal cellular fractions such as the lysosome, ribosome, proteasome, Golgi, reticulum, actin cytoskeleton, microtubules, and mitochondria are the targets of LA [
164,
165,
166,
167,
168]. LA is essential for the assembly of parts involved in oxidative phosphorylation (OXPHOS, the electron transfer chain coupled to ATP synthesis) in the mitochondria, which is also involved in the immune response of insects [
167,
168]. Insect immune response is achieved when mitochondria, OXPHOS, and LA are combined [
169]. The extremely broad action of LA addresses the CSP’s tissue expression (extremely broad; see
Figure 3). This places the CSP-LA complex mostly in the intracellular space (see
Figure 7).
We wildly speculate about the potential function of CSPs and CSPs-like proteins in virtually every aspect of cellular biology, from prokaryotes to eukaryotes, taking into account the incredibly wide diversity of LA effects, LA molecular and cellular targets, and the extremely wide tissue distribution of CSP attaching LA. The “CSPs”, along with LA, AA, FAs, and stress responses, could mediate the mechanisms controlling lipid metabolism in the cell cycle at the molecular translational level in the cell ribosome of insects [
38,
39,
170,
171]. Moreover, CSP could interact with cytochrome P450 (CYP) enzyme in the ER and the mitochondrial system in response to insecticide exposure (+) and other stress responses. In response to insecticide exposure, this CSP-CYP interaction may be important for cellular metabolism, homeostasis, hormonal synthesis, toxin catabolism, and xenobiotic detoxification [
20,
172]. It is possible that the CSP-Degradative Enzyme (DE) system mediates the lysosome's functions, which include defense against pathogens such as microbes, bacteria, fungi, and viruses, as well as the digestion and breakdown of macromolecules (proteins, lipids, carbohydrates, and nucleic acids) and repairs of cell membranes (
Figure 6A). When food is eaten or absorbed by the cell, the lysosome releases its enzymes, which convert complex molecules—such as sugars, proteins, lipids, FAs, AA, and/or LA brought by CSPs— into energy that the cell needs to survive [
173]. After that, the desaturase enzymes, the ER membrane, and the eversible vesicles that secrete the cuticle, the duct, and the pheromone compound will likely also need to be worked with the CSPs if they are to transport FAs like LA and its two lipid precursors, stearic acid (SA, C:18) and elaidic acid (EA, C:18-1). We hypothesize that the coordinated action of CSPs, FAs, and ER in the sex pheromone gland would be strictly necessary for the lipid droplet formation and pheromonogenesis. We firmly believe that the sites of metabolism that are likely regulated by CSPs are not the activation of OR on sensory neuron dendrites, but rather the ER membrane of the sex pheromone gland [
174,
175]. This notion is reinforced by the tissular distribution of CSPs and the precursor role of LA in several lipid biosynthesis pathways.
Here, we suggest that CSP is linked to the ER, as well as to the endomembranes from the Golgi, mitochondria, and plasma membrane via Rho GTPase signaling complex, Mucin, ASRC, and NPCP (see
Figure 5,
Figure 6 and
Figure 7). Perhaps Rho, Mucin, ASRC, and NPCP are connected to CSPs, allowing these molecules to protect the cell membrane, control membrane interactions, regulate the trafficking of protein precursors and lipids between the different cellular organelles, particularly the lysosomes and Golgi system [
176,
177]. Interestingly, we note that “CSP” consensus is the N-terminal region of “Pglb”, a protein that resembles the glycoprotein receptor on the surface of human platelets (
Figure 6A &
Figure S7). As a result, it may reside outside of cells in order to identify pathogens, trigger immune responses, and/or receive molecules like FA or JH and stimulate cell growth. Mechanisms linked to glycoprotein receptors are remarkably old, dating back to worms [
178]. The fact that splicing factor (SF), TIF, and NDPK are linked by CSP consensus is also noteworthy (
Figure 5 and
Figure 6 &
Figures S7 and S8). This connects the CSP to nucleotide binding, nucleoside kinase, RNA metabolism, translation, splicing, and nucleus-wide regulation of gene expression rather than odor binding. The relationship between CSPs and mucins, however, may be the most intriguing of all the Mp10 relatives that were taken from the NCBI database, as it would explain a function at both the sensory lymph level and the gut epithelial membrane (
Figure 5,
Figure 6 and
Figure 7 &
Figures S7 and S8d Table S3). Mucins are the most prevalent macromolecules in mucus in vertebrates and are in charge of the biochemical and physical characteristics of mucus. This role is strikingly similar to that of mucins in insects and hexapods. Similar to vertebrates, insects’ mucins belong to three different families within the same tissue: transmembrane, gel-forming, and soluble, with possible strong similarities to CSPs in these categories [
179,
180]. In vertebrates, it is expressed in saliva, tears, glands, ear fluid, brain, lung, intestine, liver, kidney, heart, muscle, testis, ovary, and skin [
181]. The transcripts of Drome mucins localize to specific organs and fluids that are similar to the sites where vertebrate mucins are expressed. It is also found that Drome mucins are present throughout the whole life cycle of the fly [
182]. One prominent example is “JiangShi”, an extracellular glycoprotein that bears a strong resemblance with mammalian “soluble” mucins. It is one of the most extensively distributed mucin-like proteins that has been extensively studied due to its crucial role in the growth and development of Drome [
183]. Therefore, the family of mucin’s rather widespread expression bears a striking resemblance to the role, ontogeny, and tissue distribution of CSPs (see
Figure 3 &
Figure S4 and
Section 5).
The midgut of beetles is characterized by the expression of mucus-forming mucins, which also bear a striking resemblance to the mucus found in vertebrates. It is thought that the primary functions of the mucus that surrounds food in the midgut are tissue protection and improved digestion though the facilitation of enzyme recycling [
184]. Additionally, the gut of the insect has prominently enlarged posteriormost crypts that are likely specialized for vertical symbiont transmission. They have a significant impact on shielding host cells from harmful outside environments and/or invasive microbes. Like Bommo-CSP10, gut mucins are defined by highly glycosylated proteins that make up the mucin domain, which is rich in repetitive sequences of Thr, Ser, and Pro residues. These Multi-Proline mucins are primarily expressed in the gut and the female-specific symbiotic organ [
185]. Eight mucin genes have been studied in locusts, and the results indicate that they are necessary for the growth and development of the species, vital for survival, involved in the formation of the gastric caecum, necessary for the formation of the peritrophic membrane in the midgut, or necessary for the cuticle of their wings to remain structurally intact [
186]. These functions would effectively handle the distribution of CSP throughout the entire body of the insect (see
Figure 3). Mucin, which is secreted into rice during feeding and highly expressed in salivary glands in true bugs, elicits plant immune responses, much like Mp10 [
187]. An eggshell-related mucin-like protein, NIESMuc, was also identified in true bugs, specifically in the brown planthopper (Nillu). It is very likely that NIESMuc is necessary for regular oviposition and/or egg growth and development because it was highly expressed in the follicular cells of the ovarioles’ egg chamber in Nillu females [
188]. Lastly, gene expression analysis demonstrated that these mucins are dynamically expressed in various Ae. albopictus mosquito organs and throughout different developmental stages. Notably, blood feeding increased the expression of mucins in the ovary, fat body, and midgut. This clearly indicates that these mucins may serve similar developmental, growth, digestive, reproductive, and intestinal defense roles as CSPs in a multitude of insect species [
189].
Thus, in many different tissues of the digestive and reproductive tracts, as well as in peripheral sensory organs with a large surface area exposed to the external environment, both Mucins and CSPs may cooperate to support the immune system against infections, pathogens, viruses, and microbial or chemical stress [
190,
191]. The exposed epithelia and sensory neuronal membrane of mammals are protected from environmental toxins by their mucous barrier. In insects, crustaceans, and mammals alike, this barrier serves as a lubricant during organ movements and as a mediator of substance exchanges between internal organs and the environment. Similar to mucus, ‘lymph’ is produced by surface cells, epithelial cells, and glands that are closely related to other components of the innate and adaptive immune systems. Lymph acts as the first line of defense against pathogens. Much like mucus, a large variety of fluids and secretions, venom, and saliva, the building blocks of the lymph are mucins and CSPs/OBPs. The capacity of mucins, CSPs, and OBPs to bind lipids in diverse secretions and tissues is a shared characteristic [
192,
193,
194,
195]. Perhaps lipids are transported by CSPs, and lipid aggregation and droplet formation are facilitated by mucins. Gastric mucins have long been known to mediate tissue protection through lipid binding [
195]. In the gut, interactions between mucin and lipid droplets have been specifically studied [
196]. They may engage in direct relations with the gut microbiome through interactions with short-chain FAs there [
197]. It would be interesting to look into whether the primary component of the mucin-CSP association is the interaction between mucins and short-chains of lipids, while CSPs interact with long-chain lipoid molecules in the brain, gut, fat body, and sex pheromone gland. This is due to striking resemblance that we have shown here between CSPs and insect mucins (sequence identity, structural folding, response to toxins, broad tissue distribution, and developmental profiling; see
Figure 3,
Figure 4,
Figure 5,
Figure 6 and
Figure 7 &
Figures S4–S8 and
Tables S3-S5).