Mucins are a class of high molecular weight glycoproteins with molecular weights typically ranging from 0.2 to 10 million Dalton[
1], which are mainly synthesized by goblet cells and combines with inorganic salts suspended in water to form mucus. They cover the surfaces of the respiratory, digestive, gastrointestinal and genitourinary tracts, protecting epithelial cells from infection, dehydration, and physical or chemical damage, providing protection and lubrication for the epithelial surface. They are widely expressed in the body and are associated with many physiological and pathological processes[
2]. So far, there are 20 mucins in human body have been discovered, including the secreted mucins MUC2, MUC5AC, MUC5B, MUC6, MUC19, MUC7, and MUC9, of which MUC2 is the main form of secreted mucin[
3,
4], as well as transmembrane mucins including MUC1, MUC3A , MUC3B, MUC4, MUC11, MUC12, MUC13, MUC15, MUC16, MUC17, MUC20, MUC21 and MUC22[
5]. In addition, there are some proteins that belong to atypical or mucin-like protein molecules, atypical mucins including MUC10, MUC14, and MUC18. Mucin-like protein molecules are exist in parasites, viruses and fungi,
Herpes virus has a mucin region, and
Toxoplasma Gondii have mucin-like structural domains, and
Candida albicans has mucin-like proteins. Similar to the structure and function of mucins, mucin-like protein molecules have structural domains rich in Pro, Thr and Ser, which can undergo extensive O-glycosylation and also act as barriers to protect cell membranes[
6,
7]. For example, the Msb2 glycoprotein of Candida albicans produces a mucosal layer to protect cells. This protein is considered a functional analogue of mammalian MUC1/MUC2[
8]. Transmembrane mucins have a C-terminal cytoplasmic tail, a transmembrane region and an extracellular portion. Characterized by the SEA (sea urchin spermatoglycan, enterokinase and agrin) structural domain and the VWD (vascular haemophilic factor D) structural domain. Unlike transmembrane mucins, secreted mucins lack transmembrane structural domains and exist in a secreted form[
9]. Typical secreted mucins consist of a VWD domain rich in N-terminal cysteine, followed by a C-terminal "CK" domain. The N-terminal participates in polymerization through intermolecular disulfide bonds, while the C-terminal "CK" domain participates in monomer dimerization[
10] (
Figure 1). Both types of mucins have a highly glycosylated protein core, also known as the PTS domain, with multiple tandem repeats rich in Pro, Ser or Thr in the PTS domain. The size and number of tandem repeat sequences in different mucins, hence they are called variable number tandem repeats (VNTR), which means that there may be significant differences in size between individuals or individual mucins[
1]. Secreted mucins are responsible for conferring viscoelasticity to epithelial tissues, and transmembrane mucins are involved in maintaining epithelial cell polarity and cell signaling[
11]. In addition, the expression patterns of mucin genes in the respiratory, digestive, and reproductive tracts are complex and strictly regulated. The expression of each mucin has organ and cell specificity[
12]. Available studies have shown that the mucin components of the lungs are mainly MUC5AC and MUC5B, MUC5B is essential for cilia motility, while MUC5AC is more responsive to environmental or infectious factors, and elevated concentrations of MUC5AC may contribute to the development of chronic obstructive pulmonary disease (COPD)[
13]. Terada found by immunohistochemistry that normal gastric mucosa specifically expresses MUC2, MUC5AC and MUC6, but never MUC1, colorectal crypt epithelial cells highly express MUC2, but never MUC1, MUC5AC and MUC6[
14]. In the female reproductive tract, the major transmembrane mucins are MUC1 and MUC4, and the major secreted mucins are MUC5B and MUC5AC[
15]. These evidences suggest that the expression of mucins indeed organ and cell specific.
As mentioned earlier, mucin possesses a highly glycosylated PTS domain, which has a large and dense glycan chain structure. It is similar to a “bottle brush”, which allows carbohydrates to make up 80% of the total mass of the mucin. These glycan chains consist of a large number of O-glycans, and we refer to mucins and the glycans on them collectively as mucin glycans. Mucin glycans are beneficial for maintaining a high viscosity state (
Figure 2). For example, in the stomach, high viscosity helps to lubricate indigestible lumen contents and accelerate gastric emptying. At the same time, they can protect epithelial cells from dehydration and mechanical forces during the passage of lumen contents[
16]. However, abnormally high viscosity can also lead to disease, and it has been shown that abnormally high mucin viscosity leads to increased concentration of gallbladder bile and possible formation of gallstones[
17].
It should be noted that mucins encoded by the same gene may have different structures due to different glycosylation patterns. Meanwhile, their expression levels and glycan patterns may vary depending on the organization and species, which may lead to the occurrence of diseases[
18]. This may be due to changes in the topology, function and expression of individual glycosyltransferases and their molecular chaperones leading to glycan loss, which results in the production of aberrant glycan chains on mucins[
19]. Existing studies have shown that alterations in intestinal mucin glycans are associated with a variety of intestinal disorders, and that they can lead to dysbiosis, which usually occurs in the early stages of enterocolitis[
20]. Alterations in O-glycans on MUC2 have been observed in ulcerative colitis (UC), which refers to an increase in the number of truncated O-glycans Tn antigen (GalNAcα1-O-Ser/Thr), STn antigen (Neu5Acα2-6GalNAcα1-O-Ser/Thr) and a decrease in the number of complex O-glycans on MUC2. These abnormal mucin glycans will further aggravate UC[
21]. In addition, during the development of cancer, the glycosylation pattern of mucins also changes. This abnormal glycosylation pattern can alter cell function and participate in cancer cell proliferation, invasion, metastasis and angiogenesis[
19]. For example, the mucins of normal breast epithelial cells contain a mixture of O-glycans, most of which are core 2 structure (GlcNAcβ1-6Galβ1-3GalNAcα1-O-Ser/Thr) , but in breast cancer (BC) there is a decrease in core 2 structure and an increase in sialyl Lweis x (NeuAcα2-3Galβ1-4 [Fucα1-3] GlcNAc, SLe
x) antigen, which can lead to enhanced the adhesion of BCcells to endothelial cells[
22].