II.3. Molecular Basis of CDG
The molecular bases of CDG where N-glycosylation is affected are rooted in the glycosylation process itself. Therefore, the molecular bases of CDG-I involve the impairment of some of the glycosyltransferases involved, a decrease in the synthesis of activated sugars, or their availability in the cytoplasmic face and lumen of the ER. Additionally, catalytic impairments of the flippase [
27] and OST [
21] have been implicated. CDG-II includes defects in vesicular trafficking of N-glycoproteins from the ER to the GA, and within the GA, from the cis to the trans portions. Moreover, there is a decrease in the availability of activated sugars in the GA due to problems in their synthesis or transportation, impairment of glycosidases and glycosyltransferases involved, and the involvement of proteins related to the proper functionality of the GA [
8,
28].
We consider it necessary to clarify that, henceforth, alterations in the first or second stage of glycosylation will be referred to as CDG types (CDG-I and CDG-II, from the former nomenclature), and specific defects in enzymes, transporters, among others (corresponding to the lowercase letters of the former nomenclature), will be termed CDG subtypes. Furthermore, for convenience, both the old and current nomenclatures will be used interchangeably.
Up to now, 15 subtypes of defects in the N-glycosylation pathway associated with the structure of some glycosyltransferases have been described: ALG6-CDG, ALG3-CDG, ALG12-CDG, ALG8-CDG, ALG2-CDG, DPAGT1-CDG, ALG1-CDG, ALG9-CDG, ALG11-CDG, DDOST-CDG, ALG13-CDG, STT3A-CDG, STT3B-CDG, MGAT2-CDG, and Β4GALT1-CDG. Additionally, 12 subtypes of defects in these diseases are caused by the impairment of the availability and biosynthesis of monosaccharides and their respective sugar-nucleotide diphosphate and sugar-Dol-P donors. These include: PMM2-CDG, MPI-CDG, DPM1-CDG, MPDU1-CDG, DPM3-CDG, PGM1-CDG, DPM2-CDG, GNE-CDG, GFPT1-CDG, GFPT2-CDG, SRD5A3-CDG, and DK1-CDG [
29]. Furthermore, 2 subtypes result from deficiencies in enzymes that remove monosaccharide units: MOGS-CDG and MAN1B1-CDG; 4 are the product of dysfunctional monosaccharide transporter proteins: SLC35C1-CDG, SLC35A1-CDG, SLC35A2-CDG, and SLC35A3-CDG, while 1, RFT1-CDG, corresponds to a flippase alteration. Similarly, in 3 CDG subtypes, the structures of proteins involved in vesicular movement of glycoproteins from the ER to the GA are affected: SEC23A-CDG, SEC23B-CDG, and SSR4-CDG.
The products of 8 genes are related to the structure and functionality of the GA. Their structural and functional modifications have been linked to 8 mixed-type glycosylation disorders, where both N- and O-glycosylation are affected: COG7-CDG, COG1-CDG, COG8-CDG, COG4-CDG, COG4-CDG, TMEM165-CDG, COG6-CDG, and ATP6V0A2-CDG [
30].
II.4. Diagnosis of Congenital Disorders of N-Glycosylation
Hundreds of different types of proteins susceptible to damage are involved in the glycosylation process, leading to the belief that new subtypes will be diagnosed in the coming years [
39]. In fact, new defects in both N- and O-glycosylation pathways are identified each year, and since it is estimated that there are around 500 genes involved in the synthesis or function of glycoproteins, the currently known set may only be
“the tip of the iceberg”. This concerns the international scientific community due to the high morbidity and mortality associated with CDG [
40].
The organic alterations of CDG can manifest at different stages of life, involving any system. However, the broad clinical spectrum mainly affects the central and peripheral nervous systems, digestive system, musculoskeletal system, hematological system, immune system, and integumentary system. Among the associated symptoms and signs, those most commonly highlighted include psychomotor retardation, failure to thrive, seizures, hypotonia, abnormal fat accumulations, inverted nipples, and various clinical manifestations associated with hepatopathies, enteropathies, and coagulopathies [
39,
40,
41,
42,
43,
44,
45].
The diagnosis of CDG is complicated because they involve nearly all clinical specialties, and also due to the limited knowledge of such defects among healthcare providers. Moreover, only a small number of centers screen for CDG due to technical complexities and result analysis. Additionally, CDG are not clinically diagnosable as their symptoms and signs are nonspecific. Furthermore, they mimic some diseases that are better understood, for instance, PMM2-CDG, the subtype with the highest number of confirmed cases, mimics mitochondrial diseases such as MELAS (mitochondrial encephalomyopathy, lactic acidosis, and stroke-like episodes) and NARP (neuropathy, ataxia, and retinitis pigmentosa) [
46,
47]. Therefore, it is not surprising that there is a significant underdiagnosis of these conditions.
The positive diagnosis is purely biochemical, and the primary biological marker is serum transferrin (Tf), a glycoprotein responsible for transporting iron (Fe
3+) in plasma [
7,
8,
27,
39,
40].
Tf is synthesized and metabolized primarily in hepatocytes and is present in the blood at high concentrations. For instance, in children under 1 year, the range is from 125 to 270 mg/dl, and in those over 1 year, it is from 200 to 350 mg/dl. It is composed of a polypeptide chain of 679 amino acids with a molecular weight of approximately 80 kDa and an isoelectric point (pI) that can vary between 5.4 and 5.9. Each transferrin molecule consists of two lobes of similar internal and independent structure for the binding of Fe
3+; the lobe disposed toward the N-terminal contains residues 1-336, and the other (C-terminal) contains residues 337-679. The glycosidic prosthetic group is found in this latter lobe and is constituted by two complex N-glycan chains linked to residues Asn 413 and Asn 611. These chains vary in their degree of branching, each may present up to 4 antennas [
48,
49,
50]. If the N-glycosylation process involved here proceeds without defects, each antenna will terminate in a Neu5Ac residue (mononegatively charged).
Tf presents various isoforms, each with a pI and a molecular mass that represents it; thus, they can be identified based on these characteristics.
The Tf isoforms arise from:
Differential composition of the amino acid sequence in the primary structure, resulting from genetic polymorphisms. Among the allelic variants, the most prevalent is Tf C; of these, 16 subtypes have been described, with Tf C1 being present in over 95% of cases and having a pI of 5.4. Variant Tf B has a pI of 5.2 and Tf D, 5.7. This comparison of pI is made considering they have the same Fe
3+ content and carbohydrate composition [
51,
52,
53].
Differential composition of carbohydrate chains (
glycoforms). The major isoform, known as tetrasialotransferrin (tetrasialoTf) (pI 5.4), presents two biantennary N-glycan chains, corresponding to four terminations in Neu5Ac residues. However, isoforms can vary from asialotransferrin (asialoTf) to octasialotransferrin (octasialoTf), meaning from no chains to two tetraantennary ones, respectively. Nonetheless, the isoforms following tetrasialoTf in concentration are pentasialotransferrin (pentasialoTf) and trisialotransferrin (trisialoTf). Additionally, very small amounts (less than 2.5%) of isoforms with fewer than three Neu5Ac residues are determined; these isoforms are generally termed carbohydrate deficient transferrin (CDT), corresponding to asialoTf (pI 5.9), monosialotransferrin (monosialoTf) (pI 5.8), and disialotransferrin (disialoTf) (pI 5.7). From a more concrete perspective, the Neu5Ac content can range from 0-8 and determines the microheterogeneity of the transferrin molecule. The variations in the pIs of these isoforms are 0.1 units for each Neu5Ac residue attached [
11,
40,
53].
Differential Fe
3+ content. Each Tf molecule can contain a maximum of two Fe
3+ depending on the iron supply to the body. The pI of the Tf molecule decreases by approximately 0.2 units for each Fe
3+ bound [
53].
In patients with defects in N-glycosylation, as expected, there is an increase in the concentration of CDT and trisialoTf, which forms the theoretical basis for the diagnosis of CDG of the N-glycosylation pathway through isoelectric focusing (IEF) with immunofixation of serum transferrin, provided that the isoforms of genetic polymorphisms and Fe
3+ content remain constant [
11,
40,
53].
In polyacrylamide gel IEF, each type of protein can be
“focused” into a narrow band based on its pI. Small amounts of the solution containing the protein in question are applied onto a polyacrylamide gel support [
11,
40,
53].
The gel is formed by polymerization of the acrylamide monomer [CH
2=CH-C(=O)-NH
2], which generates linear chains that are crosslinked together by the crosslinking comonomer N,N’-methylenebisacrylamide (or simply bisacrylamide) [CH
2=CH-C(=O)-NH-CH
2-NH-C(=O)-CH=CH
2]. The concentration of acrylamide used determines the average length of the polymer chain, while the concentration of bisacrylamide determines the degree of crosslinking. Therefore, the proportion in which both are present determines the size of the pore. When the electrokinetic separation aims to exclude the steric effect induced by the protein’s molecular mass, the best resolution is obtained under acrylamide concentrations less than 5% and bisacrylamide concentrations less than 3%, which avoids excessive sieving. Thus, it is carried out in highly porous dispersion media [
54].
Polymerization is carried out using redox catalytic systems, employing chemical and/or photochemical catalysts. In the former case, the most commonly used are: ammonium persulfate, (NH
4)
2S
2O
8 or APS, as an oxidizing agent, causing the formation of free radicals from the monomer, which are produced by the action of oxygen free radicals (due to the action of persulfate ions), and N,N,N’,N’-tetramethylethylenediamine (TEMED) as a reducing agent, which produces the formation of persulfate free radicals. This reaction is strongly inhibited by high levels of oxygen, so the solution must be degassed to achieve reproducible gel formation. In photochemical catalysis, primarily riboflavin mononucleotide (FMN) and TEMED are used. This polymerization is induced by the free radicals that appear in the chemical decomposition of APS (S
2O
82- to 2SO
4-) or the photodecomposition of riboflavin to leucoflavin in the presence of O
2. In either case, TEMED, a free radical stabilizer, is commonly added to the gel mixture. Photopolymerization with riboflavin is not inhibited by oxygen; however, in both cases, TEMED acts as a mild reducing agent and accelerates polymerization [
54].
The APS has a high tendency to spontaneously decompose, and undergoes homolytic cleavage, forming sulfate radicals that are unstable and tend to recombine. Although this initiates the polymerization reaction, the addition of TEMED as a propagator is required. TEMED forms stable free radicals in the presence of sulfate radicals, which contributes to the propagation of the polymerization reaction and prevents its extinction. By adjusting the amounts of persulfate and TEMED, the polymerization rate can be controlled. Additionally, polymerization can also be achieved with FMN instead of APS. When riboflavin is illuminated with ultraviolet radiation, its photodecomposition occurs, generating free radicals that initiate polymerization. In this case, TEMED is not essential, but it can be added as it facilitates the process [
54].
IEF is considered as a stable gradient electrophoresis of H
+ ion concentration, measured as pH. Therefore, macromolecules will migrate through the pH gradient as long as they maintain their net positive or negative charge until they reach the point in the gradient that corresponds to their pI value. At this point, since the net charge is zero, migration ceases. The pH gradient is established through the use of low molecular weight amphoteric substances, called ampholytes, oligomers with a low molecular mass (300-600 D) that carry amino and carboxyl groups with a range of pI values. The gradient involves a decrease in pH from the cathode to the anode. Under the influence of an electric field, in a solution, different classes of ampholytes will separate according to their pI values, so that the most basic ones gradually step towards the cathode while the most acidic one’s step from the anode. These molecules have buffering capacity, so the pH of that zone will be that of the corresponding pI. When equilibrium is reached, a pH gradient is formed along the gel. The pH gradient is generated and maintained by establishing an electric field [
54].
The technique of serum Tf IEF is the most commonly used diagnostic test for detecting N-glycosylation defects. In patients with CDG, Tf IEF shows a cathodic shifting profile, indicative of a defect in negative charge(s), namely, in Neu5Ac(s) [
7,
8,
39,
40,
55].
In individuals with CDG-I, an IEF pattern is observed that suggests a relative deficiency of tetrasialoTf (not fully manifested) with an increase in the isoforms asialoTf, disialoTf, and eventually trisialoTf, referred to as pattern-I. On the other hand, in patients with CDG-II, patterns are presented where the remaining low-sialic acid isoforms are added. The monosialoTf isoform, as can be inferred, is excluded from pattern-I and appears in pattern-II; therefore, it is considered a good differential indicator of patterns [
8,
55].
Some types of CDG cannot be identified through IEF analysis because in special cases sialylation is not altered, for example: MOGS-CDG, SLC35C1-CDG, in alterations of one of the two GDP-fucose transporters of AG, and in TUSC3-CDG. This, coupled with the laboriousness and time required by the IEF technique, has led to the need for costly techniques such as high-performance liquid chromatography (HPLC), capillary zone electrophoresis (CZE), and mass spectrometry (MS) in the last two decades [
56,
57,
58,
59,
60,
61].
Tf polymorphisms and variations in Fe
3+ content can also produce anomalous patterns in IEF. These situations can be respectively ruled out by performing a preincubation of the sample with neuraminidase and saturating it with Fe
3+. In cases with polymorphism, it is useful to perform IEF of other glycoproteins, for example: haptoglobin, hexosaminidase, and α1-antitrypsin, to search for a generalized glycosylation defect. Since different polymorphic variants of Tf C1 are infrequent, neuraminidase treatment can be reserved for cases where alterations in IEF bands are observed compared to internationally reported standards in the scientific literature. Saturation should precede all serum Tf IEF for standardization purposes [
53,
63,
64].
It is valid to clarify that there are other conditions in which the IEF analysis of Tf may appear normal, in addition to those previously mentioned. For example, fucose defects cannot be detected since they do not interfere with the addition of Neu5Ac to the glycoprotein. Therefore, in patients suspected of having a fucose defect, blood groups should be determined, as they are all of the Bombay blood group [
65].
A normal IEF pattern, in the case of MOGS-CDG, has only been described in one case; however, it has been observed in several adolescent and adult patients with PMM2-CDG, the most frequent subtype. This may be due to the mild-to-moderate severity of these cases, which, combined with a very slight alteration in the Tf profile, may not be detected with conventional procedures. In fact, in CDG, the degree of alteration in the serum Tf profile seems to correlate with the severity of the clinical picture.
Alterations in the Tf IEF profile may also be observed secondary to diseases other than CDG, for example: fructosemia and galactosemia at the time of diagnosis or with poorly controlled disease, as they cause altered glycosylation. Similarly, this altered Tf pattern is observed in chronic alcohol consumption. This implies that these alterations (fructosemia, galactosemia, and chronic alcohol consumption) should be ruled out before confirming a positive diagnosis of CDG through IEF [
68,
69,
70,
71,
72].
Therefore, for accurate interpretation, it is necessary to use both a positive and a negative control. A serum treated with neuraminidase serves as a positive control, or more precisely, positive cases diagnosed using the combined HPLC-MS method. As a negative control, serum from a healthy individual, confirmed likewise by HPLC-MS, should be used [
53,
73].