1. Introduction
Pork is a significant and extensively utilized animal resource, that has emerged as a principal protein source within human diets. In recent years, China’s yearly pork production has surpassed 50 million tons. Notably, Duroc × (Landrace × Yorkshire) (DLY) pigs have accounted for over 90% of the pork market due to their rapid growth and high lean meat rate [
1]. With improved living standards, an increased pursuit of enhanced pork quality arises among individuals. Meat quality is a crucial indicator for assessing pork production and quality. Essential indicators of meat quality encompass intramuscular fat content (IFC), meat color, tenderness, and dripping loss, which can directly impact pork quality and market competitiveness [
2]. Consumers favor snowflake meat, and IFC deposition is the main cause of snowflake meat [
3]. Notably, meat color is also one of the most direct sensory indicators of pork quality for consumers and directly affects their consumption behavior. A present finding has indicated that the phenotypic score of the meat color manifested a stronger correlation with the meat color redness (a*) value (R = 0.8) [
4]. Despite DLY pork effectively meeting the quantitative demand, its muscle quality falls short of eliciting satisfaction. Both IFC and a* value are traits with relatively high heritability [
5,
6] and are the most intuitive indicators of high-quality pork. Consequently, increasing the IFC and a* value through genetic improvement is a major research focal point for pig breeding enterprises.
IFC refers to the amount of fat accumulating between muscle fibers or within muscle cells, mainly composed of phospholipids and triglycerides [
7]. Due to the relatively fixed phospholipid content, changes in IFC were primarily caused by triglyceride changes [
8]. It is widely accepted that changes in meat color in muscles are due to changes in myoglobin levels. This may be due to higher myoglobin levels in slow/oxidative myofibers (red muscle fibers) than in fast/glycolytic myofibers (white muscle fibers). When there is a high proportion of red muscle fibers in muscle tissue, its muscle color exhibits a more distinct red characteristic [
9]. This phenomenon is closely related to the biochemical markers of meat, such as oxidation state, cytochrome content, and redox forms. Previous studies have shown a significant correlation (R = 0.260 ~ 0.323) between IFC and meat redness (a*) [
10,
11]. Therefore, we speculated that these two traits might have similar genetic backgrounds, but the underlying genetic basis was largely unknown.
It is generally believed that the difference in phenotype is caused by changes in gene expression levels. Therefore, the variation in IFC and a* value within a population is thought to be driven by differences in the expression levels of critical genes involved in regulating these two traits. With the development of next-generation sequencing technologies, the emergence of transcriptome sequencing (RNA-seq) allows us to detect the expression levels of all genes across the entire genome. Researchers usually used individuals with extreme phenotypes of IFC and a * value to perform RNA-seq, which could obtain many candidate genes and signaling pathways related to IFC and a* value [
12,
13,
14].
However, organisms are complex systems with interconnected genes regulating biological activities, forming intricate network systems. Therefore, it is crucial to consider the interrelationships between thousands of genes when studying phenotypic variation. Differential expression analysis may not capture critical biological pathways or gene-gene interactions relevant to target traits, as it focuses on the impact of individual genes rather than the influence of gene networks [
15]. Coexpressed genes often form densely connected subgraphs in networks, representing functionally related gene groups or signaling pathways, and exhibit specific biological functions by developing local substructure modules [
16]. These modules reveal interactions among genes at a systems level, aiding researchers in further understanding the mechanisms underlying gene interactions and identifying regulatory hubs of coexpressed genes [
17]. Weighted gene co-expression network analysis (WGCNA) is an efficient and accurate method for describing the correlation among all genes or modules within the whole genome with traits. It is particularly advantageous for simultaneously identifying key genes of multiple complex traits [
18], such as fat deposition [
13], meat quality [
19], and reproductive performance [
20].
Based on transcriptomic data, the present study aimed to gain molecular insights into the hub genes and metabolic pathways that coregulated the variations in IFC and a* value. We collected individuals with divergent IFC and a* value for RNA-seq. Subsequently, we identified the differentially expressed genes (DEGs), and performed the gene set enrichment analysis (GSEA), WGCNA, and protein-protein interaction (PPI) analysis. We identified the candidate genes and modules significantly related to these two traits. Through systematic integration of the above results, we identified the hub genes and pathways that could co-regulate the changes in IFC and a* values. These findings can contribute to understanding the genetic mechanisms of co-regulation changes in IFC and a* value. Moreover, the identified hub genes may serve as potential biomarkers for the synergistic improvement of IFC and meat color in pigs.
4. Discussion
DLY pork is dominated in the pork industry, however, its IFC is low, and the meat exhibits a paler color, resulting in limited competitiveness within the premium pork market segment [
33,
34]. As a result, breeders are eager to undertake genetic improvements in both IFC and redness (
a*) meat color concurrently to cater to consumer market demands. In this study, a highly significant positive correlation (
R = 0.309,
P < 0.001) between IFC and the
a* value was observed, which is similar to previous reports by Mortimer et al. [
10] and Zhang et al. [
11], where they discovered the correlation coefficients of IFC and
a* value were 0.260 and 0.323, respectively. Those suggest that there might be similarities in the genetic background regulating changes in both IFC and redness value. Consequently, transcriptome analysis was conducted using the individuls with extreme IFC and
a* value to identify hub genes and metabolic pathways co-regulating IFC and rendess of prok.
In the present study, we analyzed DEGs using three R packages, DESeq2, limma, and edgeR, in the H_IFC (H_a) and L_IFC (L_a) groups. We found that the number of DEGs obtained by these three methods was different, with the DESeq2 method obtaining the highest count of DEGs (723 and 590 DEGs in the IFC and
a* groups, respectively), while the number of DEGs obtained by the limma and edgeR methods was similar. All three methods, limma, edgeR, and DESeq2, are widely used and well-established tools in transcriptome analysis. However, these three methods may yield different sets of DEGs due to variations in their statistical model, methods, and assumptions [
35]. DESeq2 uses a negative binomial distribution [
25], edgeR uses a negative binomial model with a dispersion parameter [
27], and limma employs an empirical Bayes approach with linear modeling [
26]. All methods could handle various experimental designs; limma was particularly well-suited for experiments with complex structures and might offer faster computation for medium to large datasets. DESeq2 and edgeR were well-suited for experiments with fewer replicates and small-sample size data. Moreover, each software employs its normalization methods. For example, DESeq2 used the "size factors" approach, edgeR used the "trimmed mean of M-values" (TMM), and limma employed a linear modeling-based approach [
35]. Although there are some differences among the DEGs obtained by these three methods, the genes that contribute to the phenotypic variations should be stable and genuine rather than affected by variations in the detection methods. Consequently, we focused on the overlapping DEGs that were consistently detected by all three methods for subsequent functional enrichment analysis.
We performed functional enrichment analysis using the overlapping DEGs described above, which were closely related to lipolysis, adipocytokines, redox, and antioxidant responses, such as the adipocytokine signaling pathway, regulation of lipolysis in adipocytes, MAPK signaling pathway, and FoxO signaling pathway. These significantly enriched pathways played a crucial role in the process of IFC deposition and
a* value changes. Because conventional functional enrichment analysis focuses only on DEGs, it might not provide a holistic view of the broader biological landscape. GSEA provided a comprehensive and holistic perspective on transcriptomic changes. By considering all genes rather than DEGs, GSEA offered the advantage of capturing subtle but coordinated expression patterns across the entire transcriptome. This approach helps identify biological pathways and processes that might contribute to observed changes, including those involving non-differentially expressed genes [
36]. Compared to functional enrichment analysis based on DEGs, GSEA analysis specifically identified several enriched pathways associated with IFC deposition and
a* value, such as fatty acid metabolism, glycosphingolipid biosynthesis, oxidoreductase activity, and oxidative phosphorylation. Similar to our results, Liu et al. [
37] and Kim et al. [
38] conducted GSEA in pigs and found significantly enriched pathways related to IFC and meat color, including the oxidative phosphorylation, AMPK, and MAPK signaling pathways.
WGCNA is a bioinformatics analysis technique utilized to depict gene correlation patterns among distinct samples, and facilitates clustering genes with highly similar expression patterns into specific modules, subsequently analyzing the associations between modules and particular traits or phenotypes [
30]. This method has been successfully applied in the genetic analysis of various essential economic traits in pigs [
13,
39,
40,
41,
42,
43]. In this study, WGCNA was performed to detect the vital genes and modules associated with IFC and
a* values using transcriptome data from 19 samples. The results of the WGCNA showed that the purple module demonstrated a positive correlation with both IFC and
a* value. In contrast, the dark grey, dark red, and black modules exhibited negative correlations with IFC and
a* value. These four modules contained a total of 6,045 genes encoding proteins. Based on the overlap analysis between the DEGs (DEGs of IFC and DEGs of
a* value) and the WGCNA results, more than 70% of the DEGs could be detected by the WGCNA, indicating the similarity between these two analysis methods and further proving the reliability of the results of this study. However, some genes associated with IFC and
a* value identified by WGCNA did not exhibit differential expression in high and low groups. This observation suggests that WGCNA could recognize additional information by establishing interconnected networks between genes, aligning well with the foundational principles of WGCNA. This was consistent with the findings of Xing et al. [
13].
Both IFC and
a* values are complex quantitative traits that may be regulated by multiple small-effect genes [
44]. In the present study, differential gene expression analysis, WGCNA, and functional enrichment analysis under various conditions were performed. A large number of genes significantly associated with IFC and
a* values were identified. However, as these two traits are both of medium to high heritability [
5,
6], it is hypothesized that large-effect genes regulating them may be existed. Furthermore, due to the significant correlation between these two traits, it is likely that there may also be molecular pathways that regulate them simultaneously. To effectively reduce the number of genes associated with IFC and
a* values, we performed overlap analysis of significantly enriched GO terms and KEGG pathways derived from overlapping DEGs, GESA, and WGCNA. DEGs located in the overlapping GO terms and KEGG pathways affecting IFC and
a* value were considered candidate genes. Two overlapping GO terms and eleven overlapping KEGG pathways were identified for IFC. Similarly, for the
a* value, seven overlapping GO terms and ten overlapping KEGG pathways were identified. We focused on GO terms and pathways that may be involved in IFC deposition and
a* value alterations, as shown in
Table 2 and Table 3. Response to oxygen-containing compound (GO:1901700) was a GO term that was significantly enriched for both traits (
Table 2). This biological process entails a modification in the state or activity of a cell or organism (involving aspects such as movement, secretion, enzyme production, gene expression, etc.) due to stimulation by an oxygen-containing compound. This process may directly or indirectly affect the oxidation of fatty acids and the redox reaction of myoglobin [
45].
The IFC and
a* value groups shared four significantly enriched pathways: the FoxO signaling pathway (ssc04068), adipocytokine signaling pathway (ssc04920), MAPK signaling pathway (ssc04010), and HIF-1 signaling pathway (ssc04066) (Table 3). The FoxO signaling pathway governs glucose and lipid metabolism by controlling genes associated with gluconeogenesis, glycogenolysis, and lipid metabolism [
46]. It also impacts fatty acid oxidation and storage across diverse tissues [
47]. Although the direct connection between the FoxO pathway and myoglobin oxidation is not extensively documented, it is conceivable that this pathway indirectly influences oxidative processes by regulating energy metabolism and responses to oxidative stress [
48]. The adipocytokine signaling pathway is linked with adipocyte-related functions and metabolism. It modulates insulin sensitivity, glucose uptake, and lipid metabolism, affecting the release of adipokines that influence lipid homeostasis and inflammation [
49]. This pathway likely indirectly affects myoglobin oxidation by influencing factors connected to metabolism and inflammation, thus potentially impacting oxidative processes in muscle tissues [
50]. The MAPK signaling pathway is integral to various cellular processes, encompassing cell growth, differentiation, and metabolism. It can impact lipid metabolism by regulating genes related to lipogenesis, lipolysis, and fatty acid oxidation [
51,
52]. This pathway may contribute to muscle oxidative processes by mediating cellular reactions to stress, lipid peroxidation, and growth cues, thereby influencing myoglobin oxidation under specific conditions [
53]. Activated in response to low oxygen levels, the HIF-1 signaling pathway orchestrates adaptive responses to hypoxia. It influences glycolysis, lipid, and energy metabolism when oxygen levels are low [
54]. The HIF-1 pathway can affect myoglobin oxidation by regulating the response to hypoxia, potentially influencing oxidative metabolism and the role of myoglobin in oxygen transport and storage [
55]. In summary, these pathways may play pivotal roles in both fatty acid metabolism and myoglobin oxidation.
The DEGs in
Table 2 and Table 3 were considered candidate genes influencing IFC and
a* values, and the PPI network was constructed (
Figure 5), respectively. Based on the degree of connectivity, six genes (
ATF3,
SOX9,
PPARGC1A,
CEBPB,
MYC, and
VEGFA) with connectivity exceeding ten were regarded as hub genes potentially influencing IFC. Similarly, fourteen hub genes impacting the
a* value were identified, including
IL6,
VEGFA,
MYC,
EGR1,
CEBPB,
JUNB,
THBS1,
SERPINE1,
SOCS3,
DUSP1,
SOX9,
PPARGC1A,
CCL2, and
FOXO1.
ATF3 (activating transcription factor 3), a member of the CREB family of basic leucine zipper transcription factors (TFs), has been found that the deletion of
ATF3 resulted in increased lipid body accumulation and
ATF3 directly regulates transcription of the gene encoding cholesterol 25-hydroxylase [
56].
VEGFA (vascular endothelial growth factor A) is crucial in vascular development in physiological and pathological processes. Increasing data suggest that
VEGFA regulates lipid metabolism, but further studies are needed to elucidate the specific mechanisms [
57,
58].
IL6 (interleukin-6) is a pivotal regulatory factor for lipolysis and beta-oxidation. Numerous in vitro studies have substantiated that treatment with
IL6 enhances lipolysis and beta-oxidation in both myotubes and adipocytes [
59,
60].
EGR1 (Early growth response 1) is a transcription factor. Mohtar et al. found that insulin/mTORC1-inducible
EGR1 binds to the leptin promoter and activates leptin expression in 3T3-L1 adipocytes, regulating lipid metabolism [
61]. The results of Yan et al. suggested that inhibition of
JUNB might be a key indicator of the regulation of the APOA2-associated PPARα pathway [
62].
APOA2 is a well-known member of the apolipoprotein family [
63], and the PPARα pathway is also a key pathway in regulating lipid metabolism [
64].
THBS1 (thrombospondin-1) is a prototypical matricellular protein.
THBS1-null mice exhibited elevated free fatty acids and triglycerides compared to wild-type mice, suggesting impaired fatty acid uptake [
65].
SERPINE1 (Serpin Family E Member 1), also known as plasminogen activator inhibitor type 1 (PAI-1), is a member of the serine proteinase inhibitor (serpin) superfamily. Several findings have shown that PAI-1 might promote the differentiation of mesenchymal stem cells toward adipogenesis, and PAI-1 deficiency attenuates changes in the levels of adipogenic genes such as PPARγ and aP2 [
66,
67].
SOCS3 (suppressor of cytokine signaling 3) plays an important role in regulating energy metabolism processes. In recent years, researchers have found that
SOCS3 is involved in the AMPK signaling pathway, insulin resistance, adipocytokine signaling pathway, and JAK/STAT pathway and activated/triggered by leptin signal plays important roles in lipid metabolism processes [
68,
69,
70]. DUSPs (dual-specificity phosphatases) are the key phosphatases in the MAPK pathway. Recently,
DUSP1 was suggested to play a critical role in the switch from oxidative to glycolytic myofibers [
71] and could regulate fatty acid oxidation [
72].
CCL2 (chemokine ligand 2) is a member of the C–C motif family of chemokines. Kang et al. found that after
CCL2 binds to its receptor
CCR2, it could reduce lipid peroxidation by inhibiting
CCR2, indicating its important regulatory role in lipid oxidation metabolism [
72]. Current studies suggest that the transcription factor
FOXO1 (forkhead box protein O1) is involved in lipid metabolism and lipolysis in adipocytes [
73,
74]. Song et al. found that interfering with
FOXO1 negatively regulated the expression of adipogenic differentiation marker genes and lipid anabolism marker genes, thus reducing triglyceride content and inhibiting the generation of lipid droplets in bovine adipocytes [
75].
It is worth noting that these two traits share four hub genes:
MYC,
CEBPB,
SOX9, and
PPARGC1A.
MYC is a transcription factor that regulates cell proliferation and differentiation in healthy cellular processes. Hall et al. revealed that the activation of
MYC led to the accumulation of cholesteryl esters stored in lipid droplets [
76]. A previous study found that
MYC was involved in the MAPK signaling pathway, promoting the glycolysis process in fish T cells [
77]. In addition,
MYC is involved in the WNT signaling pathway and serves as a target gene/transcriptome factor for WNT, regulating myogenesis [
78].
CEBPB (CCAAT/enhancer binding protein β) is a member of the transcription factor family of CEBP. Several studies have reported that
PPARGC1A (PPAR coactivator-1α, also known as
PGC1α), a transcriptional co-activator of PPARγ, could bind to
CEBPB and form a transcription complex. This complex could promote the transcription of
CPT1A (carnitine palmitoyl transferase 1 A) and activate fatty acid β-oxidation [
79,
80].
SOX9 (Sex-determining region Y-type box-9) is a member of the Sox supergene family and has been proven to be an essential transcription factor in cartilage formation during chondrocyte proliferation [
81]. Wang et al. confirmed that
SOX9 could directly bind to the promoters of
CEBPB and
CEBPD, inhibit their promoter activity, and prevent adipocyte differentiation [
82]. This evidence indicated that the
SOX9/
CEBPB/
PPARGC1A axis might play an essential regulatory role in fatty acid β-oxidation. Myoglobin is an oxygen-binding hemeprotein generally localized to oxidative muscle and functions as an oxygen store and reactive oxygen species scavenger [
83]. Schlater et al. confirmed that an increase in lipids could stimulate an increase in myoglobin content in muscle cells of C2C12 mice, which was closely related to fatty acid beta oxidation [
84]. In summary, we speculated that the
SOX9/
CEBPB/
PPARGC1A axis played a vital role in the co-regulation of IFC deposition and changes in redness of meat color. The expression levels of the upstream gene
STAT3 (signal transducer and activator of transcription 3) and downstream
CPT1A genes in the
SOX9/
CEBPB/
PPARGC1A axis were also significantly different in both the high and low groups in this study, further supporting the importance of this pathway in the synergistic regulation of lipid and myoglobin metabolism. Thererore, it is particularly interesting to investigate the co-regulatory mechanism of the
SOX9/
CEBPB/
PPARGC1A axis in IFC and
a* value traits in further studies.