In order to accurately screen and reduce the dimensions of the bladder metabolism characteristics associated with estrogen deprivation identified in our multi-component analysis, we comprehensively used the methods of t-test, PCA, PLS-DA and Orthogonal PLS-DA (OPLS-DA), Empirical Bayesian Analysis of Metabolomics (EBAM), and Random Forest to perform inter-group analysis between the OVX group and the sham group, as well as between the OVX group and the OVX + E group. The distribution before and after data correction is presented in
Figure S4. With a Fold Change (FC) threshold set at 2 and an FDR set at 0.05, inter-group t-tests for metabolites were conducted. Compared to the sham group, the up-regulated metabolites in the OVX group were cis-5-tetradecenoylcarnitine, and the down-regulated metabolites were N6-acetyl-L-lysine, lysoPC (15:0), and 3,5-tetradecadiencarnitine. In comparison to the OVX + E group, the up-regulated metabolite in the OVX group was 2-hydroxybutyric acid, while the down-regulated metabolites were 4-guanidinobutanoic acid, N6-acetyl-l-lysine, tartaric acid, glycocholic acid, cortisol, and lysoPC (15:0) (
Figure 6A, B,
Table S9). The hierarchical clustering heat map based on the top 30 different metabolites clearly showed the expression trends of different metabolites among different group (
Figure 6C, D). Three methods, PCA, PLS-DA, and OPLS-DA, were employed to perform dimensionality reduction analysis on inter-group data (
Figure 6E-N). Firstly, the unsupervised method (PCA) maintained a certain degree of differentiation between the Sham and OVX group (
Figure 6E). Principal component I reached 30.1%, principal component II reached 17.4%, accumulating to 47.5% (
Figure S5A, B), but the differentiation between the OVX and OVX + E group was insufficient (
Figure 6F), with principal component I at 26.2% and principal component II at 15%, accumulating to 41.2% (
Figure S5C, D). The supervised methods (PLS-DA and OPLS-DA) demonstrated excellent inter-group differentiation (
Figure 6G-J), but the permutation test results indicated overfitting of the PLS-DA model (
Figure S5E, F). The cross-validation results indicated that the R
2X for the OPLS-DA model of OVX versus sham group was 0.17, R
2Y was 0.79, and Q
2 was 0.58 (
Figure S5G). The permutation test yielded an R
2Y of 0.99 (
p = 0.007) and Q
2 of 0.68 (
p = 0.002) (
Figure S5H), indicating a good fit for the OPLS-DA model. Metabolites with VIP scores exceeding 2 (lysoPC (15:0) and 3,5-tetradecadiencarnitine) also demonstrated significant importance in the S-plot (
Figure 6K, L,
Table S10). The cross-validation results indicated that the R
2X for the OPLS-DA model of OVX versus OVX + E group was 0.15, R
2Y was 0.87, and Q
2 was 0.64 (
Figure S5I). The permutation test yielded an R
2Y of 0.99 (
p < 0.001) and Q
2 of 0.86 (
p < 0.001) (
Figure S5J), confirming a perfect fit for the OPLS-DA model. Among the metabolites with VIP scores exceeding 2 (lysoPC (15:0), fumaric acid, l-tyrosine, and cortisol), except for fumaric acid, the other three also showed significant importance in the S-plot (
Figure 6M, N,
Table S11). Additionally, from the perspective of machine learning, we carried out supplementary screening of potential different metabolites using a representative random forest tree(RF) model to evaluate inter-group differences. The classification error rate graph is shown in
Figure S5K, L. The VIP plot, based on features ranked by their contributions to classification accuracy, indicated that 3,5-tetradecadiencarnitine contributed the most to the RF model between the sham and OVX group, while fumaric acid and lysoPC (15:0) contributed the most to the RF model between the OVX and OVX + E group (
Figure 7A, B). Lastly, we applied EBAM and SAM evaluation methods (
Figure 7C-F). EBAM results indicated that the most valuable different metabolites between the sham and OVX group were 3,5-tetradecadiencarnitine, lysoPC (15:0), tetradecanoylcarnitine, and 2-hydroxybutyric acid, while the different metabolites between the OVX and OVX + E group were lysoPC (15:0), l-tyrosine, fumaric acid, cortisol, l-valine, coumarin, and 2-hydroxybutyric acid (
Table S12). SAM analysis yielded results consistent with EBAM, showing that the most valuable different metabolites between the sham and OVX group were 3,5-tetradecadiencarnitine and lysoPC (15:0), while between the OVX and OVX + E group, they were lysoPC (15:0), cortisol, and glycocholic acid (
Table S13).