3.6.2. Differential Analysis on the Evolution of Baijiu Flavor Types through Machine Learning
The dataset of trace components in Baijiu was extensive and the flavor perception of Baijiu was notably complex [
31]. Traditional identification methods, which primarily utilize principal component analysis (PCA), cluster analysis, and similar methods, were often limited in their effectiveness when analyzing small datasets. These methods typically struggle with non-linear relationships and larger datasets. In contrast, machine learning approached, such as random forests [
7] and support vector machines (SVM), possessed the advantage of uncovering potential unknown connections within extensive datasets [
32]. Recently, machine learning has been employed to establish identification methods for food based on various treatment techniques [
33]. In this study, random forest analysis (
Figure A.2) was performed, focusing on the distribution characteristics of trace components. Within 500 decision trees, the trace components were ranked according to their feature importance, ultimately identifying the differential markers between derived and basic flavor types with high accuracy [
34]. Based on, the Pearson correlation coefficient was calculated to analyze the relationship between the concentrations of differential markers and the scores of SQDA for Baijiu, as illustrated in
Figure 3b. In this figure, 3-octanone-TX and 3-octanone-FYX indicated that this component was screened in both the TX and FYX groups, a trend that applied to other components as well.
Jianxiangxing (JXX) Baijiu was identified as a flavor type in 1983. Its production technique was borrowed JX in the early stage and NX in the late stage, which formed a sensory feature that combined both JX and NX styles [
35,
36]. As shown in
Figure A.2, ethyl benzoate and ethyl butanoate made an important contribution to identifying JXX and its basic flavor types. Among them, in previous study [
37], ethyl benzoate was also identified as a flavor differential marker to distinguish JXX and its basic flavor types. Correlation analysis (
Figure 3b) indicated that ethyl butanoate was negatively correlated with ethanol aroma, softness and persistence, while positively correlated with the baked aroma. Meanwhile, ethyl benzoate was positively correlated with oily aroma and ethanol aroma but positively correlated with bitter aftertaste.
Dongxiangxing (DX) Baijiu was considered as a flavor type in 1986. The production technique combined Daqu and Xiaoqu (two saccharification fermenters with different production technology) [
38], and the flavor contained the characteristics of three flavor types, namely, JX, NX and MX. Adding traditional Chinese medicine was a feature of DX, which provided a comfortable herbal aroma while promoting or inhibiting microorganisms. In terms of flavor, DX possessed unique style, with a composite aroma composed of herbal aroma, ester aroma and ethanol aroma. As shown in
Figure A.2, pentanoic acid (positively correlated with mild aroma and sour taste while negatively correlated with mellow), 2-methylbutanal (positively correlated with mellow and bitter aftertaste while negatively correlated with jiang-aroma), and ethyl hexanoate (negatively correlated with grain aroma while positively correlated with herbal aroma and harmonious) played an important role in distinguishing DX from its basic flavor types.
Texiangxing (TX) Baijiu, with whole grain rice as the brewing material, had independently been a new flavor type since 1988. The unique ratio of flour, wheat bran, and distilled grains used in the production of Daqu was one of the importance factors that contributed to the formation of TX style [
39,
40]. TX simultaneously possessed the characteristics of three basic flavor types (referring to JX, NX, QX). The results (
Figure 3b) showed that 3-octanone, isopentyl hexanoate, ethyl heptanoate with the highest contribution were all negatively correlated with bitter aftertaste.
Fengxiangxing (FX) Baijiu had a long history, but its flavor type was not officially determined until 1992. The raw material for its Daqu was consistent with QX, while the production technology was similar to NX so that FX possessed the sensory characteristic of combining both QX and NX [
41].
Figure A.2 showed that 3-methylbutyric acid (positively correlated with ethanol aroma while negatively correlated with oily aroma), ethyl lactate (positively correlated with sweet taste and sweet aftertaste), isopentyl hexanoate (positively correlated with floral/fruity aroma, mellow, harmonious and sweet aftertaste), ethyl tetradecanoate (positively correlated with floral/fruity aroma and grain aroma while negatively correlated with softness) and ethyl benzoate (positively correlated with floral/fruity aroma while negatively correlated with milk aroma and grain aroma) were identified to make major contributions to the classification of FX and its basic aroma types. In particular, the low concentration of ethyl lactate in FX was related to its short fermentation cycle [
42], which may have a positive impact on the sweetness of Baijiu.
Zhimaxiangxing (ZMX) Baijiu was officially determined as an independent flavor type in 1995. In fact, its technology was carried out according to JX, due to the less humid and hot climate in the north compared to the south, the microorganisms produced during the accumulation process were not abundant enough [
43], resulting in the Baijiu had presenting a prominent sesame aroma. As shown in
Figure A.2, ethyl propanoate (negatively correlated with jiang-aroma and mellow) made the greatest contribution in distinguishing between ZMX and JX.
Chixiangxing (CX) Baijiu was determined in 1996 and derived from MX with the process feature of saccharification and fermentation simultaneously. Specifically, its soaking process of chen rou (i.e., fatty pork) [
44] was the biggest difference from MX and the Chixiang refered to the unique aroma that combined the basic aroma of the Baijiu (such as ester aroma, floral aroma, etc.) with the mature aroma of chen rou [
45]. The results (
Figure A.2) showed that isobutyl acetate (positively correlated with jiao-aroma and oily aroma while negatively correlated with honey aroma and ethanol aroma) played an important role in distinguishing between CX and MX.
Laobaiganxiangxing (LBGX) Baijiu, whose acid/ester ratio was basically the same as QX, and the concentration of ethyl acetate was also higher, was once considered to belong to QX. After long-term practice, it had been found that there were significant differences in the distribution characteristics of trace components between LBGX and QX [
46]. It was not until 2005 that LBGX was officially recognized as a separate flavor type. The classification and feature selection of QX and LBGX were performed by random forests, and the results (
Figure A.2) showed that isopropyl myristate (positively correlated with jiao-aroma and mellow while negatively correlated with oily aroma and baked aroma) made an important contribution to classification accuracy.
Fuyuxiangxing (FYX) Baijiu was derived from NX, JX, and QX, and formed a style of pre-NX flavor, mid-QX flavor, and post-JX flavor [
47]. It was officially identified as an independent flavor type in 2005. The main characteristic of its technology was that unbroken grains were chosen as raw material, and were saccharified by using Xiaoqu to cultivate microorganism, while Daqu was added for fermenting in the cellar [
48]. There were 5 kinds of differential markers selected by random forests (
Figure A.2), namely 2-butanol (positively correlated with ethanol aroma and rich), ethyl 2-hydroxy butanoate (negatively correlated with jiao-aroma, mellow, sweet taste and hard), 3-octone (positively correlated with herbal aroma while negatively correlated with sesame aroma), ethyl nonanoate (positively correlated with jiao-aroma while negatively correlated with softness), and 2,4-di-tert-butylphenol (positively correlated with sweet aroma and persistence while negatively correlated with harmonious and rich).
In summary, random forest model was used to classify and select features for derived and basic flavor types. The 19 differential markers with the highest contribution to classification were selected based on the accuracy ranking of each group, containing 11 esters, 2 alcohols, 2 acids, 1 aldehyde, 1 ketone, and 2 aromatics (containing benzene). Based on, PCA analysis was carried out on different flavor types with 19 selected differential markers to evaluate whether they can distinguish the flavor types of Baijiu. The results (
Figure 3c) showed that the total variance was 71.5% (PC1 was 38.7%, PC2 was 32.8%), which indicated that the 19 differential markers screened could effectively distinguish Baijiu of different flavor types. However, the influence of these differential markers in corresponding flavor type of Baijiu on sensory attributes still needed to be further verified by flavor addition test.