Preprint
Article

NIR Spectroscopy as an Alternative to Thermogravimetric Analyzer for Biomass Proximate Analysis: Comparison of Chip and Ground Biomass Models

Altmetrics

Downloads

100

Views

50

Comments

0

A peer-reviewed article of this preprint also exists.

Submitted:

22 December 2023

Posted:

22 December 2023

You are already at the latest version

Alerts
Abstract
A non-destructive analysis of proximate parameters (moisture content, MC; volatile matter, VM; fixed carbon, FC; and ash content) for a variety of chipped and ground biomasses was investigated through a combination of thermogravimetric analysis (TGA) and near-infrared spectroscopy (NIRS) with partial least squares regression (PLSR). In this study, the thermogravimetric method was employed as a tool to determine the proximate analysis data of biomass samples through TG and DTG curves, which were generated by tracking changes in biomass mass loss over time or temperature. NIRS non−destructively scanned the chipped biomass in diffuse reflectance while ground biomass using transflectance mode, covering the wavenumber range of 3594.87 to 12,489.48 cm−1. The PLSR−based models: Full−PLSR, genetic algorithm (GA)−PLSR, successive projection algorithm (SPA)−PLSR, the spectral multi−preprocessing (MP) PLSR− 5 range method, and the MP PLSR−3 range method were developed. Model performance was assessed, compared, and selected based on the coefficient of determination in the prediction set (R2P), root mean square error in the validation set (RMSEP), and the ratio of prediction to deviation (RPD). Based on the comprehensive analysis of model performance, MC and FC models of chip biomass have demonstrated satisfactory performance which makes it applicable cautiously in any application, including research. The optimum models for MC and FC in chip biomass were constructed using GA−PLSR with 2nd derivative and Full−PLSR with a constant offset, resulting in R2P values of 0.8654 and 0.8773, RMSEP values of 0.85% and 2.12%, and RPD values of 2.9 and 3.0, respectively. Other parameters such as MC and FC in ground biomass as well as VM and ash content in both chip and ground biomass were found suitable for rough screening. The sensitivity of the selected model was assessed by calculating the limit of quantification (LOQ), and the results indicated that the calculated LOQ values for VM in both chip and ground biomass and FC in chip biomass are lower than the minimum reference values used during model development, signifying a high level of sensitivity. However, for the remaining parameters, the LOQ values surpass the established minimum reference value, suggesting the model could not predict the future samples with reference value lower than the calibration range. Nevertheless, enhancing the models by incorporation an ample number of representative biomass samples and consistently validating the models with unknown samples is imperative to ensure accurate predictions.
Keywords: 
Subject: Engineering  -   Energy and Fuel Technology

1. Introduction

Biomass is a renewable, carbon-neutral, abundantly available, and sustainable energy source that holds the potential to reduce or replace reliance on fossil fuels, contributing to energy savings and environmental preservation [1]. Studies indicate that global lignocellulosic biomass, including crop residues, wood, and grass, generates approximately 181.5 billion tonnes each year [2]. This highlights the substantial potential of utilizing agricultural residues and wood from fast−growing trees for bio-conversion. A primary concern is that biomass generally has a lower energy density compared to fossil fuels. Nevertheless, the application of diverse biomass energy technologies rooted in thermochemical treatments-namely, pyrolysis (300 to 1000 °C), gasification (500 to 1300 °C), liquefaction (250 to 550 °C), torrefaction (200 to 300 °C), and combustion (700 to 1000 °C)− displays the potential to effectively transform biomass into valuable energy resources [3,4].
The wide range of biomass sources, the diverse assortment of agricultural and tree waste materials, and the continuous variations in their properties during processing, storage, and transportation collectively pose challenges in effectively characterizing and understanding biomass as an efficient fuel. The characterization of biomass is crucial for reliably predicting its behavior as a fuel [5]. In the context of the thermal conversion of biomass, proximate analysis emerges as a pivotal method for characterization [4]. Proximate analysis is used to determine the mass percentages on a wet basis of MC, VM, FC, and ash content, all of which significantly influence both combustion behavior and plant design.
During the biomass combustion process, the rate of heat release is primarily influence by proximate analysis parameters, including MC, VM, FC, and ash content [6]. Adjustments in these proximate analysis values can yield variations in crucial combustion performance indicators such as ignition time and temperature, burnout time and temperature, and the overall heat release. Higher moisture levels decrease the higher heating value of biomass [7] and lead to reduced combustion efficiency, whereas higher ratios of VM to FC correspond to increased fuel reactivity [8]. VM refers to the components in biomass that are converted into a gaseous form through thermal decomposition, whereas FC represents the non-volatile segment of the biomass. A high VM content does not necessarily ensure a high calorific value because some VM constituents are derived from non−combustible gases like CO2 and H2O. Lower MC and VM contents contribute to the improvement of the FC content in biomass samples [9]. A high FC signifies a higher carbon content in the biomass, resulting in higher energy content and generating maximum heat during combustion [10]. Therefore, the FC content of biomass positively influences its energy potential and calorific value. Conversely, a higher presence of VM in the biomass results in lower FC, consequently impacting the energy potential of the biomass negatively.
Ash is an inorganic residue remained after ignition of combustible substance [11,12]. The energy required for the thermal breakdown and phase transition of ash-forming inorganics is extracted from the biomass burning energy, resulting in a reduction of calorific value [13]. The presence of high MC and elevated ash content in biomass lead to ignition and combustions problems [5]. Also, the presence of ash influences corrosion and slag formation [8].
Various standard procedures exist to measure these proximate analysis parameters. ASTM E−871−82 (2019) addresses moisture, ASTM E−872−82 (2019) handles volatile matter, and ASTM E1755−01 (2020) tackles ash in biomass. FC is usually determined by calculating the difference between 100% and the sum of MC, VM, and ash content [12,14]. These standard methods are involve heating a sample under specific conditions and assessing the change in weight. For proximate analysis of biomass, experimentation includes using a simple oven to determine MC and a furnace to determine VM and ash contents. The experimental techniques employed to estimate these parameters often consume considerable time, are financially demanding, and carry an increased likelihood of experimental error. Additionally, considering the large number of samples needed to determine the proximate analysis data for biomass characterization, the process is both tedious and destructive in nature. Hence, various researchers have recommended the use of TGA for proximate analysis of biomass. For instance, Posom et al. (2020) recommend to use TGA to determine the proximate analysis of bamboo chip and leucaena pellets biomass [15], Torquato et al. (2017) investigated the appropriate conditions for determining proximate parameters of lignocellulosic agricultural residue using TGA [16], and García et al. (2013) analyzed 13 different types of biomass for proximate analysis using TGA [8]. Similarly, previous research studies have demonstrated the potential of utilizing NIR Spectroscopy, as an alternative method, for assessing biomass proximate analysis parameters across the wavelength range between 780 and 2500 nm. Posom et al. (2022) developed a reliable and accurate model for online measurement of MC in sugarcane [17]. Toscano et al. (2022) estimated the MC of industrial wood chip fuel using a portable NIR spectrometer [18]. Sirisomboon et al. (2020) compared the performance of proximate data models, namely MC, VM, FC, and ash, through direct scanning of bamboo chips [19]. Shrestha et al. (2018) evaluated MC in bamboo chips using diode array near-infrared spectroscopy [20]. Posom et al. (2017) determined the HHV, VM, FC, and A of ground bamboo using FT−NIRS [21]. Adnan et al. (2017) assessed the feasibility of NIRS and chemometrics for rapid and non−destructive prediction of MC in intact green coffee beans [22]. The above research demonstrates the potential of TGA for determining proximate parameters and NIRS for rapidly predicting biomass proximate parameters. Therefore, this study integrates TGA and NIRS to develop calibration models using chemometircs, specifically PLSR.
Although NIRS is a rapid, reliable, and non-destructive analytical method, it requires the development of individual calibration models based on spectral data and each reference parameter. This process can be time-consuming and costly. However, in the long term, it proves to be beneficial for the accurate assessment of proximate analysis parameters. To the best of our knowledge, there have been no publications reporting the development of a global NIRS model for proximate parameters using fast-growing trees and agricultural residue which can be catagorized into wood and non−wood biomass. In this work, we elucidate the rationale for combining both types of biomass materials for model development. Additionally, this study marks the first application of the recent multi-preprocessing methods, specifically the 5−range and 3−range methods, for the NIRS evaluation of proximate parameters in biomass. Therefore, the main aims of this study are as follows:
1. To determine the proximate analysis parameters, i.e., MC, VM, FC and ash content of biomass, using TGA.
2. To develop separate calibration models using Full−PLSR, GA−PLSR, SPA−PLSR, the multi−preprocessing 5 range−PLSR, and the multi−preprocessing 3 range−PLSR for the non−destructive assessment of each proximate analysis parameter in both chipped and ground biomass.
3. To compare and select the best−performing PLSR−based model for each proximate analysis parameter from chipped and ground biomass, and establish it as a rapid, reliable, non−destructive alternative method for assessment of proximate analysis parameters.
4. To determine the LOQ value for each proximate parameters using the calibration set of the proposed high-performance model, both for chip and ground biomass.

2. Materials and Methods

Figure 1 illustrate the overall research methodology employed to assess the proximate analysis parameters and select the best performance PLSR based models using TGA (35−700 °C) and NIRS (3594.87−12489.48 cm−1).

2.1. Sample Preparation

Ten different varieties of biomass samples were collected from various locations in Nepal. Wood samples included four fast−growing species: (1) Alnus nepalensis, (2) Pinux roxiburghii, (3) Bombax ceiba, and (4) Eucalyptus camaldulensis. Non−wood were five agricultural residues: (1) Zea mays (cob), (2) Zea mays (shell), (3) Zea mays (stover), (4) Oryza sativa, and (5) Saccharum officinarun and one fast growing tree (6) Bombusa vulagris. These samples were manually chopped, dried under the open sun, and then transported to the laboratory for NIRS and TGA experiments to record the spectral data and proximate analysis data for the development of PLSR-based models [23].

2.2. FT-NIRS Scanning

In Figure 2a, chip biomass samples with dimensions less than 30 mm X 15 mm were scanned using an FT−NIR spectrometer (MPA, Bruker, Ettlingen, Germany) in diffuse reflectance and sphere macro sample rotating mode. The samples were fully filled into a quartz−bottom sample cup with a diameter of 90 mm and a height of 90 mm. They were then covered with aluminum sheet to prevent NIR leakage. Subsequently, the chip biomass samples were ground using a multifunctional high−speed disintegrator (WF−04, Thai grinder, Thailand). The ground biomass samples were scanned in a transflectance mode (Figure 2b). These samples were placed in glass vials with a diameter of 20 mm and a height of 40 mm.
NIR scanning of both chip and ground biomass was performed twice without changing their positions under a controlled laboratory environment, maintaining a temperature of 25 ± 2 °C. For both form of biomass samples, the spectrometer operated at a resolution of 16 cm−1, with a background scan time and sample scan time of 32 scans (average), logging absorbance data as log (1/R) within a wavenumber range of 3594.87 to 12,489.48 cm−1. The average absorbance value was calculated and applied for model development. A gold plate scan was performed for every new scanning, and aluminum plates/handles were used to prevent possible NIR leakage. Figure 3 shows the raw spectra obtained by scanning the chip biomass and ground biomass using an FT−NIR spectrometer. The ground sample was randomly selected and its particle size distribution was assessed at the Scientific and Technological Research Equipment Center at Chulalongkorn University, Bangkok, Thailand, using the Mastersizer 3000 instrument (MAL1099267, Hydro MV). Figure 4 shows the representative particle size distribution of the ground biomass used in this research, ranging from 0.01 to 3080 µm.

2.3. Reference Analysis (Proximate Analysis)

TGA (TG 209 F3 Tarsus, Netzsch, Germany) with a microbalance sensitivity of 0.1 µg resolution was employed for direct combustion in ambient air conditions, i.e., O2 and N2 in 1:4 ratio, within a temperature range of 35°C to 700°C. A heat flow rate of 10°C/min was utilized to record the mass loss of ground biomass sample of 6 ± 0.5 mg with respect to time or temperature. Prior to thermal biomass degradation, all samples were held isothermally at 35°C for 10 minutes. N2 is used as a protective layer to create stable and inert environment [24]. Proteus 6.0.0 (NETZSCH software, Germany) was used to generate TG and DTG curves. The TG and DTG curves were typical of the degradation process, which is generally divided into four significant stages: MC removal (<110 – Stage I), release and combustion of VM (197-350°C – Stage II), char combustion (360−600°C– Stage III), and ash lifting (>600°C – Stage IV) [21,25] (Figure 5). The representative TG and DTG of wood and non−wood biomass used in this study are shown in Figure 6a) and 6b), respectively.
Biomass is composed of cellulose, hemicellulose, and lignin [26]. As evident from Figure 6b, the combustion of lignocellulosic biomass exhibited two distinct major decomposition peaks. The first zone is within a temperature range of approximately 180 – 400°C, where the rate of weight loss of biomass is at its maximum and the peak is prominently displayed. This range corresponds to the decomposition of cellulose and hemicellulose. During this phase, volatiles are released and subsequently ignited, resulting in the formation of char [27]. The second zone spans the temperature range of approximately 380 to 600°C, where a broader peak was observed, associated with char oxidation [27]. In this stage, lignin assumes a key role as the primary contributor to biomass char formation. Beyond 535°C, the decomposition primarily relates to inorganic matter, specifically the ash content.
Due to reference test of TGA in our case (TG 209 F3 Tarsus, Netzsch, Germany) used a small sample of ~ 6 mg, the biomass was needed to be ground to get the mixed of every piece of chip sample scanned to obtain the proximate parameters. This grinding method might slightly effect the measurement constituent quantity in the sample, espectially, the moisture content. But it is the only way we have to compromize for estimation the MC in the chip sample.

2.4. Outlier Identification

To achieve a high−performance predictive model, the identification and removal of outliers from the complete dataset is one of the most crucial procedures in NIR modeling. In this study, outliers were identified using two methods: one based solely on reference data and the other utilizing both reference and spectroscopic data.
Outlier for the reference data is calculated using following equation:
X i X ¯ S D ± 3 ,
Where, Xi is the measured value of sample i, and X ¯ and SD denote the average and standard deviation of the measured values across all samples. If equation (1) is satisfied for any sample i, the sample is considered an outlier and is removed from the complete dataset [23,28].
Following the removal of outliers based on equation (1), the complete dataset was used to develop a model. If the model performance was deemed unsatisfactory, a further outlier identification process was undertaken using both reference and spectroscopic data. This method involved a full cross-validation using PLS regression on total dataset to identify outliers. A scatter plot was then created to compare the measured and predicted values. Outliers were carefully identified and subsequently removed if their behavior significantly deviated from the rest of the data points. This meticulous outlier detection process ensures the model robustness and accuracy by eliminating potential data anomalies that could impact the reliability of predictive results.

2.5. Spectral Preprocessing

Spectral preprocessing holds significant importance in NIR modeling to achieve high−performance predictions of the reference values using NIR spectral data. NIR raw spectral data often contains instrumental noise, baseline shift, over lapping peaks, scattering, and other errors [23]. Therefore, to enhance spectral features and extract the maximum relevant information, spectral preprocessing, which involves mathematical transformations, is vital prior to model development [29,30].
Prior to model development, raw spectra obtained from both chip and ground biomasses were subjected to ten different types of traditional spectral preprocessing, which include D1, D2, constant offset, SNV, MSC, min−max normalization, mean centering, vector normalization, D1 + MSC, and D1 + vector normlization. Additionally, the raw spectra were pretreated using MP techniques within a 5 range and 3 range to assess prediction performance within the same wavenumber range [23]. These preprocessing steps could potentially improve and or even reduce predictive accuracy [31] of proximate parameters for both chip and ground biomass.
The MATLAB−R2020b (MathWorks, Natick, MA, USA) with an in−house code was employed for conducting both traditional spectral preprocessing and multi spectral preprocessing techniques.

2.6. Partial Least Squares Regression Model Development

After the identification and removal of outliers from the complete dataset, as shown in Figure 1, PLSR models were developed by manually splitting the total dataset which was run in dessending order ra into a calibration set (80%) and a validation set (20%), ensuring that both sets include all varieties of biomass that have been considered. The calibration set is used to establish a mathematical model that correlates the reference data and spectral data of biomass samples. It includes the maximum and minimum reference values of the identified proximate parameters, ensuring the model was adequately trained to handle the entire spectrum of data. On the other hand, the prediction set employs the developed model to predict the properties of an unknown sample.
The NIR spectra of each biomass sample contained multiple variables, resulting in collinearity issues, redundant information, and heightened computational complexity [32]. To tackle these challenges, uninformative wavelength variables, having either negligible or adverse impacts on model performance, need identification and elimination. In this study, two wavelength selection processes, GA and SPA, were employed to develop a PLSR model. The primary objective in choosing these wavelength selection methods is to improve predictive capacity, achieved higher model efficiency, and create simpler models that are more interpretable [33].
GA is an evolutionary optimization technique that mimics natural selection and genetic mechanisms to find optimal solutions, utilizing three key operators: selection, crossover, and mutation [34,35,36]. In feature wavelength selection, it begins by generating a population based on available features. Subsequently, the algorithm creates successive populations by selecting individuals based on their fitness level in the problem domain. These individuals undergo recombination through selection, crossover, and mutation across multiple generations, progressively eliminate redundant information from the chosen wavelengths [37]. The optimal member of the population becomes the selected feature.
SPA is a forward selection method that selects variables with minimal redundant information from the informative variable [33]. The variable selected by SPA have the greatest projection onto the orthogonal subspace of the previously selected variable [38]. In SPA, the initial phase involves projecting onto the columns of the spectral matrix, creating a candidate subset of variable with minimal collinearity. The subsequent step assesses these candidate subsets based on the RMSE values obtained from the validation set of the PLSR calibration. The final step involves removing uninformative variables through a variable elimination procedure that does not significantly compromise predictive ability [38,39,40].
In this study five different PLSR−based models were constructed: Full−PLSR, MP PLSR−5 range, MP PLSR−3 range, GA−PLSR, and SPA−PLSR. These models were developed using an in-house code in MATLAB−R2020b (MathWorks, Natick, MA, USA), and their performances were subsequently compared.
The models were evaluated based on R2C, RMSEC, R2P, RMSEP, RPD and bias. These parameters were calculated using the Equations (2) to (5)
R 2 c , R 2 p = 1 i = 1 N ( y i y ^ i ) 2 i = 1 N ( y i y ¯ ) 2 ,
R M S E C , R M S E P = i = 1 N ( y i y ^ i ) 2 N ,
R P D = S D R M S E P ,
B i a s = i = 1 n ( y i y ^ i ) n ,
where, y is the measured value, y ^ is the predicted value, i is subscript used to indicate the number of the sample, y ¯ is the mean of the measured value, N is the number of samples of repective set, SD is the standard deviation of the measured values of the validation set, and n is the number of samples in the validation set.
The optimum model is selected based on higher R2C, R2P, and RPD values, as well as lower RMSEC and RMSEP values. In the case of similar performance, the model with fewer LVs is selected to adopt it as an alternative method for rapid non-destructive evaluation of biomass proximate analysis parameters [23].
Williams et al. (2019) [41] and Zornoza et al. (2008) [42] guidelines have been followed for the interpretation of R2 and RPD values, respectively.

2.7. Limit of Quantification (LOQ)

The sensitivity of the selected high−performance PLSR model is a critical factor for reliably and accurately determining biomass proximate analysis parameters. To address this, the LOQ of the selected model is calculated, representing the minimum concentration of the analyte that can be detected and quantified with high sensitivity and reliability. There are different methods to calculate the LOQ, which is based on 1) visual evaluation, 2) signal−to−noise, and 3) the SD of the response to slope [43]. In this study, LOQ is calculated based on the SD of the response to slope method obtained from the calibration set. The LOQ is calculated using the following equation:
LOQ = 10   σ C S C

3. Result and Discussion

3.1. Spectral Preprocessing

Figure 7 displays the average pretreated spectrum obtained from the best−performing model for proximate analysis of chip or ground biomass. This model utilizes the following techniques: a) MC with the second derivative from chip biomass, b) VM with a MP combination set of 4, 3, 4, 6, and 0 from chip biomass, c) FC with a constant offset from chip biomass, and d) ash content with the first derivative from the ground biomass.

3.2. Reference Data

The proximate analysis data for ten different varieties of biomass, including four wood varieties of fast−growing trees and six non-wood varieties (five agricultural residues and one fast−growing tree), were obtained by analyzing the TG and DTG profiles obtained from TGA, as explained in Section 2.3. Figure 8 displays the comprehensive results of proximate analysis for biomass, encompassing MC, VM, FC, and ash content, all of which were obtained from TGA.
Under the TGA, as explained in section 2.3, the average MC, VM, FC, and ash content in the biomass samples were recorded as 6.47%, 58.99%, 31.63%, and 2.91%, respectively, and 6.55%, 60.57%, 27.17%, and 5.72%, respectively. Here, FC is calculated as:
FC (%) = 100 – MC (%) – VM (%) – ash (%),
The analysis revealed notable variations in MC, VM, FC, and ash content among the studied biomass varieties. The maximum MC of 8.61% and VM of 65.64% were recorded for Zea mays (shell), whereas the minimum MC of 3.19% was found in Saccharum officinarum, and the minimum VM of 51.96% was observed in Oryza sativa. Additionally, Pinus roxburghii exhibited the highest FC content at 35.11%, while Zea mays (shell) displayed the lowest FC content of 22.21%. Similarly, Oryza sativa had the maximum ash content of 13.72%, and Pinus roxburghii had the lowest ash content at 1.82%.
The MC, VM, FC, and ash content in fast−growing trees range from 3.50% to 10.39%, 48.31% to 69.74%, 18.95% to 42.05%, and 0.32% to 7.90%, respectively. For agricultural residues, the respective ranges are 2.21−12.21% (MC), 44.62−75.10% (VM), 10.40−35.83% (FC), and 0.88−17.59% (ash content). Interestingly, there seems to be an overlap in the ranges. This was confirmed by the data in Table 7 and scatter plots of the developed models shown in Figure 9 which indicated that the proximate parameter spans were wider when the wood and non−wood biomass of fast-growing trees and agricultural residue combined for modeling.
The observed variations in MC among the biomass varieties are of significance. Despite all MC values falling below the 10%, the notably higher MC in Zea mays (shell) highlights the necessity for effective drying procedures prior to utilizing it for energy purposes. Achieving an optimal MC could enhance the combustion efficiency of this biomass [44].
The diverse levels of VM content are indicative of differing combustion behaviors. Biomasses with higher VM content tend to ignite more readily, making them favorable for certain combustion-based applications [10]. The presence of both high and low VM biomasses, such as Zea mays (shell) and Oryza sativa respectively, provides opportunities for specific use in different energy production processes.
The FC content is a crucial factor influencing combustion efficiency. Biomasses with higher FC content generally possess better energy density and combustion characteristics [45]. In light of this, Pinus roxburghii contained notably higher FC content positions it as a promising biomass for energy production, particularly in scenarios where concerns related to ash generation and deposition need to be minimized.
Furthermore, ash content is a critical consideration in combustion processes. Lower ash content, such as in Pinus roxburghii, can alleviate operational challenges associated with ash accumulation. Conversely, Oryza sativa had higher ash content might require careful management or additional treatment to mitigate potential issues during combustion.
This section may be divided by subheadings. It should provide a concise and precise description of the experimental results, their interpretation, as well as the experimental conclusions that can be drawn.

3.2. Prediction of Proximate Analysis Data

Table 1 shows the statistical summary data for the proximate parameters: MC, VM, FC, and ash. These parameters were employed in both the calibration and validation sets to develop a PLSR based models for both chipped and ground biomass, using the full wavenumber range 3594.87 to 12,489.48 cm−1 to assess proximate analysis parameters. Table 2 shows the details of outliers identified in chip and ground biomass, which were subsequently removed from the total dataset for PLSR model development. Table 3 shows PLSR modeling results for chipped biomass, and Table 4 shows the results for ground biomass, bolded model showing the best performance in each parameters.

3.2.1. Moisture Content (%)

Table 2 shows the count of outliers identified for MC in chip (based on reference data) and ground biomass (based on both reference and spectroscopic data). After the removal of these outliers, PLSR based models were developed to evaluate the percentage of MC in chip and ground biomass. The model performance was assessed across various preprocessing techniques, and the best-performing model, selected as the optimal choice among five different types of PLSR−based models, was highlighted in bold in both Table 3 and Table 4.
In Figure 9a), the scatter plot displays the measured and predicted MC (%) values for wood and non−wood samples in both the calibration and validation sets, using GA−PLSR. The best performance was achieved with GA−PLSR utilizing 10 LVs, D2 spectral pretreatment, and 57 important wavenumbers (Table 3). A notable 2.82% improvement in explained variance from Full−PLRS (R2P = 0.8417) to GA−PLSR (R2P = 0.8654) was observed. This improvement can be attributed to GA−PLSR efficient feature selection and model parameter optimizations, resulting in a more accurate and precise predictive model. The increased explained variance indicates a stronger correlation between the spectral data (independent variable) and the reference data (dependent variable), ultimately enhancing the predictive capabilities of the model.
Similarly, Figure 9e) displays a scatter plot depicting the measured and predicted MC (%) of ground biomass using mean centering. The Full-PLSR model, employing 12 LVs and mean centering as spectral preprocessing, exhibited the best performance (Table 4). An enhancementof 19.72% in explained variance was observed, transitioning from Full-PLSR using raw spectra (R2P=0.6752) to Full-PLSR employing mean centering (R2P=0.7868). Furthermore, the use of mean centering indicates that the model accuracy can be improved by removing baseline shifts, aligning spectral data around a common central point, and mitigating systematic variations caused by instrumental effects such as intensity fluctuations.
Figure 10 displays the average absorbance spectra after undergoing D2 pretreatment, with 57 significant wavenumbers highlighted in red to emphasize their importance as identified through GA. The highest peaks were observed at 3722, 4525, 5000, and 5285 cm−1, all of which could have the potential to enhance the model performance derived from GA−PLSR for MC (%).
Similarly, Figure 11 illustrates the regression coefficients for the MC (%) of ground biomass using Full−PLSR with spectra preprocessed through mean centering. Significant peaks were identified at 3650, 4902, 7042, and 8163 cm−1, which could have a substantial impact on enhancing the model performance.
Table 5 [46] tabulates the bond vibration and corresponding functional group, spectra−structure, and material type which were found in the featured wavelength plot and regression coefficient plots by near infrared spectroscopy analysis of chip and ground biomass for proximate analysis parameters which derived from the best performing models in this study. Figure 10, Figure 12, Figure 14 and Figure 16 for chip models and Figure 11, Figure 13, Figure 15 and Figure 17 for ground biomass models for MC, VM, FC and ash, respectively.

3.2.2. Volatile Matter (%)

For VM, no outliers were identified in either chip or ground biomass (Table 3). Therefore, models were developed as explained in section 2.6. For both chip and ground biomass, the PLSR model developed using the spectral MP 5−range methods yielded the highest performance.
For chip biomass, the best result was obtained with a spectral preprocessing combination set of 4, 4, 3, 2, 0, i.e., D1, D1, MSC, SNV, and zero, respectively, from the range 3625.72−12,489.48 cm−1, which is equally divided into five sections. Figure 9 b) displays the scatter plot of measured versus predicted percentages of VM in wood and non−wood samples for the calibration and validation sets. When compared with the best model performance (Table 3) obtained from Full−PLSR using a constant offset (R2P of 0.7725), the MP 5 range−PLSR method resulted in a 2.7443% improvement in explained variance.
Figure 12 shows the regression coefficient plot for the % of VM in chip biomass, derived from the MP PLSR 5−range method. The important peaks were observed at the wavenumbers 4019, 5495, 5865, and 6944 cm−1 (Table 5), which could significantly influence the improved performance of the selected model.
The model achieved its best performance for VM (%) in the ground biomass with a spectral preprocessing combination set of 3, 5, 4, 5, and 0, i.e., MSC, D2, D1, D2, and zero absorbance − from the range 3625.72−12,489.48 cm−1, equally divided into five sections. This MP PLSR 5−range resulted in an R2P of 0.7733 (Table 4), a signifying a notable improvement of 2.3290% compared to Full−PLSR (R2P = 0.7557).
As shown in Figure 13, significant peaks were observed at wavenumbers 3693, 4505, 5200, 5735, and 5952 cm−1 (Table 5), all of which could exert a notable influence on the enhanced performance of the model using the MP PLSR−5 range method. Within the wavenumber range between 10,722.9−12,489.48 cm−1, the regression coefficient was absorbed zero. This suggest that this range lack relevant spectral information for predicting VM in ground biomass.

3.2.3. Fixed Carbon (%)

Table 2 presents the outliers counts for FC in chip and ground biomass. No outliers were detected in the chip biomass dataset. However, using the reference and spectro-scopic data, 22 ground biomass samples were identified as outliers. These identified outliers were subsequently removed from the total ground biomass dataset, after which PLSR models were developed following the procedures outlined in section 2.6.
The model developed using Full−PLSR, constant offset spectral preprocessing, and 15 LVs yielded better results for FC (%) in chip biomass (Table 3). These results also indicate that model accuracy can be improved by correcting the baseline shift in the raw spectra using a constant offset. Figure 9c displays the scatter plot of measured versus predicted % of FC for non−wood and wood samples in the calibration and validation set, obtained from Full−PLSR with constant offset preprocessing.
Figure 14 displays the regression coefficient plot for the % of FC in chip biomass using Full−PLSR with spectral preprocessing of constant offset. Significant peaks were observed at 3722, 4405, 5952, and 8754 cm−1 (Table 5), which are likely to con-tribute significantly to the enhancement of model performance.
The Full−PLSR model with spectral preprocessing using the D1 provides improved model performance for % of FC in the ground biomass (Table 4). These results indicate that the reduction in linear shift and enhancement of the peaks in the raw spectra achieved by applying the D1 with a gap and segment of 5 each contribute to the improvement in model accuracy.
Figure 15 presents the regression coefficient plot for the % of FC in ground biomass using Full−PLSR with D1 spectral preprocessing. Significant peaks were observed at 3733, 4099, 5903, and 11655 cm−1 (Table 5), each of which could potentially exert an influence on the model improved performance.

3.2.4. Ash Content (%)

Table 2 presents the outliers counts for ash content in chip and ground biomass, based on reference and spectroscopic data. These identified outliers were removed from the total dataset, and various PLSR−based models were developed as explained in section 2.6. For chip biomass, the best−performing model was developed using the MP PLSR−3 range methods with multi-preprocessing combination set of 5, 4, 4. For ground biomass, the Full−PLSR model with spectral preprocessing from D1 provided the best performance results.
Figure 9d shows a scatter plot comparing the measured and predicted % ash content of non−wood and wood samples within the calibration and validation datasets. These results were obtained using the MP PLSR−3 range method. The most favorable outcomes were achieved through a spectral preprocessing combination set, consisting of the D2 from 3594.87 to 5492.59 cm−1, the D1 from 7498.31 to 5500.30 cm−1, and the D1 from 7506.02 to 12,489.48 cm−1, respectively (Table 3). Despite the MP PLSR−3 range method enhancing the explained variance by 4.6077%, in comparison with Full−PLSR with R2P of 0.7596.
Figure 16 depicts the regression coefficient plot for the % ash content in chip biomass using the multi−preprocessing PLSR 3−range method, incorporating a combination set of 5, 4, and 4. The plot reveals significant peaks at 4019, 5285, 11,655, and 12,048 cm−1 (Table 5). These peaks are expected to have a substantial impact on achieving the highest-performing model for assessing the % ash content in chip biomass.
The best performance for % ash content in the ground biomass was achieved by using Full−PLSR with spectral preprocessing involving the D1, along with 13 LVs (Table 4). The results also indicate that removing baseline shift, resolving overlapping peaks, and highlighting detailed structure in raw spectra will contribute to improving the model accuracy. In Figure 9h), a scatter plot illustrates the relationship between measured and predicted ash content percentages of non−wood and wood samples in both the calibration and validation sets, demonstrating the model high performance.
Figure 17 displays the regression coefficients for the % of ash content in the ground biomass using Full−PLSR with a spectral preprocessing technique involving the D1 (segment=5 and gap=5). The crucial peaks that may significantly impact the model performance are observed at wavenumbers 4505, 5285, 11,655, and 12,300 cm−1 (Table 5).

3.3. Limit of Quantification (LOQ)

In this study, the LOQ, defined as the lowest possible concentration of analyte that can be reliably detected and quantified with acceptable level of accuracy and precision, is determined using the SD of the response and the slope obtained from the calibration set of the selected model [28]. Table 6 presents the LOQ results of the PLSR models for the percentages of MC, VM, FC, and ash content in both chip and ground biomass.
The LOQ for VM (%) in both chip and ground biomass, as well as for FC (%) in chip biomass, is below the minimum reference value used in the modeling. This suggests that the selected model for VM and FC in chip biomass has the potential to reliably detect and precisely quantify these parameters. This indicates its high sensitivity. For instance, the chosen MP method (PLSR−5 range) can reliably detect VM in both chip and ground biomass, as indicated by the model LOQ values i.e., 23.4294 % and 29.1656%, respectively. Similarly, using Full−PLSR with constant offset preprocessing, FC in chip biomass can be reliably detected with an LOQ value of 6.9125%.
However, for other parameters such as MC, ash, and FC in ground biomass, the LOQ values exceed the minimum reference value used in model development. This indicates that the selected models have limitations in assessing these parameters for only not more than the minimum value of the model development. From Table 6, the slope of every model was same but the SD of MC, ash and FC were lower than VM, espectially, for MC, the SD was very low leading to the high value of LOQ. Though, the MC and FC were the constituents which were the good absorber of NIR radiation but in case of LOQ calculation, it was not concerned to absorption of radiation but the slope of the model and the SD of the data of the calibration set constituents must be high (wide range). Additionally, considering alternative modeling methods for predicting proximate parameters, especially when their LOQ values surpass the minimum reference value used in modeling, can enhance sensitivity, as illustrated by the NIR model LOQ.

3.4. Effect of Combined Wood Samples with Non-Wood Samples in Developed Models

Table 7 presents the range of reference values for the % of MC, VM, FC, and ash content found in both wood and non−wood biomass samples within the calibration set and validation set. Based on Figure 9 and Table 7, the non−wood samples are characterized by a wider range in the MC, VM, FC, and ash content in both calibration and prediction sets. It becomes apparent that the range of each proximate parameter, whether in chip and ground form, widens after the inclusion of non−wood samples alongside the wood samples. This expansion aims to establish a more resilient PLSR model for predicting proximate parameters.
In Figure 9a,b, the range of MC (%) and VM (%) in wood chip biomass samples was narrower than that of non−wood samples, with the latter extending to both higher and lower percentages. Figure 9c illustrates that the range of FC values for wood samples was higher and narrower compared to that of the non−wood samples. The inclusion of non−wood samples expanded the range, mostly towards the lower percentage values. In contrast, Figure 9d, depicting ash content in chip biomass, reveals a lower and narrower range for both wood and non−wood samples compare to other proximate parameters. Consequently, the performance of the optimal model was comparatively lower for ash content in comparison to MC, VM, and FC.
Similarly, in ground biomass, the range of non−wood samples is higher compared to wood samples, except for FC. Similar to chip biomass, the range of MC (%) (Figure 9e) and VM (%) (Figure 9f) values for wood samples in ground biomass is narrower compared to that of the non−wood samples. Therefore, the inclusion of non−wood samples contributes to expanding the range to both higher and lower percentages. In Figure 9g, the percentage range of FC in wood samples is higher and wider compared to that of the non−wood samples. The inclusion of both wood and non−wood samples with a wider range has contributed to a better performance of the model compared to other parameters. From Figure 9h, the ash content (%) range in wood samples appears lower and narrower. However, the inclusion of non−wood samples towards higher % has notably expanded the ash content range, ultimately enhancing the model performance.

3.5. Comparison of the Model Performance Between Chip and Ground Biomass

In this section, the performance of the best models for analyzing proximate parameters in chip and ground biomass were compared, as detailed in Table 3 and Table 4. The primary difference in the NIR scanning process was that chip biomass was scanned using the diffuse reflectance sphere macro sample rotating mode in a large amount, while ground biomass was scanned using the transflectance mode in a small amount. All other scanning conditions, including resolution, background scan time, sample scan time, and controlled laboratory settings, remained consistent, particularly in terms of temperature. The results revealed contrasting outcomes for chip and ground biomass, with chip biomass outperforming in all proximate parameters except for ash content. This observation is particularly significant when considering that ground biomass samples are typically more homogeneous. One possible explanation for this difference may be attributed to the scanning mode employed for biomass analysis, specifically the use of diffuse reflectance sphere macro sample rotating mode. Chip biomass samples inherently vary in particle size and shape, and introducing the sphere macro sample rotating mode during diffuse reflectance scanning allows the samples to rotate in a larger cup with more constituent variation, facilitating high-throughput screening and aiding in averaging any inhomogeneities, thereby providing a more representative measurement of the entire sample. Consequently, this enhanced scanning technique contributes to the improved performance observed in chip biomass analysis. During transflectance scanning, although light diffuses better within homogeneous materials, the sample size is too small and the constituent variation is minimal. Additionally, there may be a possibility of small NIR leakage via the glass vial edge, which might not provide useful information, resulting in comparatively lower performance compared to chip biomass analysis. For model of ash content, the chip model was poorer than the ground biomass model. The ash determination method by TGA in this experiment was by combustion up to 700 °C which is above 550 °C and therefore there is a risk in loosing some of the potassium and hence underestimate the ash content while in chip there was full content of these mineral.

3.6. Comparison with Previous Work

Several research studies have utilized NIR spectroscopy for proximate analysis of various biomass types. Typically, these studies have focused on specific biomass varieties. However, this study differs by developing models that encompass ten commonly used wood and non−wood biomass for energy purposes. The goal is to create a unified model for non−destructive assessment of proximate parameters. Unfortunately, the distribution of constituent parameters within both wood and non−wood is linear, resulting in a broad range for each parameter without clear separation into distinct groups. This study marks the first instance of combining these two types of data for estimation of proximate parameter analysis.
For MC analysis, Toscano et al. (2022) [18] conducted an evaluation of MC in industrial wood chips, analyzing a total of 817 samples. They reported R2P values ranging from 0.86 to 0.89. Similarly, Posom et al. (2022) [17] employed PLS regression to measure MC in cane bagasse, achieving an R2P value of 0.90. Sirisomboon et al. (2020) [19] assessed MC in bamboo chips, obtaining an R2P value of 0.96, while Adnan et al. (2017) [22] predicted the MC in intact green coffee beans, resulting in an R2P value of 0.96. Shrestha et al. (2018) [20] evaluated MC in bamboo chips using NIR−gun (600−1100 nm) and reported an R2P value of 0.92, whereas they achieved an R2P value of 0.74 using Micro−NIR (1150−215 nm). Similarly, Xue et al. (2015) [47] measured MC in corn stover with an R2C value of 0.81. In this study, the model performance is comparatively lower for both chip (R2P =0.8654) and ground (R2P =0.7868) biomass compared to the previous research mentioned above.
Similarly, for VM analysis for non−wood sample, previous research recorded R2P values of 0.81 for chip bamboo [19], 0.82 for ground bamboo [21], and 0.68 for corn stover [47]. In this study, the R2P for VM in chip biomass (0.79) and ground biomass (0.77) are both lower when compared to the chip bamboo [19] and ground bamboo [21] respectively. However, when compared to the performance of corn stover, both the chip and ground combined wood and non−wood models exhibited better performance.
For FC, previous research recorded R2P values of 0.81 for chip bamboo [19], 0.66 for corn stover [47], and 0.85 for ground bamboo [21]. The model performance for FC in this study on chip biomass (R2P = 0.88) is better compared to the previous study.
For ash, being inorganic, it does not absorb NIR radiation [19,21]. However, the model developed for ash content in wood and non−wood chip (R2P of 0.7946) and ground biomass of wood and non−wood (R2P of 0.8005) in this study proves to be a reliable predictor. Previous research of non−wood model has reported R2P values of 0.86 for chip bamboo [19], 0.85 for corn stover [47], and 0.51 for ground bamboo [21]. Compared to the previous study, the performance of chip combine wood and non−wood is lower compared to chip bamboo [19] and corn stover [47]. However, the performance of ground combine model is comparatively higher than that of ground bamboo [21].
As recommended by Williams et al. (2019) [41], a model achieving an R2 of 0.83−0.90 can be cautiously utilized for any applications, including research and from 0.66 to 0.81 is considered suitable for rough screening. Moreover, based on Zornoza et al. (2008) [42], an RPD value between 2 and 2.5 enables approximate quantitative predictions and 2.5 and 3 are indicative of good prediction. Therefore, the model selected in this research for non-destructive MC (R2P of 0.8654 and RPD of 2.9) and FC (R2P of 0.8773 and RPD of 3) assessment in chip biomass can be cautiously employed for various applications, including research while the other models i.e., VM and ash models from chip and MC, VM, FC and ash model from ground biomass can be applied for screening purpose only (R2P of 0.7733−0.8005 and RPD was 2.2−2.3). So it is advisable to employ the chosen model derived from chip wood and non−wood when evaluating proximate parameters for practical applications, particularly MC and FC.

4. Conclusions

In this study, proximate parameters (MC, VM, FC, and ash content) were determined using a TGA. PLSR models were developed and compared utilizing NIR spectroscopy within a wavenumber range of 3594.87 to 12,489.48 cm−1 to assess these proximate analysis parameters, both in chip and ground biomass. These models were constructed from raw spectra, traditional preprocessing approaches, GA, SPA, and spectral MP with 5 range and 3 range methods.
The findings of this study underscore the potential of NIR spectroscopy as a promising alternative tool for the quantitative prediction of MC and FC in chip biomass. The selected model for MC and FC in chip biomass is suitable for most applications, including research, but should be used with caution. As for the remaining proximate parameters, i.e., MC and FC in ground biomass, and VM and ash content in both chip and ground biomass, the models exhibit fair performance and are primarily suitable for rough screening purposes. The models perform more accurately in chip biomass, highlighting the efficacy of diffuse reflectance sphere macro sample rotating mode compared to the transflectance mode of scanning in ground biomass. In addition, the chip biomass model can be preferred for proximate analysis instead of the ground biomass model, as it eliminates the need for biomass grounding, saving time, labor, and costs prior to scanning.
The model developed in this study proved to be more robust than those in previous studies. This is attributed to the wider variation in different types of biomass, including fast−growing trees and agricultural residues, whereas previous studies were focused on specific biomass species. The accuracy, robustness, and sensitivity of these models can be further enhanced by expanding the dataset with samples from diverse sources and validating their performance with unknown samples before industrial application. Implementing the selected models will benefit researchers, engineers, and industries aiming to design thermochemical conversion systems that can select suitable biomass and efficiently extract maximum energy from it, all while minimizing energy consumption and environmental impact.

Author Contributions

Conceptualization, B.S., J.P., P.S. and B.P.S; Methodology, B.S., J.P., P.S. and B.P.S; Software, B.S. and J.P.; Formal analysis, B.S. and J.P.; Investigation, B.S.; Resources, B.S. and B.P.S.; Data curation, B.S., J.P. and P.S.; Writing the original draft, B.S.; Writing–review & editing, B.S., J.P., P.S., B.P.S and A.F; Visualization, B.S.; Supervision, J.P., P.S. B.P.S and A.F; Project administration, P.S.; Funding acquisition, P.S.; Validation, B.P.S and A.F. All authors have read and agreed to the published version of the manuscript.

Acknowledgments

The authors would like to express their sincere gratitude to the Near-Infrared Spectroscopy Research Center for Agricultural Product and Food, Department of Agricultural Engineering, School of Engineering at King Mongkut’s Institute of Technology Ladkrabang, Bangkok, Thailand, for their generous research funding support provided through the KMITL doctoral scholarship (KDS 2020/052).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

C = carbon O = oxygen
D1 = first derivative PLSR = partial least squares regression
D2 = second derivative R2 = coefficient of determination
DTG = derivative thermogravimetric R2C = coefficient of determination of calibration set
FC = fixed carbon R2P = coefficient of determination of prediction set
FT = Fourier transform RPD = ratio of prediction to deviation
GA = genetic algorithm RMSE = root mean square error
H = hydrogen S = sulfur
LVs = number of latent variables SC = slope of the regression line
LOQ = limit of quantification SD = standard deviation
Max = maximum SEC = standard error of calibration set
MC = moisture content SEL = standard error of laboratory
Min = minimum SEP = standard error of prediction set
Mean = average SNV = standard normal variate
MSC = multiplicative scatter correction SPA = successive projection algorithm
MP = multi-preprocessing TG = thermogravimetric
N = nitrogen VM = volatile matter
NT = total number of sample σC = standard deviation of residual of calibration set
Nc = number of sample in calibration set
Np = number of sample in prediction set
NIRS = near infrared spectroscopy

References

  1. Zhang, Y.; Wang, H.; Sun, X.; Wang, Y.; Liu, Z. Separation and Characterization of Biomass Components (Cellulose, Hemicellulose, and Lignin) from Corn Stalk. BioResources 2021, 16(4), 7205–7219. [Google Scholar] [CrossRef]
  2. Nimmanterdwong, P.; Chalermsinsuwan, B.; Piumsomboon, P. Prediction of lignocellulosic biomass structural components from ultimate/proximate analysis. Energy 2021, 222, 119945. [Google Scholar] [CrossRef]
  3. Saldarriaga, J.F.; Aguado, R.; Pablos, A.; Amutio, M.; Olazar, M.; Bilbao, J. Fast characterization of biomass fuels by thermogravimetric analysis (TGA). Fuel 2015, 140, 744–751. [Google Scholar] [CrossRef]
  4. Teh, J.S.; Teoh, Y.H.; How, H.G.; Sher, F. Thermal Analysis Technologies for Biomass Feedstocks: A State-of-the-Art Review. Processes 2021, 9, 1610. [Google Scholar] [CrossRef]
  5. Demirbas, A. Combustion characteristics of different biomass fuels. Prog. Energy Combust. Sci 2004, 30(2), 219–230. [Google Scholar] [CrossRef]
  6. Aich, S.; Behera, D.; Nandi, B.K.; Bhattacharya, S. Relationship between proximate analysis parameters and combustion behaviour of high ash Indian coal. Int. J. Coal Sci 2020, 7, 766–777. [Google Scholar] [CrossRef]
  7. Demirbas, A. Effects of moisture and hydrogen content on the heating value of fuels. ENERG SOURCE PART A 2007, 29(7), 649–655. [Google Scholar] [CrossRef]
  8. Garcíaม, R.; Pizarro, C.; Lavín, A.G.; Bueno, J.L. Biomass proximate analysis using thermogravimetry. Bioresour. Technol. 2013, 139, 1–4. [Google Scholar] [CrossRef]
  9. Kongto, P.; Palamanit, A.; Ninduangdee, P.; Singh, Y.; Chanakaewsomboon, I.; Hayat, A.; Wae-hayee, M. Intensive exploration of the fuel characteristics of biomass and biochar from oil palm trunk and oil palm fronds for supporting increasing demand of solid biofuels in Thailand. Energy Rep 2022, 8, 5640–5652. [Google Scholar] [CrossRef]
  10. Jia, Y.; Li, Z.; Wang, Y.; Wang, X.; Lou, C.; Xiao, B.; Lim, M. Visualization of Combustion Phases of Biomass Particles: Effects of Fuel Properties. ACS omega 2021, 6(42), 27702–27710. [Google Scholar] [CrossRef]
  11. Vassilev, S.V.; Vassileva, C.G.; Song, Y.-C.; Li, W.-Y.; Feng, J. Ash contents and ash-forming elements of biomass and their significance for solid biofuel combustion. Fuel 2017, 208, 377–409. [Google Scholar] [CrossRef]
  12. Pazó, J.A.; Granada, E.; Saavedra, A.; Eguía, P.; Collazo, J. Biomass thermogravimetric analysis: Uncertainty determination methodology and sampling maps generation. Int. J. Mol. Sci. 2010, 11(7), 2701–2714. [Google Scholar] [CrossRef] [PubMed]
  13. Özyuğuran, A.; Yaman, S. Prediction of calorific value of biomass from proximate analysis. Energy Procedia 2017, 107, 130–136. [Google Scholar] [CrossRef]
  14. Sahito, A.R.; Mahar, R.; Siddiqui, Z.; Brohi, K. Estimating calorific values of lignocellulosic biomass from volatile and fixed solids. Int. J. Renew. Energy Res. 2013, 2(1), 1–6. [Google Scholar] [CrossRef]
  15. Posom, J.; Maraphum, K.; Phuphaphud, A. Rapid Evaluation of Biomass Properties Used for Energy Purposes Using Near-Infrared Spectroscopy. Renewable Energy-Technologies and Applications 2020, IntechOpen.
  16. Torquato, L.D.M.; Crnkovic, P.M.; Ribeiro, C.A.; Crespi, M.S. New approach for proximate analysis by thermogravimetry using CO 2 atmosphere: Validation and application to different biomasses. J. Therm. Anal. Calorim 2017, 128, 1–14. [Google Scholar] [CrossRef]
  17. Posom, J.; Phuphaphud, A.; Saengprachatanarug, K.; Maraphum, K.; Saijan, S.; Pongkan, K.; Srimai, K. Real-time measuring energy characteristics of cane bagasse using NIR spectroscopy. Sens. Bio-Sens. Res. 2022, 38, 100519. [Google Scholar] [CrossRef]
  18. Toscano, G.; Leoni, E.; Gasperini, T.; Picchi, G. Performance of a portable NIR spectrometer for the determination of moisture content of industrial wood chips fuel. Fuel 2022, 320, 123948. [Google Scholar] [CrossRef]
  19. Sirisomboon, P.; Funke, A.; Posom, J. Improvement of proximate data and calorific value assessment of bamboo through near infrared wood chips acquisition. Renew. Energy 2020, 147, 1921–1931. [Google Scholar] [CrossRef]
  20. Shrestha, A.; Sirisomboon, P. Rapid evaluation of moisture content in bamboo chips using diode array near infrared spectroscopy. MATEC Web of Conferences - ICEAST 2018 2018.
  21. Posom, J.; Sirisomboon, P. Evaluation of the higher heating value, volatile matter, fixed carbon and ash content of ground bamboo using near infrared spectroscopy. J. Near Infrared Spectrosc. 2017, 25(5), 301–310. [Google Scholar] [CrossRef]
  22. Adnan, A.; Hörsten, D.V.; Pawelzik, E.; Mörlein, A.D. Rapid Prediction of Moisture Content in Intact Green Coffee Beans Using Near Infrared Spectroscopy. Foods 2017, 6(5), 38. [Google Scholar] [CrossRef]
  23. Shrestha, B.; Posom, J.; Sirisomboon, P.; Shrestha, B.P. Comprehensive Assessment of Biomass Properties for Energy Usage Using Near-Infrared Spectroscopy and Spectral Multi-Preprocessing Techniques. Energies 2023, 16, 5351. [Google Scholar] [CrossRef]
  24. Stolov, A.A.; Simoff, D.A.; Li, J. Thermal stability of specialty optical fibers. J. Light. Technol. 2008, 26(20), 3443–3451. [Google Scholar] [CrossRef]
  25. Shrestha, A.; Saechua, W.; Sirisomboon, P. Some physical and combustion characteristic of Leucaena Leucocephala pellet. In Proceedings of the 16th TSAE national conference and 8th TSAE international conference, agricultural and food engineering outlook on sustainable future, Bangkok international trade & exhibition centre, Bangkok, Thailand, 17-19 March 2015. [Google Scholar]
  26. Díez, D.; Urueña, A.; Piñero, R.; Barrio, A.; Tamminen, T. Determination of Hemicellulose, Cellulose, and Lignin Content in Different Types of Biomasses by Thermogravimetric Analysis and Pseudocomponent Kinetic Model (TGA-PKM Method). Processes 2020, 8, 1048. [Google Scholar] [CrossRef]
  27. López-González, D.; Fernandez-Lopez, M.; Valverde, J.L.; Sanchez-Silva, L. Thermogravimetric-mass spectrometric analysis on combustion of lignocellulosic biomass. Bioresour. Technol. 2013, 143, 562–574. [Google Scholar] [CrossRef] [PubMed]
  28. Shrestha, B.; Shrestha, Z.; Posom, J.; Sirisomboon, P.; Shrestha, B.P. Evaluating limit of detection and quantification for higher heating value and ultimate analysis of fast-growing trees and agricultural residues biomass using NIRS. EASR. 2023, 50(6), 612–618. [Google Scholar]
  29. Broad, N.; Graham, P.; Hailey, P.; Hardy, A. Guidelines for the development and validation of near-infrared spectroscopic methods in the pharmaceutical industry. Handbook of vibrational spectroscopy 2002, 5, 3590–3610. [Google Scholar]
  30. Roger, J.-M.; Mallet, A.; Marini, F. Preprocessing NIR Spectra for Aquaphotomics. Molecules 2022, 27(20), 6795. [Google Scholar] [CrossRef] [PubMed]
  31. Robert, G.; Gosselin, R. Evaluating the impact of NIR pre-processing methods via multiblock partial least-squares. Anal. Chim. Acta 2022, 1189, 339255. [Google Scholar] [CrossRef] [PubMed]
  32. Posom, J.; Phuphaphud, A.; Saengprachatanarug, K.; Maraphum, K.; Saijan, S.; Pongkan, K.; Srimai, K. Real-time measuring energy characteristics of cane bagasse using NIR spectroscopy. Sens. Bio-Sens. Res. 2022, 38, 100519. [Google Scholar]
  33. Jiang, W.; Lu, C.; Zhang, Y.; Ju, W.; Wang, J.; Hong, F.; Wang, T.; Ou, C. Moving-window-improved Monte Carlo uninformative variable elimination combining successive projections algorithm for near-infrared spectroscopy (NIRS). J. Spectrosc. 2022, 2020, 1–12. [Google Scholar] [CrossRef]
  34. Pitak, L.; Sirisomboon, P.; Saengprachatanarug, K.; Wongpichet, S.; Posom, J. Rapid elemental composition measurement of commercial pellets using line-scan hyperspectral imaging analysis. Energy 2021, 220, 119698. [Google Scholar] [CrossRef]
  35. Koljonen, J.; Nordling, T.E.; Alander, J.T. A review of genetic algorithms in near infrared spectroscopy and chemometrics: Past and future. J. Near Infrared Spectrosc. 2008, 16(3), 189–197. [Google Scholar] [CrossRef]
  36. Fei, Q.; Li, M.; Wang, B.; Huan, Y.; Feng, G.; Ren, Y. Analysis of cefalexin with NIR spectrometry coupled to artificial neural networks with modified genetic algorithm for wavelength selection. Chemometr. Intell. Lab. Syst. 2009, 97(2), 127–131. [Google Scholar] [CrossRef]
  37. Maraphum, K.; Ounkaew, A.; Kasemsiri, P.; Hiziroglu, S.; Posom, J. Wavelengths Selection Based on Genetic Algorithm (GA) and Successive Projections Algorithms (SPA) Combine With PLS Regression for Determination the Soluble Solids Content in Nam-DokMai Mangoes Based on Near Infrared Spectroscopy. Eng. Appl. Sci. Res. 2021, 49, 119–126. [Google Scholar]
  38. Araújo, M.C.U.; Saldanha, T.C.B.; Galvão, R.K.H.; Yoneyama, T.; Chame, H.C.; Visani, V. The successive projections algorithm for variable selection in spectroscopic multicomponent analysis. Chemometr. Intell. Lab. Syst. 2001, 57(2), 65–73. [Google Scholar] [CrossRef]
  39. Chen, Y.M.; Lin, P.; He, Y.; He, J.Q.; Zhang, J.; Li, X. L. Fast quantifying collision strength index of ethylene-vinyl acetate co-polymer coverings on the fields based on near infrared hyperspectral imaging techniques. Sci. Rep. 2016, 6(1), 20843. [Google Scholar] [CrossRef] [PubMed]
  40. Li, C.; He, M.; Cai, Z.; Qi, H.; Zhang, J.; Zhang, C. Hyperspectral Imaging with Machine Learning Approaches for Assessing Soluble Solids Content of Tribute Citru. Foods 2023, 12, 247. [Google Scholar] [CrossRef] [PubMed]
  41. Williams, P.; Manley, M.; Antoniszyn, J. In Near infrared technology: Getting the best out of light., African Sun Media, 2019.
  42. Zornoza, R.; Guerrero, C.; Mataix-Solera, J.; Scow, K.M.; Arcenegui, V.; Mataix-Beneyto, J. Near infrared spectroscopy for determination of various physical, chemical and biochemical properties in Mediterranean soils. Soil Biol. Biochem. 2008, 40(7), 1923–1930. [Google Scholar] [CrossRef] [PubMed]
  43. Guideline, I.H.T. Validation of analytical procedures: Text and methodology. Q2 (R1) 2005, 1(20), 05. [Google Scholar]
  44. Werther, J.; Saenger, M.; Hartge, E.-U.; Ogada, T.; Siagi, Z. Combustion of agricultural residues. Prog. Energy Combust. Sci. 2000, 26(1), 1–27. [Google Scholar] [CrossRef]
  45. Yuan, Y.; Yong, H.; Tan, J.; Wang, Y.; Kumar, S.; Wang, Z. Co−Combustion characteristics of typical biomass and coal blends by thermogravimetric analysis. Front. Energy Res. 2021, 9, 753622. [Google Scholar] [CrossRef]
  46. Workman Jr, J.; Weyer L. In Practical guide to interpretive near-infrared spectroscopy, CRC press. 2007.
  47. Xue, J.; Yang, Z.; Han, L.; Liu, Y.; Liu, Y.; Zhou, C. On-line measurement of proximates and lignocellulose components of corn stover using NIRS. Appl. Energy 2015, 137, 18–25. [Google Scholar] [CrossRef]
Figure 1. Flowchart depicting the overall research methodology for the rapid assessment of biomass proximate analysis parameters using TGA, while using NIRS with PLSR is an alternative.
Figure 1. Flowchart depicting the overall research methodology for the rapid assessment of biomass proximate analysis parameters using TGA, while using NIRS with PLSR is an alternative.
Preprints 94143 g001
Figure 2. FT−NIR spectrometer scanning of (a) chip biomass in diffuse reflectance and sphere macro sample rotating mode using a quartz bottom sample cup, and (b) ground biomass in transflectance mode using a glass vial.
Figure 2. FT−NIR spectrometer scanning of (a) chip biomass in diffuse reflectance and sphere macro sample rotating mode using a quartz bottom sample cup, and (b) ground biomass in transflectance mode using a glass vial.
Preprints 94143 g002
Figure 3. Raw spectra of (a) chip biomass obtained from diffuse reflectance mode of FT−NIRS scanning and (b) ground biomass obtained from transflectance mode of FT−NIRS scanning across the entire wavenumber range of 3594.87 to 12,489.48 cm−1.
Figure 3. Raw spectra of (a) chip biomass obtained from diffuse reflectance mode of FT−NIRS scanning and (b) ground biomass obtained from transflectance mode of FT−NIRS scanning across the entire wavenumber range of 3594.87 to 12,489.48 cm−1.
Preprints 94143 g003
Figure 4. Representative particle size distribution of the ground biomass ranging from 0.01 to 3080 µm.
Figure 4. Representative particle size distribution of the ground biomass ranging from 0.01 to 3080 µm.
Preprints 94143 g004
Figure 5. The TG and DTG curves obtained from TGA analysis of Nepalese biomass.
Figure 5. The TG and DTG curves obtained from TGA analysis of Nepalese biomass.
Preprints 94143 g005
Figure 6. (a) TG and (b) DTG profiles of fast−growing tree and agricultural residue biomass obtained using TGA within the temperature range of 35 to 700 °C with a heat flow rate of 10°C/min.
Figure 6. (a) TG and (b) DTG profiles of fast−growing tree and agricultural residue biomass obtained using TGA within the temperature range of 35 to 700 °C with a heat flow rate of 10°C/min.
Preprints 94143 g006
Figure 7. The average pretreated spectra for: (a) MC with the second derivative, (b) VM with a multi-preprocessing combination set of 4, 4, 3, 2, and 0, (c) FC with a constant offset, and (d) ash content with the first derivative.
Figure 7. The average pretreated spectra for: (a) MC with the second derivative, (b) VM with a multi-preprocessing combination set of 4, 4, 3, 2, and 0, (c) FC with a constant offset, and (d) ash content with the first derivative.
Preprints 94143 g007
Figure 8. Overall proximate analysis results of ten different biomass varieties, including four wood varieties of fast−growing trees and six non−wood varieties (five agricultural residue and 1 fast−growing tree) obtained from TGA analysis.
Figure 8. Overall proximate analysis results of ten different biomass varieties, including four wood varieties of fast−growing trees and six non−wood varieties (five agricultural residue and 1 fast−growing tree) obtained from TGA analysis.
Preprints 94143 g008
Figure 9. Measured versus predicted value in calibration set and validation set for chip biomass (a) MC, (b) VM, (c) FC, and (d) Ash and for ground biomass (e) MC, (f) VM, (g) FC, and (h) Ash.
Figure 9. Measured versus predicted value in calibration set and validation set for chip biomass (a) MC, (b) VM, (c) FC, and (d) Ash and for ground biomass (e) MC, (f) VM, (g) FC, and (h) Ash.
Preprints 94143 g009
Figure 10. The average absorbance value of MC (%) of chip biomass obtained using second derivative preprocessing with a selection of important wavenumbers obtained from GA, within full wavenumber range of 3594.87–12,489.5 cm−1.
Figure 10. The average absorbance value of MC (%) of chip biomass obtained using second derivative preprocessing with a selection of important wavenumbers obtained from GA, within full wavenumber range of 3594.87–12,489.5 cm−1.
Preprints 94143 g010
Figure 11. The regression coefficient for the MC (%) of ground biomass using the Full−PLSR with spectral preprocessing of mean centering.
Figure 11. The regression coefficient for the MC (%) of ground biomass using the Full−PLSR with spectral preprocessing of mean centering.
Preprints 94143 g011
Figure 12. Regression coefficient for the % of VM in chip biomass using the MP PLSR 5−range method with a spectral multi-preprocessing combination set of 4, 4, 3, 2, 0.
Figure 12. Regression coefficient for the % of VM in chip biomass using the MP PLSR 5−range method with a spectral multi-preprocessing combination set of 4, 4, 3, 2, 0.
Preprints 94143 g012
Figure 13. Regression coefficient for the % of VM in ground biomass using the MP PLSR 5−range method with a spectral multi-preprocessing combination set of 3, 5, 4, 5, and 0.
Figure 13. Regression coefficient for the % of VM in ground biomass using the MP PLSR 5−range method with a spectral multi-preprocessing combination set of 3, 5, 4, 5, and 0.
Preprints 94143 g013
Figure 14. Regression coefficient for the % of FC in chip biomass using Full−PLSR method with a spectral preprocessing of constant offset.
Figure 14. Regression coefficient for the % of FC in chip biomass using Full−PLSR method with a spectral preprocessing of constant offset.
Preprints 94143 g014
Figure 15. Regression coefficient for the % of FC in ground biomass using Full−PLSR method with a spectral preprocessing of first derivative.
Figure 15. Regression coefficient for the % of FC in ground biomass using Full−PLSR method with a spectral preprocessing of first derivative.
Preprints 94143 g015
Figure 16. Regression coefficient for the % of ash content in chip biomass using the MP PLSR 3−range method with a spectral multi−preprocessing combination set of 5, 4, and 4.
Figure 16. Regression coefficient for the % of ash content in chip biomass using the MP PLSR 3−range method with a spectral multi−preprocessing combination set of 5, 4, and 4.
Preprints 94143 g016
Figure 17. Regression coefficient for the % of ash content in ground biomass using Full−PLSR method with a spectral preprocessing of D1.
Figure 17. Regression coefficient for the % of ash content in ground biomass using Full−PLSR method with a spectral preprocessing of D1.
Preprints 94143 g017
Table 1. Statistical data of the proximate analysis parameters of the chip and ground biomass used in the PLSR model development.
Table 1. Statistical data of the proximate analysis parameters of the chip and ground biomass used in the PLSR model development.
Preprints 94143 i001
Table 2. Table 2. Details on the number of identified outliers in chip and ground biomass evaluated before the development of the PLSR model.
Table 2. Table 2. Details on the number of identified outliers in chip and ground biomass evaluated before the development of the PLSR model.
Preprints 94143 i002
Table 3. Results of the PLS regression based model for the proximate analysis (%) of chip biomass, bolded model showing the best performance.
Table 3. Results of the PLS regression based model for the proximate analysis (%) of chip biomass, bolded model showing the best performance.
Preprints 94143 i003
Table 4. Results of the PLS regression based model for the proximate analysis (%) of ground biomass, bolded model showing the best performance.
Table 4. Results of the PLS regression based model for the proximate analysis (%) of ground biomass, bolded model showing the best performance.
Preprints 94143 i004
Table 5. The dominant peaks on the regression coefficient plot obtained from the best−performing PLSR−based model.
Table 5. The dominant peaks on the regression coefficient plot obtained from the best−performing PLSR−based model.
Preprints 94143 i005
Table 6. LOQ results of PLSR models for the % of MC, VM, FC, and ash content in chip and ground biomass.
Table 6. LOQ results of PLSR models for the % of MC, VM, FC, and ash content in chip and ground biomass.
Preprints 94143 i006
Table 7. The range of % of MC, VM, FC, and Ash content of wood and non−wood samples in calibration and validation sets.
Table 7. The range of % of MC, VM, FC, and Ash content of wood and non−wood samples in calibration and validation sets.
Preprints 94143 i007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated