1. Introduction
Methane (CH
4), a gas with a global warming potential 25 times greater than carbon dioxide, has increased in atmospheric composition by a factor of three since preindustrial times largely due to changes in emissions from anthropogenic activities [
1,
2]. Nearly all countries globally have entered treaties or agreements to reduce methane emissions but it has recently been suggested that methods used to quantify methane emissions are insufficiently precise to measure emission reduction of mitigation strategies [
3,
4,
5,
6].
Measuring methane emissions from oil and gas production activities has been the focus of many studies since the 1990s [
7,
8,
9,
10,
11,
12]. Recently, approaches have developed from survey methods, where short-duration measurements are made by instruments mounted on a mobile platform [
13,
14,
15], to continuous monitoring approaches, where measurements are made by instrumentation fixed in location [
6,
16,
17,
18]. The advantage of continuous monitors is that observations are more likely to detect the short-duration, large-emission events that are typical of oil and gas emission distributions and are essential to capture if realistic emissions estimates are to be generated [
19,
20]. Typically, on-site continuous monitoring systems comprise of several (minimum of four) methane measurement/meteorological instrumentation installed at fixed points around the site coupled with a dispersion model to infer a rate of emission [
6,
21,
22]. One difficulty in designing a continuous monitoring system that is to be installed at the fence line of an oil and gas production site is the choice of methane measurement instrumentation. Methane sensing technologies include metal oxide sensors (
$15 per sensor), integrated infra-red sensors (
$300 per sensor), tunable diode laser absorption spectrometers (
$1000 per sensor) and optical cavity instrumentation (>
$10,000 per sensor). As there are thousands of productions sites across the US alone, costly (>
$1,000) methane measurement systems are prohibitively expensive to deploy at every site therefore metal oxide sensors are currently being widely deployed across oil and gas production sites across the US [
6,
22].
As metal oxide sensors are being widely distributed to measure near-background methane concentrations in air (typically below 100 ppm), many studies have described methods of calibration, deployment and response, however, most studies report that these sensors are sensitive to temperature and relative humidity and required individual calibration [
17,
23,
24,
25,
26,
27,
28,
29,
30]. Typical metal oxide sensor models used in these studies are the Taguchi Gas Sensors (TGS) 2600 and 2611 models produced by Figaro Engineering Inc. (Osaka, Japan). TGS2600 uses a tin-dioxide (SnO
2) sensing layer that reacts with the detected gases. The detection mechanism relies on the change in electrical conductivity through the SnO
2 in presence of methane which is a reducing gas. Absorbance of oxygen molecules occurs on the surface of SnO
2 forming O
2- ions leading to electron capture that creates a depletion layer at the surface of the SnO
2 increasing its resistance [
23,
24]. Methane interacts with the adsorbed oxygen ions releasing the trapped electrons back into the SnO
2 conduction band, decreasing its resistance. The change in resistance is proportional to the concentration of CH
4. The TGS2611 has the same operation basis as the TGS2600 but has been optimized for methane detection by the integration of a heating element to maintain optimal temperature for methane detection and features a filter material that selectively permits methane to reach the sensing element while blocking other gases [
23,
24].
Studies using Figaro TGS sensors have measured the resistance change across the metal oxide strip to infer changes in concentration. It is assumed the metal oxide sensor has a relatively constant resistance in clean air (
R0, Ω) which becomes lower in the presence of methane (
Rs, Ω) and the ratio of these resistances gives a measure of the methane mixing ratio in air. The resistance of the metal oxide sensor is also affected by the air temperature (
Ta, °C) and relative humidity (
rH, %) and the ratio of resistance can be corrected to account for these factors using an empirically derived algorithm (Equation (1)) [
25].
The temperature and humidity corrected ratio of
Rs and
R0 is then typically converted to a methane mixing ratio using a sensor-specific calibration algorithm [
25,
30]. As they perform badly in low relative humidity, less than 40% RH [
30], the TGS sensors are typically calibrated by comparing
(Rs/R0)corr to methane mixing ratios measured using a sub-ppm reference instrument measuring methane contemporaneously. An algorithm is then generated to translate
(Rs/R0)corr to a calibrated mixing ratio (
[CH4]cal). Published algorithms have been linear, power functions and exponential functions [
17,
23,
25,
26,
30,
31] and there is not a good current understanding of differences between individual sensors’ response to changing methane concentrations apart from possible differences in manufacturing [
17].
Nested within the conversion from measured data to a calibrated methane mixing ratio are several potential sources of uncertainty: 1. The size of the measured resistance (Rs) appears to be highly variable between individual sensors and through trial and error, we have found that the output of some of the sensors in near background mixing ratios (< 10 ppm)- the methane signal has become almost indistinguishable from the noise; 2. The calculation of [CH4]cal is heavily dependent on determining a value for the TGS resistance in clean air, R0; 3. Given the variability in response of sensors to changes in methane concentration, it seems questionable that all sensors would respond equally to changes in relative humidity and temperature (as defined in Equation (1)) especially in sensors with low signal to noise ratios. Overall, these uncertainties raise concerns about the historical methods for converting signals from TGS methane sensors to a calibrated methane mixing ratio.
One alternative approach could be to use a novel machine learning approach that accounts for low signal to noise ratios and differences in sensor response to relative humidity and temperature changes. Recent developments in machine learning algorithms have transformed sensor calibration and enhanced their reliability in environmental monitoring [
32]. One example of machine learning is the Random Forest (RF) Regressor developed by Leo Breiman in 2001. Random Forest is an ensemble machine learning model [35], which implies that the model is built by the combination of predictions from multiple simpler models [
33]. Random Forest uses a collection of decision trees which are the models that split the data into branches based on the input features to make a prediction. Training the RF regression model involves presenting the model with the input variables (metal oxide sensors’ resistances, temperature and relative humidity) that are passed through multiple decision trees and the output aggregation predict the target variable (in this case the trace gas analyzer CH
4 concentrations).
To investigate this, we will use two TGS sensors with low signal to noise ratio and report on the RF machine learning approach to generating calibrated methane mixing ratios. Specifically, we will 1. Collect Rs values from low-response TGS2600 and TGS2611 sensors contemporaneously with a sub-ppm methane analyzer; 2. Generate calibrations curves using (a) the traditional method (presented above) and (b) RF machine learning approach; and 3. compare the calibrated methane mixing ratios reported by the TGS2600 and TGS2611 to the methane analyzer to generate an understanding of sensor type response, bias and drift over time. Ultimately, we aim to provide evidence to understand if the RF machine learning approach can generate better calibration algorithms than currently used methods.
Author Contributions
EK: Conceptualization, Investigation, Methodology, Supervision, Writing – original draft preparation, review and editing. SNR: Funding Acquisition, Conceptualization, Investigation, Methodology, Supervision, Writing – original draft preparation, review and editing. MM: Investigation, Review and Editing. AU: Review and Editing. AA: Review and Editing. DJZ: Funding Acquisition, Project Administration, Conceptualization, Supervision, Review and editing.