Many methods have been proposed based on this principle. The methods differ primarily in the decomposition algorithm but also in the method used to denoise the decomposed signal. Therefore, a multitude of variants of this approach will be described below.
2.4.1. Decomposition methods after Fourier decomposition
In the 1940s, the American mathematician Nobert Wiener introduced a new filter called the Wiener filter [
46]. This filter consists of applying a Fourier transform to the signal. The obtained Fourier coefficients are then modified according to the ratio between the expected signal spectrum and the actual spectrum. According to this definition, conventionally, the desired signal is known. However, the desired EMG signal is generally unknown. Several variants of the Wiener filter have therefore been proposed to filter the EMG signal. Among other things, most of these variants consist of using an estimate of the noisy signal as a reference rather than the desired signal.
In 1998, [
17] presented an approach using the Fourier transform to estimate the power spectrum of the noisy signal from a reference signal obtained at the start of the trial. During this period, the electrode is placed on the skin, but the muscle is not contracted. We therefore obtain an estimate of the background noise. This solution makes it possible to consider several noise sources without having to identify them. Unlike time domain subtraction methods, the reduction of noisy waves at several different frequencies is therefore quite simple. To reduce the noise, the signal of interest is therefore also decomposed in order to obtain its power spectrum. Then, the coefficients of the noisy signal are subtracted from those of the measured signal. In this way, an estimate of the power spectrum of the non-noisy signal is obtained. The general scheme of this method is depicted in
Figure 8.
[
61] also reproduced this procedure to reduce contamination caused by electrical stimulation. Unlike [
17], this approach directly uses coefficients of the Fourier transform rather than the power spectrum. In addition, to estimate the coefficients of the noisy signal, the researchers developed a more robust method that involves performing several consecutive Fourier transforms and averaging over the coefficients obtained at different times. This frequency subtraction method was also tested by [
12] to remove ECG from the EMG signal. However, after comparing its performance to other ECG removal methods, they concluded it wasn’t the strongest method.
[
62] designed a comparable solution in which the background noise is acquired in real time via a separated channel. Similar to the ANC, this therefore allows them to obtain an estimate of instantaneous noise rather than using an estimate based on a signal taken at the start of a trial. This method differs from the ANC in that the denoising is performed in the frequency domain rather than the time domain. Another analogous method that uses a dual-adapted FBLSM was introduced by [
63]. Their method employs an adaptive finite impulse response (FIR) filter in the frequency domain.
An algorithm based on the discrete cosine transform (DCT), which is derived from the Fourier transform, was also explored by [
64]. They removed PLI by setting to zero the coefficients of the DCT corresponding to PLI. However, a drawback of this method is that, like the conventional notch filter, it also removes a major part of the EMG signal at this frequency.
[
65] have explored an alternative to this method called the spectrum interpolation, in which the spike in the signal’s power spectrum caused by PLI is first removed. The real spectrum value at 60/50Hz is estimated by interpolating between the next two values in the spectrum. The signal is then reconstructed using the modified coefficients. This procedure allows attenuation of the PLI without removing the EMG signal at 60/50Hz, which is a major accomplishment compared to most methods. A variant of this scheme was also introduced by [
66] who added an Hampel filter in order to detect the spikes in the power spectrum. In the original method presented by [
65], the center frequency had to be known for the method to be automatically performed. However, in some cases PLI harmonics may also be present in the signal. In addition, the center frequency of PLI can also vary slightly. The method proposed by [
66] thus allows more than one spike to be detected at any frequency. This also opens the door to the removal of other interference such as electrical stimulation.
2.4.2. Denoising methods after wavelet decomposition
Wavelet transform (WT) is a decomposition method created as an alternative to the Fourier transform. The method is analogous to the those already presented in the following ways: 1. The signal is decomposed using one of the wavelet transforms, 2. It is then denoised in the wavelet domain, and 3. The signal is subsequently reconstructed by applying the inverse wavelet transform on the modified coefficients. In general, wavelet-based denoising methods are used to reduce WGN. While the general block diagram is similar to the methods based on Fourier transform and cosine transform, the decomposition using the wavelet transform is not as trivial and many parameters must be defined. Therefore, much more research has been conducted on the subject. Moreover, a considerable amount of research has focused on wavelet denoising in the context of WGN removal. Therefore, in most cases, the denoising stage differs from that of the previously mentioned decomposition methods.
Decomposition using wavelet transform: While the Fourier transform decomposes the signal from the time domain into the frequency domain, the wavelet transform generates components (kernels) that are defined in terms of both time and frequency. In this approach, the components are created by translating and dilating a fixed function called the mother wavelet (, thus allowing a multi-dimensional representation of the signal. Unlike the sine/cosine waves used in Fourier transform, the amplitude of the mother wavelet varies across its length. Translating it over the signal thus allows the definition of each component in time.
Wavelet transforms can be categorized into two main types: continuous wavelet transform (CWT) and discrete wavelet transform (DWT). CWT involves calculating the wavelet coefficients at every possible scale. However, this version of WT is highly redundant and computationally time consuming. From a denoising perspective, the aim is to decompose the signal in a way that allows reconstruction of the original signal using a linear combination of the smallest number of components. In the classic CWT, many more coefficients are generated than are actually needed to reconstruct the signal. CWT is thus highly redundant. As stated by [
67], the wavelet functions must be orthogonal in order to meet this criteria. DWT achieves this by restricting the variation in translation and scale to powers of 2. As presented in
Figure 9, DWT can be implemented using Mallat’s algorithm, which uses high- and low-pass filters to separate high- (D) and low-frequency (A) components [
68].
However, in contrast to Fourier transform, the resolution of wavelet transform is not uniform in the time–frequency plane. Indeed, as stated in the Heisenberg problem, larger scales cover larger time frames but smaller frequency frames, while smaller scales expand the frames in frequency and contract them in time. Therefore, at each level of decomposition in classic DWT, the low-frequency component is decomposed into two new components using a high-pass filter. The filter bank resulting from a level 3 DWT is presented in
Figure 9 (A). The coefficients obtained using the level 3 DWT on a measured signal (
) are presented in
Figure 9 (B). The measured signal can then be rewritten in the wavelet domain as a linear combination of the high-frequency coefficients (
) and the last low-frequency coefficient (
) as follows:
where
L is the level of decomposition.
A variant of DWT is wavelet packet transform (WPT) [
69]. The main difference between the two methods is that WPT is more adaptive to the signal. As shown in
Figure 10, it decomposes not only low-frequency components, but also high-frequency components at each level. Using the signal itself, the most useful frequency bands can be selected to match the signal. The signal can then be expressed as any orthogonal combination of components, as shown in grey in
Figure 10.
Another variant of the DWT is the stationary wavelet transform (SWT). This variant was designed to overcome the variance DWT suffers due to time shifts [
70]. SWT is similar to DWT, except that at each level of decomposition, the filters are up-sampled and the signal is never sub-sampled.
In addition to choosing the WT type, the mother wavelet and the level of decomposition must also be defined. According to the literature, it seems that the best mother wavelet has yet to be clearly defined. Many different mother wavelets have been reported in EMG signal denoising. Likewise, the decomposition level also seems to differ across the literature. However, a number of studies have compared mother wavelets and decomposition levels in the context of EMG signal denoising. [
71] tested a total of 53 mother wavelets from the following famillies: Daubechies, Symlets, Coiflet, BiorSplines, ReverseBior, and Discrete Meyer. Their results suggest that the most adapted wavelets in the context of removing WGN from EMG signals are first-order Daubechies, BioSplines, and ReverseBior wavelets (db1, bior1.1, rbio1.1). They also found that various mother wavelets produce relatively good results. However, they did not recommend bior3.1, bior3.3, bior3.5, bior3.7, bior3.9 and dmey. [
72] also investigated the performance of different mother wavelets with two different decomposition levels. They reported better results using the db4 at level 3. These results appear to be consistent with the literature, since most studies have reported using either the db2 [
73,
74,
75,
76,
77,
78,
79] or the db4 [
67,
80,
81,
82,
83,
84]. In terms of optimal decomposition level, level 4 appears to be the most widely used [
5,
19,
71,
77,
82,
85,
86], while the level 3 is also frequently chosen [
25,
72,
87,
88,
89,
90].
Denoising in the wavelet domain: As for all decomposition methods, once decomposition is completed, the signal is denoised in the decomposition domain. While the block diagram of wavelet denoising methods is similar to those based on Fourier transform and cosine transform, a considerable amount of research has focused on wavelet denoising in the context of WGN removal. Therefore, instead of simply subtracting the coefficients obtained with the noisy signal from those of the trial, the denoising stage is usually performed by applying a threshold to each of the coefficients as proposed by [
91]. Thus, this stage is separated into two main steps: 1. threshold selection, and 2. application of the thresholds.
Threshold selection : The threshold selection step is used to determine the threshold value that should be applied to each coefficient of the decomposed signal. There are different algorithms that help to define these thresholds. When WGN is the only noise source considered, Eq.
1 becomes
where
is Gaussian noise N(0,1) and
is an estimate of the noise variance.
In the wavelet domain this equation becomes
In WGN removal, the principle is that the noise level
can be estimated using the signal itself. In the classic method submitted by [
91], the
Universal Threshold is defined as follows:
where
is the threshold,
M is the signal length in the time domain and
is the noise estimate. The noise estimate
is usually obtained using Eq.
18.
where
is the detail coefficient at level
j, which means that a noise estimate is obtained for each coefficient. This noise estimate is referred to as
level dependant (LD). However, two other noise estimates are also used frequently in wavelet denoising [
71]. The noise level can also be estimated using only the first level (FL). In fact, it is well known that close to no EMG signal can be found in the first detail coefficient
. Therefore, this coefficient is often used as an estimate of the overall noise. Since the noise is assumed to be WGN, the noise is equally dispersed over all frequencies, which means that it can easily be estimated from a single coefficient. The last method is said to be
global (GL) and defines
as an estimate of the standard deviation of all the wavelet coefficients [
82].
Since [
91] published their method, several other threshold selection procedures have also been explored. The most popular are the
SURE Threshold,
Hybrid Threshold and
Minimax Threshold [
73].
Application of the thresholds : Once they have been defined, thresholds are used to either remove or shrink the coefficients according to a threshold function. The simplest threshold function is the hard function (HAD), which consists of zeroing all coefficients below their associated thresholds. The HAD function can be stated as follows:
Another popular function is the soft function (SOF), which is an extension of the hard function [
92]. In addition to zeroing the coefficients at values lower than their corresponding thresholds, the other coefficients are also shrunk by subtracting the threshold values from them:
A comparison of the output coefficient obtained according to the input coefficient is presented in
Figure 11 for both the HAD and SOF functions. In addition, other functions such as hyperbolic and non-negative garrote have also been investigated [
75].
Aside from the classic wavelet thresholding method, which consists of applying a threshold to the coefficients, other denoising methods using the wavelet have been reported in the literature. [
93] designed an algorithm based on the WT to reduce ECG artifacts in the signal. Since the ECG signal tends to have the most energy in the combined EMG + ECG signals, these researchers suggest completely removing the largest coefficients. Using this approach, their thresholding function is the complete opposite of the classic HAD function:
Other studies in which contaminants other than WGN have been considered suggest choosing the thresholds manually. For example, [
53] uses a manual thresholding method to remove motion artifacts. Similarly, [
94] applies manual thresholding to the removal of unscaled white noise.
Aside from this, [
74] proposed a method in which only the lower frequency component (
) is ignored in the reconstruction. This allowed the researchers to eliminate baseline wander, which, by definition, is found in the lower frequencies. In contrast, [
67] presented a comparable approach in which only the lower components are used in the reconstruction. It is well known that the ratio of EMG signal versus noise in high-frequency components is low. Therefore, the researchers suggest ignoring the higher frequency components in the reconstruction.
[
77] proposed using a quiet trial to estimate the background noise as reported with the Fourier transform. Using this estimation, they were able to set the thresholds accordingly. A comparable scheme to remove background noise was also reported by [
70].
Reconstruction using the modified coefficients The modified coefficients are then used to reconstruct the signal in the time domain using the inverse wavelet transform (IWT).
2.4.3. Denoising after empirical mode decomposition (EMD)
In 1998, [
95] developed a new decomposition method called the empirical-mode decomposition. Unlike the wavelet approach, EMD does not decompose the signal in terms of basic atoms like mother wavelets, but rather breaks them into a series of intrinsic mode functions (IMFs). These functions are obtained using a procedure called the sifting process. The biggest difference between WT and EMD is that EMD adapts to the signal that needs to be decomposed. As explained previously, with WT, the mother wavelet and the level of decomposition must be defined. This results in a certain number of filters that are fixed. In contrast, with EMD, the IMFs are not fixed and depend on the signal. This allows better adaptation of the decomposition according to the input signal.
In 2006, [
7] used EMD to filter WGN from an EMG signal. Similar to Donoho’s wavelet method, the approach consists of applying a threshold to the coefficients obtained from the decomposition into IMFs and reconstructing the signal with the modified coefficients. However, in this study, the selection of the thresholds was done differently. They performed an EMD on a noise window taken before the trial. The coefficients obtained with the noisy signal were used as thresholds for the trial by applying the SOF function. EMD-based denoising methods have also been performed to remove electrical stimulation [
96] and ECG [
97]. [
98] developed a novel denoising algorithm inspired by wavelet thresholding called interval thresholding (IT), which is performed on the IMFs obtained from the EMD. Other thresholding methods for EMD denoising were also studied by [
99]. Their findings suggest that the an iterative version of the IT yields better denoising results when employed with hard threshold.
Subsequently, [
100] used an extension of EMD called the ensemble EMD (EEMD) developed by [
101]. Without going into detail, in EEMD, noise is added to the signal purposefully in order to improve the decomposition into IMFs. To do this, several EMDs are performed one after the other so that the results can be averaged. The results of this study demonstrated that the EEMD was more efficient in reducing the WGN, PLI and BW than conventional filters and classic EMD, particularly for low SNR. Another denoising algorithm based on EEMD and an improved wavelet threshold was introduced by [
102] to remove random noise.
An improved algorithm derived from the EEMD was also introduced by [
103]. This Complementary EEMD (CEEMD) was designed to reduce the contamination introduced by the added noise. In CEEMD, each noise is added in pairs with plus and minus signs to insure a complete cancellation of the residual noise in the reconstruction step. CEEMD was applied to EMG denoising by [
104] along with an improved version of the interval thresholding (IT) method designed by [
98]. They compared the results obtained with their proposed method (CEEMD-IT) to those of SWT, EMD along, the EMD with IT and the EMD with direct thresholding in the context of WGN removal. The study demonstrated that the CEEMD-IT method achieved better denoising results for most of the tested SNR while retaining more useful information.
A further extension of the EMD method called
BoostEMD was design by [
105].This variant of the EMD method allows for use of higher orders IMFs. The authors conducted a comparative study with the classic EMD for classification accuracy and found their method was more effective. However, they also observe that the nature of their method makes it impossible to extract all the noise while retaining all the useful information in the EMG signal.