1. Introduction
Conventional Electroencephalography (EEG) is an invaluable tool for assessing neurological disorders, with measurements performed in controlled clinical environments, utilizing full cap systems over the scalp with wet electrodes that provide low impedance for quality measurements with high temporal resolution. However, researchers and clinicians have been interested in measuring EEG outside the laboratory for better assessing neurological disorders [
1].
The development of technology has led to the evolution of ambulatory EEG, resulting in research on smaller EEG measurement options. This has given rise to a new category of wearable EEG devices that are wireless, have aesthetic designs limited to only the area around the head, and use dry electrodes. The convenience of these devices enables individuals to incorporate EEG measurement into their everyday lives, making it easier to assess certain medical conditions such as epilepsy and sleep disorders and allowing for Brain-Computer Interface (BCI) applications outside of a laboratory environment. While these types of devices are commercially available and their capabilities acceptable for some current applications, the overall design of these wearables still constitutes a barrier to daily usage, being uncomfortable over long periods of time and making it obvious when a person is utilizing such an EEG device [
2,
3].
Quoting Looney et al., the next generation of wearable devices must be “discreet, unobtrusive, robust, user friendly and feasible”. An in-ear-EEG approach checks all these boxes, trading the wide coverage across the scalp with an inconspicuous EEG recording ability, based on the ear [
4]. The ability to acquire reliable brain recordings from inside the human ear is critical to accelerating the development of next-generation EEG-enabled earbuds. While ear-EEG is one of the best candidates for consumer BCI, being referred to as “beyond wearable” [
2], real-world brain recordings are affected by multiple environmental factors that are not present in the laboratory. A systematic, well-established characterization of in-ear-EEG hardware and signal quality is crucial to understand the limitations and applications of newly developed devices.
When characterizing a novel EEG system, it is important to be able to evaluate all components of the system. This includes the biological neural signals, derived from validated EEG paradigms like the alpha block or the various Event-Related Potentials (ERPs); and the hardware and signal chain components (electrical and mechanical), which require testing in an controlled setting through a test-bench or phantom model [
5].
In many areas of research and instrumentation, phantoms are utilized for testing, validating, and calibrating acquisition systems, namely in medical imaging like Positron Emission Tomography (PET) and Magnetic Resonance Imaging (MRI) scans where standardized phantoms are available [
6,
7]. In EEG, a standardized phantom does not exist, although many prototypes from different materials and designs have been proposed.
A common approach used by research groups to test novel EEG systems or new types of electrodes consists of using salt-doped ballistic gelatin (BG) head models onto which scalp EEG sensors can be mounted for testing [
8,
9]. Nonetheless, phantoms can also be made from different organic (e.g., agar) or synthetic (e.g., carbon-doped thermoplastics) materials [
10,
11]. The basic principle for an EEG phantom prototype is to obtain a conductive physical model with the shape of the human head. This is usually achieved through image processing of anatomical scans (i.e., MRI or CT) and 3D printing of the resulting phantom’s cast for molding [
12].
An EEG phantom allows for the playback of a previously recorded known EEG signal. This recognizable signal is referred to as the “ground truth”. Then, measurements can be done on the phantom for comparison with this known signal, which constitutes a way of characterizing the acquisition system without the need to account for the inherent variability and lack of repeatability of neural signals present when recording from human subjects, as well as identifying external sources of noise [
13,
14]. For a known previously recorded signal to be played out of the phantom, electrode antennas are driven inside it. These can be simple screws, exposed wire tips, or a coaxial cable to create dipoles [
11,
15,
16]. The driven antennas can be either attached to the interior side of the scalp layer in hollow-shaped phantoms or are put in place during a phase of assembly where the constitution of the filling material allows for this procedure [
11,
17]. Additionally, EEG phantoms allow for a controlled analysis of electrode contact impedance measurements and signal noise floor characteristics [
17,
18].
Despite the existence and use of EEG phantoms for evaluating scalp based EEG systems, there is a lack of literature for ear-EEG validation via appropriate phantom models. To the best of our knowledge, appropriate EEG phantoms for this purpose do not exist and all previous EEG phantoms neglect the structures of the outer ear and ear canal systematically.
The feasibility of measuring brain signals through ear-EEG devices has been validated in recent publications [
4,
19,
20]. For an in depth review of the field and overview of the technological state-of-the-art around ear-EEG we please refer to [
21,
22]. A commonality across the ear-EEG literature, is that the validation paradigms utilized tend to be known ERP paradigms, namely the alpha modulation (or alpha blocking) paradigm, the Auditory Steady-State Response (ASSR), the Steady-State Visual Evoked Response (SSVEP), Auditory Evoked Potentials (AEPs), Visual Evoked Potentials (VEPs) and oddball type paradigms to elicit responses linked to higher processes of the brain, like the P300 and Mismatch Negativity (MMN) responses. Similarly, the interest of utilizing electro-oculography (EOG) measured by ear-EEG as a possible input for BCI applications as also been explored [
20].
Given the predominance of these select group of paradigms utilized to derive ear-EEG responses, and the lack of a dedicated phantom model suitable for ear-based systems, here we propose a complete ear-EEG validation toolkit to contribute and expand to the characterization of this technology through:
A software framework ("EaR-P Lab") that allows the user to readily make a validation test battery for the characterization of ear-EEG devices at the neural signal acquisition level.
The design and prototyping of an ear-EEG suitable physical phantom for systematic characterization of in-ear sensors, allowing controlled comparison of fit form factors for ear-EEG acquisition.
This contribution to the field aims to allow for a more reliable assessment of out-of-the-box ear-EEG devices by providing proper benchmarking tools for comparing systems.
4. Discussion
Ear-level EEG devices are an active field of research and development with potential to disrupt the way in which brain activity can be factored into daily life. We can expect the increase of new applications, form factors and sensor technology being developed as the field grows. Currently, there is scientific evidence to support the use of ear-EEG [
21,
22] and we set out to provide a set of tools that could benefit further testing and characterization of ear-EEG devices. Our approach was to develop a validation toolkit that would allow the characterization of ear-EEG devices from hardware to neural signal acquisition.
We developed an EEG acquisition application based on nine commonly utilized EEG paradigms in the field. Our validation results show that this implementation of EaR-P Lab can indeed be used to elicit the expected neural responses from scalp EEG recordings. When evaluating these responses from the perspective a near-ear electrode location like T8, we observed that the VEP responses were the most affected, indicating that ear-EEG may have a limitation on the type of measurable paradigms.
We also developed an EEG phantom suitable for evaluating ear-EEG sensors by combining 3D scans of a participant’s ear impressions and 3D printing a mould to be filled with conductive material. This allowed us to assess ear-EEG devices in terms of their electrode contact impedance, measured noise floor, and acquisition of a known signal. To the best of our knowledge this is the first EEG Phantom dedicated for ear-level devices, as traditional head phantoms neglect the structures of the ear needed for this evaluation [
8].
Our phantom development highlighted salt-doped agar as the best material substrate for this purpose, with conductivity values equivalent to those reported in literature for whole brain anatomy at 0.33 S/m [
37,
38]. While salt-doped ballistic gelatin and carbon fibre doped silicone were also investigated as potential material substrates, there were notable limitations with these options.
Ballistic gelatin showed a drastic change on signal transmission integrity across a week of testing. This could be related to the storage conditions, highlighting the susceptibility of the phantom to the effect of time and the environment, which could hinder the repeatability of results when evaluating ear sensors. While agar is also an organic and perishable material, it showed a more stable material performance despite being kept under the same storage conditions as the BG phantom. To address the susceptibility to degradation over time, we investigated the use a synthetic non-perishable material composition for the phantom by doping platinum cured silicone with carbon fibres. This approach, however, did not scale from the prepared samples for conductivity testing to the larger ear-EEG phantom.
The mechanism by which carbon fibres turn silicone into a conductive medium is different to that of salt-doped agar or BG. In the latter, salt fully dissolves in the medium creating a homogeneous conductive substrate, while in the former, carbon fibres disperse in the silicone creating conductive paths through heterogeneously mixed fibres. When the samples were small strips, this dispersion of carbon fibres was effective throughout the volume, however when a larger volume was used, the stirring method did not achieve dispersion of fibres over the full volume. This led to patches of non-conductive silicone, clearly visible in the final product. A different mixing approach should be defined in the future to ensure that the CF phantom is conductive throughout. It should be noted however that the conductivity of this synthetic substrate is orders of magnitude larger than that of real anatomical structure and the agar/BG substrates.
We piloted the use of these tools through the evaluation of a third-party ear-EEG sensors in a small feasibility study. Our results showed that the ear-EEG sensors were functional and EEG responses were successfully recorded when using a scalp reference (Cz and T8). However, in-ear or cross-ear references only showed significant EEG responses for steady state paradigms and alpha blocking. Further investigation should be done on the reference choice for these ERP paradigms, as it has been shown that this is crucial for obtaining characteristic ERP responses [
39] . However, this was not the scope of this feasibility study. Importantly, we could see clear EOG responses within-ear and cross ear reference configuration, promising for further ear-based BCI applications.
We have also shown how the data obtained with the ear-EEG phantom, without the need of a testing subject, shows feasibility on selecting optimal electrode locations in the ear and can be tied with the actual ear-EEG recordings of the same tested subject to improve the SNR of the ASSR.
Next steps with this tool include a better executed testing protocol and the assessment of a generic ear canal for the testing and comparison of different earpiece designs. Our ear-EEG phantom methodology should also be considered in the future for the controlled study of how real-world factors, like gait, affect ear-EEG [
40].
The presented ear-EEG validation Toolkit is available to the scientific community, via the Open Science Framework (OSF) repository named: "Brain Wearables: Validation toolkit for Ear Level EEG sensors" (
https://osf.io/2dxs4/). All future modifications to the Ear-P Lab and the ear-EEG phantom generated by the authors will also be updated in the mentioned repository.
Figure 1.
EaR-P Lab - structure and main attributes.
Figure 1.
EaR-P Lab - structure and main attributes.
Figure 2.
EaR-P Lab - functioning framework.
Figure 2.
EaR-P Lab - functioning framework.
Figure 3.
EaR-P Lab - Main Menu.
Figure 3.
EaR-P Lab - Main Menu.
Figure 4.
Escalating latency when recording multiple Event-Related Potential (ERP) blocks on the same file, exemplified for auditory stimuli - a similar effect happens for visual stimuli.
Figure 4.
Escalating latency when recording multiple Event-Related Potential (ERP) blocks on the same file, exemplified for auditory stimuli - a similar effect happens for visual stimuli.
Figure 5.
The cascading effect is nullified when recording multiple ERP blocks in different files after restarting data streaming, exemplified for auditory stimuli - a similar effect happens for visual stimuli.
Figure 5.
The cascading effect is nullified when recording multiple ERP blocks in different files after restarting data streaming, exemplified for auditory stimuli - a similar effect happens for visual stimuli.
Figure 6.
EEG acquisition setup schematic and equipment - a) USB Audio Interface TASCAM US-100 b) - Digital-Analog Converter (DAC) Amplifier FiiO Alpen 2 c) - ER2 Etymotic tubal insert research grade earphones
Figure 6.
EEG acquisition setup schematic and equipment - a) USB Audio Interface TASCAM US-100 b) - Digital-Analog Converter (DAC) Amplifier FiiO Alpen 2 c) - ER2 Etymotic tubal insert research grade earphones
Figure 7.
Alpha block grand average spectrogram over the control group at Oz (Cz referenced) - the bottom plot shows the alpha power mean power (8 Hz to 12 Hz) per section while the left vertical plot shows the frequency response between the two conditions.
Figure 7.
Alpha block grand average spectrogram over the control group at Oz (Cz referenced) - the bottom plot shows the alpha power mean power (8 Hz to 12 Hz) per section while the left vertical plot shows the frequency response between the two conditions.
Figure 8.
Grand average Auditory Steady-State Response (ASSR) (dB) to a 40 Hz frequency modulated stimuli, at (Left) P4 and (Right) T8. Statistically significant peaks are highlighted by the green star token, based on an f-test (p < 0.05).
Figure 8.
Grand average Auditory Steady-State Response (ASSR) (dB) to a 40 Hz frequency modulated stimuli, at (Left) P4 and (Right) T8. Statistically significant peaks are highlighted by the green star token, based on an f-test (p < 0.05).
Figure 9.
Grand average Steady-State Visual Evoked Potential (SSVEP) (dB) to a 10 Hz visual stimuli, at (Left) Oz and (Right) T8. Statistically significant peaks are highlighted by the green star token, based on an f-test (p < 0.05). Only the first harmonic was statistically evaluated.
Figure 9.
Grand average Steady-State Visual Evoked Potential (SSVEP) (dB) to a 10 Hz visual stimuli, at (Left) Oz and (Right) T8. Statistically significant peaks are highlighted by the green star token, based on an f-test (p < 0.05). Only the first harmonic was statistically evaluated.
Figure 10.
Grand average Auditory Evoked Potential (AEP) waveform at T8. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 10.
Grand average Auditory Evoked Potential (AEP) waveform at T8. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 11.
Grand average Visual Evoked Potential (VEP) waveform at (Left) Oz and (Right) T8. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 11.
Grand average Visual Evoked Potential (VEP) waveform at (Left) Oz and (Right) T8. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 12.
Grand average Mismatch Negativity (MMN) waveform at (Left) Pz and (Right) T8. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 12.
Grand average Mismatch Negativity (MMN) waveform at (Left) Pz and (Right) T8. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 13.
Grand average P300 waveform at (Left) P4 and (Right) T8. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 13.
Grand average P300 waveform at (Left) P4 and (Right) T8. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 14.
Soft/hard blink amplitudes, for an exemplary subject, at (Left) F3 and (Right) T8.
Figure 14.
Soft/hard blink amplitudes, for an exemplary subject, at (Left) F3 and (Right) T8.
Figure 15.
Grand average saccade profiles on the four cardinal directions, at (Left) T7 and (Right) T8.
Figure 15.
Grand average saccade profiles on the four cardinal directions, at (Left) T7 and (Right) T8.
Figure 16.
(Left) Closed render of the ear-EEG phantom. (Right) Exploded render of the ear-EEG phantom.
Figure 16.
(Left) Closed render of the ear-EEG phantom. (Right) Exploded render of the ear-EEG phantom.
Figure 17.
Outer ear scans (blue - left ear, red - right ear) from a subject, received as .stl files, shown from two perspectives, obtained by an expert audiologist.
Figure 17.
Outer ear scans (blue - left ear, red - right ear) from a subject, received as .stl files, shown from two perspectives, obtained by an expert audiologist.
Figure 18.
Example of a left ear scan being centered and orientated with the phantom’s lid mesh and different views of the alignment and depth of the ear mesh and the lid mesh into a single rendered object.
Figure 18.
Example of a left ear scan being centered and orientated with the phantom’s lid mesh and different views of the alignment and depth of the ear mesh and the lid mesh into a single rendered object.
Figure 19.
Disassembled ear-EEG phantom - bottom half (in yellow), top half (in white, similar to yellow), and two lids with a left and right ear imprint from one of the subjects.
Figure 19.
Disassembled ear-EEG phantom - bottom half (in yellow), top half (in white, similar to yellow), and two lids with a left and right ear imprint from one of the subjects.
Figure 20.
Ear-EEG phantom assembly - antennas and railing fittings were sealed with tape
Figure 20.
Ear-EEG phantom assembly - antennas and railing fittings were sealed with tape
Figure 21.
(Left) Agar ear-EEG phantom. (Right) Ballistic Gelatin (BG) ear-EEG phantom.
Figure 21.
(Left) Agar ear-EEG phantom. (Right) Ballistic Gelatin (BG) ear-EEG phantom.
Figure 22.
Silicone doped with carbon fibers ear-EEG phantom - the lack of conductive homogeneity is highlighted on the right, with conductive and non-conductive zones being visible.
Figure 22.
Silicone doped with carbon fibers ear-EEG phantom - the lack of conductive homogeneity is highlighted on the right, with conductive and non-conductive zones being visible.
Figure 24.
Contact impedance measurements (kOhm) measured on an agar ear-EEG phantom, in the second day of testing, for wet and dry electrode conditions. Values marked with an * surpassed the value of 50 in the graph, for the respective unit.
Figure 24.
Contact impedance measurements (kOhm) measured on an agar ear-EEG phantom, in the second day of testing, for wet and dry electrode conditions. Values marked with an * surpassed the value of 50 in the graph, for the respective unit.
Figure 25.
Noise floor measurements (µVrms) measured on an agar ear-EEG phantom, in the second day of testing, for wet and dry electrode conditions. Values marked with an * surpassed the value of 50 in the graph, for the respective unit.
Figure 25.
Noise floor measurements (µVrms) measured on an agar ear-EEG phantom, in the second day of testing, for wet and dry electrode conditions. Values marked with an * surpassed the value of 50 in the graph, for the respective unit.
Figure 26.
Ear-EEG alpha wave (10 Hz input) simulation frequency response (dB), for agar and BG, in dry and wet electrode settings, at the ER8 electrode.
Figure 26.
Ear-EEG alpha wave (10 Hz input) simulation frequency response (dB), for agar and BG, in dry and wet electrode settings, at the ER8 electrode.
Figure 27.
(Left) Internal side of the tested earbuds. (Right) External side of the tested earbuds.
Figure 27.
(Left) Internal side of the tested earbuds. (Right) External side of the tested earbuds.
Figure 28.
Ear-EEG scalp and ears acquisition configuration showing the approximate position of ear electrodes, with colors and numbers matching those in
Figure 27 - referencing for ear electrodes is provided in the legend, for example, "Ex1" is used to label ear electrode 1 (black), where "x" is replaced by
L or
R for left or right ear, respectively.
Figure 28.
Ear-EEG scalp and ears acquisition configuration showing the approximate position of ear electrodes, with colors and numbers matching those in
Figure 27 - referencing for ear electrodes is provided in the legend, for example, "Ex1" is used to label ear electrode 1 (black), where "x" is replaced by
L or
R for left or right ear, respectively.
Figure 29.
(Left) Ear-EEG head setup - side view. (Right) Ear-EEG head setup - posterior view.
Figure 29.
(Left) Ear-EEG head setup - side view. (Right) Ear-EEG head setup - posterior view.
Figure 30.
Grande average alpha block modulation (dB) for the wet ear-EEG recordings for the different referencing setups. Omitted results are not significant based on a t-test (p < 0.05).
Figure 30.
Grande average alpha block modulation (dB) for the wet ear-EEG recordings for the different referencing setups. Omitted results are not significant based on a t-test (p < 0.05).
Figure 31.
Grand average ASSR (dB) to a 40 Hz frequency modulated auditory stimuli for the wet ear-EEG recordings for the different referencing setups. Omitted results are not significant based on an f-test (p < 0.05).
Figure 31.
Grand average ASSR (dB) to a 40 Hz frequency modulated auditory stimuli for the wet ear-EEG recordings for the different referencing setups. Omitted results are not significant based on an f-test (p < 0.05).
Figure 32.
Grand average SSVEP (dB) to a 10 Hz visual stimuli for the wet ear-EEG recordings for the different referencing setups. Omitted results are not significant based on an f-test (p < 0.05).
Figure 32.
Grand average SSVEP (dB) to a 10 Hz visual stimuli for the wet ear-EEG recordings for the different referencing setups. Omitted results are not significant based on an f-test (p < 0.05).
Figure 33.
Grand average wet ear-EEG AEP waveform at ER8 for a (Left) Cz and a (Right) T8 reference. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 33.
Grand average wet ear-EEG AEP waveform at ER8 for a (Left) Cz and a (Right) T8 reference. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 34.
Grand average wet ear-EEG AEP waveform at EL8 for an ER3 reference. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 34.
Grand average wet ear-EEG AEP waveform at EL8 for an ER3 reference. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 35.
Grand average wet ear-EEG VEP waveform at ER8 for a (Left) Cz and a (Right) T8 reference. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 35.
Grand average wet ear-EEG VEP waveform at ER8 for a (Left) Cz and a (Right) T8 reference. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 38.
Grand average wet ear-EEG P300 waveform at ER8 for a (Left) Cz and a (Right) T8 reference. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 38.
Grand average wet ear-EEG P300 waveform at ER8 for a (Left) Cz and a (Right) T8 reference. Statistically significant segments are highlighted in green, based on a t-test (p < 0.05).
Figure 39.
Grand average ear-EEG soft/hard blink ratio, for the different ear-EEG reference configurations.
Figure 39.
Grand average ear-EEG soft/hard blink ratio, for the different ear-EEG reference configurations.
Figure 40.
Grand average wet ear-EEG saccade profiles on the four cardinal directions at ER8 for a (Left) Cz and a (Right) T8 reference.
Figure 40.
Grand average wet ear-EEG saccade profiles on the four cardinal directions at ER8 for a (Left) Cz and a (Right) T8 reference.
Figure 41.
Grand average wet ear-EEG saccade profiles on the four cardinal directions at ER8 for a (Left) within-ear and a (Right) between-ears reference.
Figure 41.
Grand average wet ear-EEG saccade profiles on the four cardinal directions at ER8 for a (Left) within-ear and a (Right) between-ears reference.
Figure 42.
Reassessment of dry electrodes ear-EEG ASSR data for the subject utilised to make the ear-EEG phantom. Data was rereferenced from ER3 (original reference) to ER4 (better electrode proposed by the phantom) resulting in a dB increase for this response.
Figure 42.
Reassessment of dry electrodes ear-EEG ASSR data for the subject utilised to make the ear-EEG phantom. Data was rereferenced from ER3 (original reference) to ER4 (better electrode proposed by the phantom) resulting in a dB increase for this response.
Table 1.
Electrical conductivity, in Siemens per meter, for samples of agar, BG and silicone doped with carbon fiber (CF) (1%) as the proposed materials for the ear-EEG phantom.
Table 1.
Electrical conductivity, in Siemens per meter, for samples of agar, BG and silicone doped with carbon fiber (CF) (1%) as the proposed materials for the ear-EEG phantom.
|
Conductivity [S/m] |
Agar |
0.309 |
BG |
0.918 |
CF (1%) |
14.035 |
Table 2.
Measured mass (g) of each ear-EEG phantom over the testing days, on agar and BG.
Table 2.
Measured mass (g) of each ear-EEG phantom over the testing days, on agar and BG.
|
Day 1 |
Day 2 |
Day 3 |
Day 4 |
Agar |
855 |
851 |
850 |
845 |
BG |
963 |
959 |
958 |
956 |
Table 3.
Measured signal amplitude (mV) at the sides of each ear-EEG phantom, directly through the material, as a measure of signal integrity, on agar and BG.
Table 3.
Measured signal amplitude (mV) at the sides of each ear-EEG phantom, directly through the material, as a measure of signal integrity, on agar and BG.
|
Day 1 |
Day 2 |
Day 3 |
Day 4 |
Agar |
44 |
40 |
40 |
36 |
BG |
80 |
52 |
52 |
40 |