Two main approaches have been used to understand the mechanisms underlying the reduced response to sounds during sleep: evoked potentials (evoked “responses”) and evoked unit activity. Evoked responses are usually collected using scalp EEG (or magnetoencephalography) recordings or with invasive, deep intracortical local field potential (LFP) recordings. These studies consistently found that cortical auditory evoked potentials look similar in wake and REM sleep, but the amplitude of the late components is larger in NREM sleep, likely reflecting the bistability (ON/OFF firing) of cortical neurons during this sleep phase [
9,
10]. However, evoked responses reflect mainly the synaptic input, not the firing output, which is why the analysis of unit activity is critical to study sensory disconnection. In general, the studies that measured the firing evoked by a sound found little differences between sleep and wake in subcortical regions and primary cortical areas. For instance, in guinea pigs, the majority of neurons in primary auditory cortex (A1) show similar evoked firing activity in waking and NREM sleep, and the remaining neurons are split evenly between increased or decreased firing in sleep relative to wake [
11]. In marmosets, the analysis of evoked unit firing in A1 and higher order auditory cortex (mostly supragranular layers) showed that some neurons fire more in sleep than in waking, others do the opposite, with no obvious depressive effect of sleep [
12]. In the same study the response in NREM sleep was a poor predictor of the response in REM sleep, with a third of all units showing opposite responses in the two phases [
12], suggesting that the mechanisms of auditory disconnection may differ between NREM sleep and REM sleep. A follow-up study in the supragranular layers of marmoset A1 found reduced unit firing in response to weak stimuli during NREM sleep [
13]. However, louder sounds evoked similar unit responses in wake and sleep despite being unable to cause behavioral arousal, leaving the question of why the acoustic arousal threshold is increased in sleep unresolved [
13]. Several studies that recorded from rat A1 also concluded that the firing evoked by different sounds does not fundamentally differ in wake, NREM sleep, and REM sleep [
14,
15,
16]. On the other hand, it was recently found that most neurons in the perirhinal cortex, a higher order area, respond less to sounds in REM sleep, and even less so in NREM sleep, relative to waking [
16]. A crucial insight provided by this study was that the reduced response in NREM sleep does not depend on the cortical area per se (high order vs. A1) but on the latency of the response, which likely reflects the position of the responding neurons within layers: early (<20msec) responding neurons, which accounted for most of the recorded units in A1, were unaffected, while the few A1 neurons that were late responding (>40msec) showed a reduced response in NREM sleep. Consistently, most recorded neurons in perirhinal cortex were late responding and responded less in NREM sleep, while the few early responding neurons were unaffected. Interestingly, a late response did not predict a reduced response in REM sleep, pointing again to different mechanisms for disconnection in the two sleep phases [
16]. A recent study in epileptic patients implanted with depth electrodes in the lateral temporal lobe found that sounds evoked a moderately decreased spiking response in NREM sleep relative to wake, especially outside A1. A moderate decrease in evoked firing was also present in REM sleep [
17]. Relative to waking, however, in both sleep phases the most significant change was the attenuation of “alpha/beta desynchronization”, i.e., the large decrease in power in the alpha and beta frequencies triggered by the sound during waking was strongly attenuated in sleep [
17]. Another recent study found that whether they cause arousal from sleep or not, sounds evoke the same strong calcium response in the mouse primary auditory thalamus (ventral medial geniculate nucleus, vMGN) that projects to A1 [
18], consistent with previous evidence that sounds reach the input layer of A1 equally well in waking and NREM sleep. Moreover, optogenetic excitation of vMGN neurons did not cause arousal from sleep and their optogenetic inhibition did not change the probability of sound-induced arousals. By contrast, in the “high order” posterior intralaminar thalamic nucleus, sounds or blue light applied during NREM sleep induced a large calcium response only when they caused arousal. Furthermore, optogenetic inhibition of this nucleus, or of its dense projections to the temporal association cortex, strongly reduced the probability to wake up from sounds or blue light [
18]. While evoked responses were not recorded during REM sleep, the optogenetic stimulation of the posterior intralaminar nucleus woke up the mouse from NREM sleep but not from REM sleep [
18], again suggesting that sensory disconnection in REM sleep may be mediated differently than in NREM sleep.
Together, these studies show that sounds reach the input layer of A1 equally well in waking and NREM sleep, but their transmission to supragranular and infragranular layers, as well as the feedforward transmission to higher order cortices, are compromised. During NREM sleep this effect likely depends on the occurrence of OFF periods, which impair corticocortical communication [
19], an hypothesis that is consistent with the fact that higher SWA leads to higher auditory arousal threshold [
5,
6]. Other mechanisms likely exist, however, because during stage 2 of NREM sleep or REM sleep, when slow waves are rare or spatially restricted [
20], subjects still respond only 50% of the time [
3]. One proposed mechanism is the impairment of the feedback from higher to lower cortical areas, consistent with the reduced alpha/beta desynchronization caused by sounds in sleeping humans [
17]. Of note, although many recent studies point to cortical “gates” located after the input layer, or gates in the “high order” thalamus, one early study found that during sleep neurons in the midbrain reticular formation and deep layers of the superior colliculus respond less to sounds and other stimuli [
21]. Intriguingly, all units recorded in REM sleep showed a reduced response independent of latency, while during NREM sleep the affected neurons were those with a late response [
21], akin to what seen in cortex.
In summary, a prominent mechanism for partial auditory disconnection in NREM sleep is in place after the input layer of A1 but other mechanisms in “high order” thalamus also are involved, and a brainstem gate may exist for REM sleep.