2.1. Data Acquisition
The datasets utilized in this study were retrospectively acquired from a previous QQ-NET study [
51]. The study was approved by the local Institutional Review Board and involved MRI scans of 34 ischemic stroke patients (occurring within a unilateral cerebral artery territory) between 6 hours to 42 days post-stroke. The scans were conducted on a clinical 3T scanner (GE MR Discovery 750) utilizing a 32-channel brain receiver coil. The 3D mGRE imaging protocol was applied, with the following parameters: 0.47×0.47×2.0 mm
3 voxel size, eight equally spaced echoes (TE
1/ΔTE/TE
8 = 4.5/5/39.5 ms), TR= 42.8 ms, bandwidth=244.1 Hz/pixel, 20
o flip angle, and 5 min 15 second scan time. Further, DWI (24 cm FOV, 0.94×0.94×3.2 mm
3 voxel size, 1953.1 Hz/pixel bandwidth, 0, 1000 s/mm
2 b-values, TE=71 ms, TR=3000 ms, and four signal averages), and a T1 weighted fluid attenuated inversion recovery sequence (24 cm FOV, 0.5×0.5×5 mm
3 voxel size, TE=23.4 ms, TR=1750 ms) were used.
To check whether a network trained with simulated stroke datasets can provide reasonable uniform OEF maps in healthy subjects without producing false negatives, MRI scans of four healthy subjects (age 31 ± 6 years) on a 3T GE scanner were also retrospectively obtained from the QQ-NET study [
51]. The MRI scans employed 3D mGRE with imaging parameters that matched those used for the stroke patients.
2.3. Data processing: OEF using multi-echo complex QQ (mcQQ)
In the mcQQ model, a nonlinear formulation is used when integrating the QSM-based and qBOLD-based OEF mapping methods to obtain OEF
where
and
(=0.98) [
40] denotes venous and arterial oxygenation. The mGRE complex signal at the j’th echo with the compensation of the initial phase and background field contribution on phase [
55,
56], (
), can be modeled as Equation 1.
Here,
Larmor frequency,
the dipole kernel, * the convolution operator. In the phase term, the QSM distinguishes the venous blood deoxyhemoglobin’s susceptibility contribution (OEF effect) from the non-blood neural tissue susceptibility (
) on a voxel-wise basis.
where
the fully oxygenated blood susceptibility (-108.3 ppb with tissue hematocrit Hct =0.357) [
30],
the ratio between the venous blood volume (
) and total blood volume (0.77) [
60],
the hemoglobin volume fraction (0.0909 with Hct=0.357) [28,61-63],
the susceptibility difference between deoxy and oxyhemoglobin (12522 ppb) [
29,
64]. The qBOLD models the OEF effect on the mGRE magnitude [
40]:
where
is signal intensity a
;
is the transverse relaxation rate;
[
21];
is the signal decay due to the blood vessel network [
19];
is the characteristic frequency by the susceptibility difference between deoxygenated blood and the surrounding tissue [
40]:
with
=267.51 rad
s
-1T
-1 the gyromagnetic ratio;
the main magnetic field; and
is the macroscopic field effect on mGRE signal [
40].
2.4. Deep neural network for mcQQ (mcQQ-NET)
In order to account for realistic measurement noise (i.e., Gaussian noise in complex mGRE signals), mcQQ-NET introduces two modifications compared to the current QQ deep learning model (QQ-NET) [
51]: changes in network structure and model loss.
Regarding the network structure, mcQQ-NET employs a combination of two Unet-based [
65,
66] sub-networks (
Figure 1). Each sub-network processes either the mGRE magnitude or phase input, whereas QQ-NET uses a single Unet to handle mGRE magnitude and QSM. In detail, for each sub-network of mcQQ-NET, the original U-net was modified to (1) use zero-padding to maintain a uniform convolution layer output size, and (2) set the number of inputs to 8 for each sub-network (comprising 8 echo magnitude and phase signals, respectively) and outputs to 5 for the magnitude sub-network (
) and 3 for the phase sub-network (
). The setting for the numbers of outputs and inputs are based on how magnitude and phase signals can be modeled as functions of these parameters, as described by Equations 2 and 3. Furthermore, two additional layers were integrated: one to combine common outputs (
) and another to apply the tanh function. The use of the tanh function limits model parameters (e.g., min and max) based on physiological expectation for
Y (0~100%) and
v (0.5~5.5%) and CCTV results for the other parameters, mirroring the approach in QQ-NET.
Regarding the model loss, mcQQ-NET has a weighed sum of three losses: (1) L1 difference between the normalized truth and the output of mcQQ (
), (2) L1 difference of
spatial gradient to preserve edge (
), and (3) the model loss to consider physical model consistency
).
and
are identical to those in QQ-NET [
51]. Notably,
is different between mcQQ-NET and QQ-NET: in mcQQ-NET,
, whereas in QQ-NET,
. The total loss (
) is set as
with the weights being empirically determined as
and
.
To ensure a fair comparison between mcQQ-NET and QQ-NET, mcQQ-NET used the same training and testing scheme as QQ-NET [
51] with one exception: noise consideration in data generation. While QQ-NET introduced Gaussian noise into the mGRE magnitude signals and QSM, mcQQ-NET incorporated Gaussian noise into the mGRE complex signals, offering a more realistic approach. In detail, mirroring QQ-NET, mcQQ-NET generated the training data using simulated stroke brains. First, the model parameters (
) were estimated from real 34 stroke patient cases using QQ-CCTV [
37] and used as ground truth. The average (
), standard deviation (
), min, and max were
(1.10, 0.04, 1.04, 2.12), R
2 (19.6, 7.1, 7.3, 161.1 Hz),
(0.67, 0.10, 0.31, 0.98),
(2.3, 1.2, 0.3, 7.2 %), and
(-11.6, 37.5, -957.2, 159.7 ppb).
was set to satisfy that the first echo magnitude signal was unity. Second, from the model parameter maps (ground truth), the mGRE complex signals were simulated for each brain voxel using Equations 1, 2, and 3. Third, Gaussian noise was added to these complex signals to obtain SNR 100 at the first echo, with distinct noise instances for each training. This procedure produced pairs of ground truth (QQ-CCTV results) and simulated measurements (mGRE complex signals) for training. Out of 34 simulated datasets, 26/2/6 was used for training/validation/test, respectively.
mcQQ-NET was implemented using Pytorch 1.13.0 [
67] and NVIDIA RTX A6000 GPU. Minimization was performed using ADAM [
68] with a learning rate of 10
-4. Training was stopped at 400 epochs when the validation loss became stable. Due to GPU memory constraints, batch size was set to 1 with a 4D patch (16×200×200×48) as input, which approximately covers a whole brain. The patch center was randomly positioned within a selected brain, a process repeated for all training brains (1 epoch). Validation was carried out in a manner identical to the training process. For each epoch, the sequence of the training brains was randomly rearranged.
The trained mcQQ-NET was tested with three separate datasets, similar to the original QQ-NET [
51]: Test Data 1) an additionally simulated stroke brain created using the same process as the training datasets (SNR 100) (
Figure 2). To reduce algorithm-dependent bias, the ground truth was set as the average of the QQ-NET and mcQQ-NET results from a real stroke patient (7 days post onset). This reconstruction was repeated five times with distinct instances of Gaussian noise to measure accuracy and precision. Test Data 2) 30 ischemic stroke patients, a subset of the 34 patients devoid of hemorrhage and reperfusion, were divided into three groups based on the time interval between stroke onset and MRI scan [
69]: acute (6-24 hours, N=4), subacute (1-5 days, N=13), and chronic (≥ 5 days, N=13) phase (
Figure 3 and
Figure 4). A five-fold cross-validation was performed to prevent overlap between training and test data [
51]: six real patient brains were selected as test data, while the simulated datasets of the remaining 28 patients were utilized for training (N=26) and validation (N=2) data. This yielded 5 trained networks. The first trained network was used for Test Data 1 and 3. Test Data 3) four healthy subjects scanned with identical imaging parameters (e.g., TE) to those used in training (
Figure 5). The objective was to evaluate whether a network trained on simulated stroke brains could generate uniform OEF maps in healthy brains without introducing noticeable false positives, such as low OEF values typically observed in stroke lesions. During network testing, to ensure full brain coverage, patch sliding with 30% overlap was employed. Multiple overlapped patches were produced and subsequently combined to construct a single whole brain.
For real test data (Test Data 2 and 3), we compensated for the macroscopic field contribution in both magnitude and phase signal inputs. For the magnitude signal input at j’th echo (
), voxel spread function method [
70] was used to estimate
:
where
the mGRE complex. For phase signal input (
), the following steps were taken. First, the total (
) and background (
) fields were spatially unwrapped (
and
) using a region-growing algorithm [
71]. Second, unwrapped phase (
) was calculated using
:
where
is the initial phase at
. Lastly, the phase by tissue field (
) was obtained by compensating the contributions of background field (
) and the initial phase (
):
.
We compared mcQQ-NET with QQ-NET [
51], both of which were tested on the same test datasets. While they used an identical training and testing scheme and data, they differed in in two aspects as mentioned earlier: network structure and model loss.