Boundary segmentation of vessel wall from Doppler OCT intensity image and the segmentation of the inner lumen contour from Doppler OCT phase image are essential for the 3D morphology reconstruction of the vessel. Due to the strong scattering of red blood cells and other medium in vessel, the images suffer from low contrast and blurred boundary especially at the greater depth of the vessel. And the OCT images contain severely speckle noise and random noise, which increases the complexity of the boundary segmentation. Therefore, the algorithms for accurate boundary segmentation are demanded, which should be robust to the OCT noise and very good at the weak boundary segmentation.
There exist many boundary segmentation approaches for OCT vessel images. Earlier studies usually focus on pixel intensity changes, gradient information, Chan-Vese model, graph-based method. The pixel intensity changes along the A-Scan contour were used to achieve image segmentation [
5,
6,
7,
8], while most segmentation methods using intensity information are based on intensity changes of backscattered signals [
9,
10,
11,
12,
13,
14]. Gasca et al. [
15] extracted the boundary in intensity images, but it was generally difficult to extract the weak boundary accurately. Sihan et al. [
16] used Canny filter for edge detection, and then used link step for segmentation. However, due to the fixed threshold, this method has poor segmentation effect on noisy images. Yang et al. [
17] proposed a segmentation algorithm based on dual-scale gradient information, which simultaneously uses the global gradient information and local gradient information to complement each other. And it adopts the shortest path search algorithm to optimize the edge in combination with the local Canny edge detection method, which has high accuracy and repeatability for the segmentation of 3D OCT volume data. However, this method is not robust to noise. Guimaraes et al. [
18] used intensity images to achieve retinal segmentation, and divided different thresholds according to different tissues to achieve retinal layer segmentation. The computational efficiency of this method is very high. But due to the influence of the intensity variation in the layer, the intensity inconsistency causes artifacts in the blood vessels during the imaging process. Most of the segmentation methods based on image intensity information are used for eye image segmentation, which the extraction ability of weak boundary is insufficient. Chan-Vese (C-V) model is a classical region-based geometric active contour model, which can segment images without obvious edges well. The method based on C-V model first needs to set up an initial contour curve, and use the gradient information to get the optimal result. In 2006, Grady applied the random walk method to the field of image segmentation [
19], introducing regularization into the segmentation process. This method can extract the weak boundary of images, but in the weak boundary extraction process of OCT images, the staircase effect will appear, resulting in incorrect segmentation. Roy et al. [
20] used the random walk algorithm and signal attenuation model of OCT imaging process to track the maximum optical backscatter of each A-scan. And then used global gray scale statistics to optimize and achieve image segmentation. Huang et al. [
21] used the Laplacian operator to perform iterative calculation by changing the calculation operator of the random walk, which showed good robustness to the weak boundary of OCT images. The graph-based approach is used to achieve segmentation of retinal in time domain OCT images [
22,
23,
24] and OCT coronary vessel images [
25]. Garvin et al. [
26,
27] also extended this method to spectral OCT images, adding flexible and diverse constraints and greatly improving segmentation accuracy. Nabila et al. [
28] segmented retinal vessels of OCTA with generalized Gauss-Markov-Gibbs random field model and Markov-Gibbs random field model. Ruchir et al. [
29] used the method of 3D map cutting to segment the skin surface layer. In 2022, Mittal et al. proposed to use random walk and interframe flattening algorithm to process OCT images. They used N-ret layer segmentation method to simplify OCT image segmentation and improve the accuracy of OCT image segmentation [
30].Traditional vascular segmentation methods often rely on shallow image features, such as image gray scale and texture features, which have low representativeness and are easy to be disturbed by noise. Moreover, such features rely on designers’ experience and knowledge accumulation to design extraction algorithms. Practice has proved that the deep features are more representative. Machine learning techniques can extract deep features from massive data. In different application scenarios, more representative features can be extracted from the training data and applied to the blood vessel segmentation task, which can not only improve the accuracy of blood vessel segmentation, but also improve the segmentation efficiency. . The sparse representation and discriminant dictionary learning methods were used to vascular segmentation. According to the feature vector of pixels, each pixel is classified into blood vessels and non-blood vessels, so as to achieve the segmentation and extraction of retinal blood vessels [
31,
32,
33]. In recent years, the powerful segmentation ability of deep learning has gradually been widely applied to the segmentation field of medical images. Roy et al. [
34] proposed RelayNet for end-to-end segmentation of retinal layers and fluid masses in ocular OCT images. Fan et al. [
35] used a multi-channel FCN to automatically segment coronary arteries from angiographic images. Hamwood et al. [
36] studied the impact of batch size and network structure on retinal segmentation results. Girish et al. [
37] applied the FCN model to the segmentation of the inner retinal capsule membrane. This plays an important role in ophthalmic diagnosis and quantification of retinal abnormalities. In recent years, many scholars have combined traditional methods with deep learning methods to achieve image segmentation. Fang et al. [
38] combined deep learning with Graph Search to propose a Convolutional Neural Networks-Graph search (CNN-GS) architecture for automated segmentation of 9-layer boundaries of retinal OCT images. Zhang et al. [
39] combined convolutional neural networks with random walk algorithms to segment plaques in OCT coronary artery images. The CFANet method proposed by MaFei et al. has achieved high accuracy in the segmentation of OCTA optic disc and macular region [
40]. Yazan et al. [
41] used SegNet in combination with conditional random fields to segment the lumen and calcified area of OCT coronary arteries, helping to determine the placement of stents during coronary interventional therapy. Ronneberger et al. [
42] proposed full convolutional neural network U-net for the segmentation of neuron structure under electron microscope and the segmentation and extraction of cell contour under light microscope. More recent studies commonly use its variants for layer-wise retina segmentation [
43,
44,
45]. The cascaded U-net architecture was proposed to segment the vessel intensity image and its corresponding phase image for the vessel wall and the blood flow area boundary, respectively [46]. It contains two U-nets, which means that the second segmentation results rely on the first segmentation results. The limitation of this method is that two models are trained and optimized separately, which takes more time to obtain the optimized models. Besides, when integrating the models into the system for online application, it is more complicated to call two models than one.
Due to the strong scattering of blood in the blood vessels, with the increase of imaging depth, the sensitivity of the OCT system decreases. And the signal-to-noise ratio of the images decreases, resulting in the blurring of the lower edge of the blood vessels. At the same time, OCT images contain noise, and noise levels in phase images are generally higher than in intensity images, increasing the difficulty and complexity of accurately segmenting the contour of blood vessel edges. The accurate extraction of the contour of blood vessel wall and blood flow region is particularly crucial for the reconstruction of the three-dimensional structure of blood vessel and the three-dimensional morphology of thrombus.
In this work, we proposed a multi-classification deep learning model to extract the vessel boundary from Doppler OCT intensity images and the lumen area contour from corresponding Doppler OCT phase images simultaneously and automatically. There are two reasons for using the U-net structure in this study. Firstly, the principle of OCT is a fiber optic Michelson interferometer. The interference occurs when the optical path difference between the reference light returned by the mirror and the backscattered light of the measured sample is within the coherence length of the light source. The detector output signal reflects the backscattering intensity of the medium. While the shallow and deep layers of U-Net acquire high-frequency and low-frequency information from the image, respectively, and the information of each level is well preserved through skip connections. Secondly, the scanning mirror can record its spatial position so that the reference light interferes with backscattered light from different depths within the medium. According to the position of the mirror and the corresponding interference signal intensity, the measurement data of different depths (z direction) of the sample can be obtained. U-Net can reconstruct the spatial structure with self-similarity and use the handcrafted image and the structure of the known part to interpolate the unknown region [48].
The pair of intensity image and phase image are preprocessed into one 512×512 grayscale image as the input of the model. The pixel categories of the input image are divided into three categories: background, vessel region and inner lumen area. The average Dice Coefficient (DC) adopted for the quantitative analysis of the segmentation results reaches 96.7%. Based on 250 in-vivo mouse femoral artery images segmentation results, 3D thrombosis morphology and lumen area analysis were adopted for quantitative outcome evaluation.