3.1. Historical Overview
As mentioned in
Section 2, one paper per year between the years 1990 and 2024 was selected, summing to a total of 35 documents. For five years the search retrieved no results (see
Figure 1) resulting in 30 documents. Unfortunately, the full text of two works was not available. The remaining 28 documents are briefly described below.
The oldest work found to tackle liver segmentation is the one of Bae
et al. (1993) [
19] present a similar sequential image-by-image segmentation technique using a reference image, where the liver occupies a significant portion of the abdomen cross-section. Image processing techniques, including grey-level thresholding, Gaussian smoothing, and connectivity tracking, are employed to extract the liver boundaries. The resulting boundaries are then smoothed using mathematical morphology techniques and B-splines. The study focuses on a living-donor liver transplant program, and the computer-determined boundaries are compared with those drawn by a radiologist, showing agreement within 10% of the calculated areas.
Gao
et al. (1996) [
20] focus on facilitating 3D visualisation for surgical planning. The method employs global histogram analysis, morphologic operations, and a parametrically deformable contour model to delineate the liver boundary. Ten cases were used to validate the approach and promising results were found with minimal operator intervention required.
Soler
et al. (1997) [
21] propose an automatic method for segmenting the portal vein, with the primary objective of achieving accurate segmentation with detailed branching and topological information, facilitating the localisation of liver tumours concerning Couinaud’s anatomical segmentation. The approach involves the initial detection of liver contours using 3D deformable models, followed by limiting the CT images to a liver mask containing hepatic tissue, vascular trees, and potential tumours. Classification of anatomical structures is performed using Gaussian curves fitted to the intensity histogram. The vascular trees and tumours are segmented through a hysteresis thresholding technique based on a distance map, considering the Gaussian parameters. An isotropic image is obtained through shape-based interpolation, and the portal vein is reconstructed using skeletonization, eliminating short branches and correcting errors. Results demonstrate that the algorithm automatically extracts the first three main bifurcations of the portal vein, comparable to manual segmentations.
Yoo
et al. (2000) [
22] focus on the use of pixel ratios. By analysing the grey value range of a normal liver in CT images, a binary image is generated and then processed into four mesh images based on hole ratios to eliminate noise. A template representing the general outline of the liver is generated from the union image of these mesh images and subtracted from the binary image to accurately represent the organ boundary. The pixel ratio, which takes into account the distribution of organ pixels, was used to discriminate between the organ and noise, especially in cases where organs have similar grey value ranges. The proposed method reduced processing time compared to existing methods and was validated against manual segmentation by medical experts.
Pan and Dawant (2001) [
23] introduce a level-set approach, which addresses the challenge of defining appropriate speed functions for contour propagation. A speed function is proposed to stop the propagation of the contour at organ boundaries with weak edges by incorporating accumulative speed based on the path of the contour, enhancing the robustness of segmentation in noisy images. The method also leverages a-priori anatomical information to improve accuracy. Tested on five CT datasets, including cases with abnormal livers, the method demonstrates good agreement with manual delineations.
Saitoh
et al. (2002) [
24] present an automated method for segmenting the liver region from the third phase of abdominal CT scans. The approach involves the extraction of blood vessels using a threshold, followed by morphological dilation to define an approximate liver region useful for removal of adjacent organs. The final liver region is then extracted using a threshold. The method is thus based on mathematical morphology and thresholding techniques, using the unique characteristics of blood vessels to functionally identify the liver region. The experiments performed on eight CT datasets show a good agreement between the automatically and manually detected liver regions.
Masumoto
et al. (2003) [
25] use multislice CT images. The method uses two time-varying images acquired during the contrast medium circulation phase, highlighting the liver region through CT value changes. The proposed scheme involves generating a liver likelihood image by analysing CT value changes and subsequently extracting the liver region while considering the geometric characteristics of blood vessels and tumours. The evaluation, based on Receiver Operating Characteristic (ROC) analyses, demonstrates the superiority of the proposed method over other approaches, especially when using information from both phases.
The scheme proposed by Lim
et al. (2004) [
26] uses a ROI approach to optimise computational efficiency. Morphological filters, incorporating a priori knowledge of liver location and intensity, detect the initial boundary. The algorithm then generates a gradient image using the weighted initial boundary and employs an immersion-based watershed algorithm for segmentation. Post-processing includes region merging based on statistical information to refine the segmentation.
Liu
et al. (2005) [
27] present a Gradient Vector Flow (GVF) snake-based method for the semiautomatic segmentation of liver volumes in contrast-enhanced CT images. The algorithm follows a stepwise approach, starting with the computation of an initial edge map using the Canny edge detector and the estimation of a liver template. The edge map is then modified to suppress edges within the liver using the liver template, and a concavity removal algorithm is applied to refine the liver boundary. The GVF field is computed based on the modified edge map, and the initial liver contour is determined by considering the candidate initial contour and the computed GVF field. The final liver contour is obtained by deforming the initial contour using the snake. The method is evaluated on 20 contrast-enhanced volumetric liver images, and the results are compared with a radiologist’s manual delineation. The median difference ratio between the computer-generated results and manual results is 5.3%, with a range of 2.9% to 7.6%.
A three-stage approach is used by Lim
et al. (2006) [
28]. The first stage involves image simplification as preprocessing, where a ROI is identified and thresholds are determined using multilevel thresholding. The second stage detects a search range using multiscale morphological filtering, region-labelling, and partition clustering. The third stage uses a contour-based segmentation approach with a labelling-based search algorithm to refine the initial liver boundary. The effectiveness of the algorithm is demonstrated through experimental results on contrast-enhanced abdominal CT images, with an average segmentation accuracy of 96%. Volume measurement is performed based on the segmented liver regions, with an average error rate of 3%.
Beichel
et al. (2007) [
29] introduce a two-step process. First, an initial segmentation is generated using graph cuts, overcoming challenges such as the high variability in liver shape and grey-value appearance. Second, an interactive refinement step is introduced, allowing users to correct segmentation errors in a 3D environment. The refinement is facilitated by a hybrid desktop/virtual reality (VR) user interface. This approach is demonstrated on ten contrast-enhanced liver CT scans, demonstrating robustness to variations in patient data. The results also indicate improved segmentation quality with low interaction times.
The authors Massoptier and Casciaro (2008) [
30] present a fully automated method that uses a statistical model-based approach to distinguish liver tissue from other abdominal organs. An active contour technique using gradient vector flow is used for smoother segmentation of the liver surface segmentation. Automatic classification is performed to isolate hepatic lesions from liver parenchyma. The method is evaluated on 21 datasets and demonstrates robust and efficient liver and lesion segmentations close to the ground truth, with an average processing time of 11.4 seconds per 512x512-pixel slice. Volume overlap for liver surface segmentation is 94.2%, and accuracy is 3.7 mm. Tumour detection achieved a sensitivity and specificity of 82.6% and 87.5%, respectively.
Heimann
et al. (2009) [
31] focus on the comparison and evaluation of different methods. The image data, acquired from different CT scanners, consisted of contrast-dye-enhanced scans showing pathological conditions like tumours and cysts. Radiology experts manually delineated the liver contours in transversal slices to create reference segmentations. A total of 40 images were divided into training and test sets for algorithm evaluation. Evaluation measures included volumetric overlap, relative volume difference, and surface distances. Fully automated and interactive segmentation methods were employed, with the former showing discernible performance differences. The best performing automated approaches used statistical shape models. Interactive methods achieved higher scores with more user interaction. A combined approach using majority voting from the best performing methods outperformed individual automated and interactive results.
A three-step procedure is outlined by Akram
et al. (2010) [
32]. Firstly, a pre-processing step involves converting the image to greyscale and applying a 3x3 median filter to reduce noise. The second step focuses on liver segmentation, with a global threshold and morphological operations to obtain the final segmented liver region. Finally, post-processing steps include adaptive histogram equalisation, Gaussian smoothing, and grey-level transformations to enhance the segmented liver region. Experimental tests on 100 CT images demonstrate the accuracy of the proposed method by comparing automated segmentation results with manually segmented images by hepatologists and oncologists.
The approach of Oliveira
et al. (2011) [
33] involves a sequence of four steps. First, the liver is segmented using level sets with parameters optimised by a genetic algorithm (GA). A Gaussian fit is employed to define the speed image for level set propagation. Secondly, vessels and nodules are segmented using a Gaussian mixture model, focusing on adipose nodules. A region-growing method with information from the Gaussian model is applied. Thirdly, vessels are classified into portal vein or hepatic vein using a vein tracking method. Finally, a geometric approach based on the identified veins is used to segment the liver into different Couinaud regions. The liver segmentation is based on the assumption that the liver parenchyma homogeneity and veins being mainly inside the liver. The parameters are estimated using a GA, and fitness evaluation involves comparing the segmentation with a reference using five disparity metrics. The proposed method shows good performance, ranking among the top methods in the MICCAI-SLiver07 conference evaluation.
The method developed by Linguraru
et al. (2012) [
34] uses a robust parameterisation of 3-D surfaces for point-to-point correspondence overcoming challenges such as inconsistent contrast enhancement and imaging artefacts. A shape descriptor that is invariant, invariant under rotation and scale is used to compare local shape features of organs. An initial liver segmentation is refined using a shape-driven geodesic active contour, and hepatic tumours are detected and segmented using graph cuts and support vector machines (SVMs). The technique is evaluated on a dataset of 101 CT scans and shows improvements in liver segmentation accuracy, particularly in cases with large tumours and segmentation errors. Furthermore, the method identifies liver tumours with a low rate of false positives.
Li
et al. (2013) [
35] discuss a method that makes use of fuzzy clustering and level set techniques. The fuzzy c-Mean (FCM) clustering algorithm is employed, which assigns pixels to different categories based on fuzzy memberships, considering both grey level intensity and spatial information. The FCM algorithm is iteratively optimised by minimising a cost function, allowing for the fuzziness of the resulting partition. To overcome the limitations of standard FCM, a spatial FCM algorithm is introduced that incorporates spatial information into fuzzy membership functions. The paper also introduces the level set method, a continuous deformable model for segmentation. A distance regularised level set evolution (DRLSE) is proposed to address reinitialisation issues and improve efficiency. The proposed method is evaluated using accuracy, sensitivity, and specificity metrics and demonstrates high performance in liver segmentation, especially in cases with unclear boundaries.
Platero
et al. (2014) [
36] integrate a multi-atlas segmentation approach with graph cuts. The method includes several steps: (1) obtain an initial solution using low-level operations to define the ROI around the liver; (2) construct a fast probabilistic atlas for the ROI and compute a coarse binary segmentation using segmentation-affine registration; (3) rank the atlases based on segmentation similarity and propagating selected atlases to the target image; (4) improve segmentation accuracy through label fusion, minimising a discrete energy function; (5) evaluate the approach using a public liver segmentation database. The experimental results show high accuracy, competitive with human expert segmentation.
Artificial Bee Colony (ABC) optimisation is used by Mostafa
et al. (2015) [
37]. Their algorithm use ABC to cluster different intensity values in abdominal CT images, followed by mathematical morphological operations to manipulate and separate the clusters. The process eliminates small and thin regions, such as flesh regions or organ edges. The extracted regions form an initial estimate of the liver area, which is further enhanced using a region-growing technique. The proposed approach demonstrates a segmentation accuracy of 93.73% on a test dataset of 38 CT images, taken in the pre-contrast phase.
A 3D Deeply Supervised Network (DSN) is introduced by Dou
et al. (2016) [
38]. The proposed architecture consists of 11 layers, including 6 convolutional layers, 2 max-pooling layers, 2 deconvolution layers, and 1 softmax layer. The network is designed in a 3D format to effectively capture spatial information. The 3D DSN employs deep supervision via additional deconvolutional layers to counteract vanishing gradients, thus improving the training process. The learning objective is to minimise per-voxel-wise binary classification errors, with deep supervision injected at specific layers. The MICCAI-SLiver07 dataset is used for evaluation, demonstrating that the 3D DSN has a faster convergence and lower errors when compared to traditional 3D Convolutional Neuronal Networks (CNNs).
Christ
et al. (2017) [
14] propose a Cascaded Fully CNN (CFCN) on CT slices that sequentially segments the liver and lesions. First, various preprocessing steps, including Hounsfield unit windowing and contrast enhancement are applied. Then, the cascaded approach involving two U-Net architectures is used for liver and lesion segmentation. Finally, 3D conditional random fields (CRF) are used to refine the segmentation results. Generalisation and scalability to different modalities and real-life datasets, including a diffusion-weighted Magnetic Resonance Imaging (MRI) dataset and a large multi-centre CT dataset, are shown.
Hiraman (2018) [
39] presents a slice alignment method that addresses the challenges through optimal threshold selection, skeletonization, and Enhanced Correlation Coefficient (ECC) alignment. Next, a CNN-based liver region of interest detection method is proposed to classify 2D slices for focused processing.
The study presented by Wang
et al. (2019) [
40] investigates the application of a Generalised CNN for automated liver segmentation and biometry using cross-sectional data from abdominal CT and MRI scans. The retrospective study included a sample of 563 abdominal scans from 530 adults, covering different imaging modalities. The CNN was initially trained on 300 unenhanced multiecho 2D SPGR MRI sets and then subjected to transfer learning for generalisation across different imaging methods. The accuracy of the CNN was evaluated using internal and external validation datasets. The study also investigates the impact of training data size on segmentation accuracy and explores the feasibility of using automated liver segmentation for volumetry and hepatic PDFF quantification.
Almotairi
et al. (2020) [
41] explore the application of the SegNet architecture. The proposed modified SegNet model uses the VGG-16 network as an encoder. Tests were performed on a standard dataset for liver CT scans (3D-IRCADb01 [
42]) and achieved tumour accuracy of up to 99.9% in the training phase and 86% for tumour identification.
Ayalew
et al. (2021) [
43] present a modified U-Net architecture and introduce a new class balancing method. To address the class imbalance between liver and tumour, a weighting factor is applied and slices without tumour are removed during data preparation. The U-Net based network architecture includes batch normalisation, dropout layers, and filter size reduction. Training involves tuning hyperparameters, such as learning rate and batch size. The datasets used are derived from the 3D-IRCADb01 [
42] and LiTS [
44] databases and the results achieve Dice Similarity Coefficient (DSC) of 0.96 and 0.74, respectively. The algorithm also introduces a novel approach for direct tumour segmentation from abdominal CT scan images, with comparable performance to existing two-step methods.
The study of Scicluna (2022) [
45] is motivated by challenges such as the Combined Healthy Abdominal Organ Segmentation (CHAOS) Challenge [
46], which focuses on healthy abdominal organs. The study focuses on replicating the v16pUNet1.1C model, which demonstrated superior performance in Task 2 of the CHAOS Challenge. Results from the v16pUNet1.1C model are presented and compared with variations in the loss function and scaling transformation. The application of a 3D largest-connected-component filter is discussed, showing improvements in mean scores.
A deep semantic segmentation CNN is used by Ezzat
et al. (2023) [
47]. A three-stage architecture is proposed, including pre-processing with data augmentation, deep CNN training, and testing. The CNN-based semantic segmentation model is shown to be robust, achieving a test accuracy of 98.8%. The approach does not require user input, making it accessible to non-experts.
Shao
et al. (2024) [
48] present the Attention Connect Network (AC-Net) for liver tumour segmentation in CT and MRI images. The AC-Net consists of two main modules: the Axial Attention Module (AAM) and the Vision Transformer Module (VTM). The AAM uses an axial attention mechanism to merge features of matching dimensions, maximising the use of spatial features extracted by a CNN. The VTM processes high-level semantic features extracted by the CNN using a methodology similar to Vision Transformers (ViT) [
49]. The network achieves a DSC of 0.90, a Jaccard Coefficient (JC) of 0.82, a recall of 0.92, a precision of 0.89, a Hausdorff Distance (HD) of 11.96, and an Average Symmetric Surface Distance (ASD) of 4.59.
3.2. Other Review Papers
The search described in
Section 2 retrieved six literature review documents. For one of the works, however, the full document was not available. The remaining five are briefly presented next.
A comparative analysis of various available techniques, focusing on their advantages and disadvantages, is given in [
50]. Recognising the challenges posed by the variable shape of the liver and the weak edges in adjacent organ regions, the survey covers approaches such as Threshold, Model, Level Set, Region, Active Contour, and Clustering. The paper also divides its investigation into sections, covering both image pre-processing and segmentation techniques, providing an overview of the current landscape in liver segmentation from CT images.
The study [
51] provides a survey of 3D image segmentation methods, focusing on selected binarization and segmentation techniques suitable for processing volume images. For thresholding methods, both global and local techniques are considered, and challenges such as hysteresis in dealing with voxel value distributions are addressed. The region growing section explores voxel-based procedures, including growing by grey value and adaptive region growing. In addition, deformable surfaces and level set methods are discussed, before other segmentation concepts such as fuzzy connectedness and watershed algorithms are introduced. The concluding remarks underline the complexity of image segmentation, emphasising the absence of a universal solution and the need to carefully evaluate and select methods based on specific tasks and dataset characteristics. The challenges posed by 3D data, including the data volume and issues of interactivity and visualisation, are also acknowledged.
The study [
52] reviews and proposes a literature survey on methods for segmenting liver images, distinguishing between semi-automatic and fully automated techniques. The challenges of liver image segmentation, such as low contrast, blurred edges, and the complexity of liver morphology, are discussed. Different approaches are reviewed, including neuronal network based methods, support vector machine based methods, clustering based methods and hybrid methods. It is concluded that, despite progress, liver image segmentation remains a challenging task, and the paper encourages further development of hybrid approaches for more accurate segmentation.
Various segmentation methods, including statistical shape models, probabilistic atlas-based approaches, geometric deformable models, and machine learning-based methods, are reviewed in [
53]. The review includes information on avaliable databases and challenges in liver tumour segmentation, highlighting the scarcity of public datasets and the need for improved segmentation methods. Liver blood vessel segmentation and computer-assisted diagnosis (CAD) systems are also reviewed. The conclusion highlights the importance of the segmentation, particularly in pathological cases, and the need for improved CAD systems with accurate segmentation for comprehensive analysis of liver treatment.
The survey paper [
54] provides a comparative analysis of various available techniques, focusing on their advantages and disadvantages. Grey level-based techniques, such as region growing and active contour methods, are highlighted as effective for liver segmentation. The survey acknowledges the challenges of detecting early-phase liver lesions and emphasises the need for a combination of methods to achieve seamless segmentation, with region growing and active contour methods considered more efficient than other segmentation techniques.
This survey differs from the other documents in this section in a number of ways. Firstly, the most recent of the review papers found dates from 2022. One of the contributions of this work is to present a more up-to-date view of the works published since then. In addition, none of the other works presents a historical perspective on the subject, starting from 1990, as is the case with the present review.