Preprint
Article

Automated Cervical Cancer Screening Using Single-Cell Segmentation and Deep Learning: Enhanced Performance with Liquid-Based Cytology

Submitted:

05 October 2024

Posted:

07 October 2024

You are already at the latest version

A peer-reviewed article of this preprint also exists.

Abstract
Cervical cancer (CC) remains a significant health issue, especially in low- and middle-income countries (LMICs). While Pap smears are the standard screening method, they have limitations like low sensitivity and subjective interpretation. Liquid-based cytology (LBC) offers improvements but still relies on manual analysis. This study explored the potential of deep learning (DL) for automated cervical cell classification using both Pap smears and LBC samples. A novel image segmentation algorithm was employed to extract single-cell patches for training a ResNet-50 model.The model trained on LBC images achieved remarkably high sensitivity (0.998), specificity (0.997), and accuracy (0.997), outperforming previous CNN models. However, the Pap smear dataset model achieved significantly lower performance (0.694 sensitivity, 0.838 specificity, 0.834 accuracy). This suggests that noisy and poor cell definition in Pap smears pose challenges for automated classification, whereas LBC provides better classifiable cells patches. These findings demonstrate the potential of AI-powered cervical cell classification for improving CC screening, particularly with LBC. The high accuracy and efficiency of DL models combined with effective segmentation can contribute to earlier detection and more timely intervention. Future research should focus on implementing explainable AI models to increase clinician trust and facilitate the adoption of AI-assisted CC screening in LMICs.
Keywords: 
Subject: 
Computer Science and Mathematics  -   Artificial Intelligence and Machine Learning

1. Introduction

Cervical cancer (CC) markedly affects the mortality rate of females worldwide [1]. This type of cancer is associated with a greater number of cancer-related deaths per year than breast cancer [2]. Various scientific studies have shown the existence of an inequality in the incidence of CC between different parts of the world [3,4] as shown by the fact that this it is the most common type of cancer and the leading cause of mortality from cancer in Latin America [5,7,8]. The incidence rates of CC have been steadily increasing over the past years [8].
CC is 99% linked to the human papillomavirus (HPV) [9,11,12]. CC is an almost asymptomatic disease in its first two stages, which are the only two stages in which it is possible to treat the disease effectively [6,7]. For this reason, the early detection and management of early lesions are essential [12]. However, over the past years, the mortality rate associated with CC has remained high in all Latin American countries, reflecting the inefficiency of programs and screening techniques [3].
The early and accurate detection of CC remains a challenge in developing countries and CC continues to be a major health concern [6,13,14]. The costs of diagnosis, treatment and control are among the highest in medicine; thus, this disease is considered catastrophic both collectively and institutionally [14].
The diagnosis of CC depends on initial screening with Papanicolaou (Pap) smear cytology [3]. The Pap smear consists of collecting cervical cells and examining them under a microscope to identify abnormalities [4]. To date, It has been demonstrated that this type of screening has an efficacy of 70% in reducing the mortality rate of associated with CC and its sensitivity ranges from 50 to 75% [15,16]. The Pap smear has a low sensitivity for pre-neoplastic events, such as grade 1 cervical intraepithelial neoplasia (CINI), and for the same reason, numerous clinicians decide to perform a colposcopy with biopsy following a positive HPV test [3]. The Pap smear has several limitations that undermine its performance: High human capital and time requirements, since it requires cytotechnologists or trained pathologists to manually review numerous slides. This technique is subjective and inconsistent, as different observers may have different interpretations about the cellular material. It is very prone to human error, such as misclassification, false negatives, false positives, or omissions. It has a low sensitivity and specificity, since subtle or rare abnormalities are usually omitted or it can confuse some benign conditions with malignant ones [4].
In recent years, the improved detection of CC has been observed in developed countries owing to the implementation of liquid-based cytology (LBC). This is a novel method of the preservation and handling of cytological samples that can replace the traditional Pap smear, overcoming its limitations [16,17,18]. In LBC, the sample is transferred to the fixative fluid, which increases the cytological detection of squamous intraepithelial lesions and reduces the number of unsatisfactory smears [13]. In LBC, cells are filtered and transferred to the slide in a thin layer containing only one cell level, considered representative of the entire sample (monolayer). This facilitates the analysis of the sample compared to the conventional Pap test [19], improving the quality and interpretation of the slides, reducing the number of false negatives and inadequate samples, due to fixation defects and masking of cellularity by excess blood, mucus, inflammatory cells or other artifacts [13,20]. Thus, LBC outperforms traditional the Pap smear in terms of representativeness, sample fixation and smear quality.
In the quest to optimize the early detection of CC, several researchers have evaluated different methods of cervical cell segmentation and computational algorithms based on artificial intelligence (AI) to automatically analyze cytology images and thus improve CC detection [21,22,23]. AI techniques help automate the analysis of cytological images to classify them into healthy or abnormal categories, as well as to detect the severity and types of lesions [24]. AI techniques also provide quantitative and objective results that are consistent and reproducible, and are promising in terms of specificity, sensitivity and diagnostic accuracy [25]. The use of AI in the screening of cervical lesions has potential advantages, such as greater precision, less time-consuming, a decreased need for human capital and the absence of biases due to analytical subjectivity [26]. AI algorithms can process and analyze large sets of images, identifying patterns and inconsistencies that could be invisible to the human eye. In addition, AI-facilitated screening could decrease the need for invasive interventions, reducing patient discomfort and also improving outcomes [23].
In this context, AI is put forward as a method that can help improve CC screening, contributing to and improving medical diagnoses, which will undoubtedly lead to a significant impact on society [27,28]. The aim of the present study was to evaluate the performance of a deep learning ResNet 50 model for cervical cell classification from Pap smear and liquid cytology analysis to improve the early detection of CC. To this task, the segmentation capacity of a preprocessing algorithm to obtain single-cell images that allow the training of deep learning models of diagnostic classification was designed and studied, and the diagnostic performance of this system was compared with a model trained with liquid cytology images from an external repository.

2. Materials and Methods

2.1. Traditional Pap Smear and Liquid Cytology Image Acquisition

For the purposes of the present study, a public dataset of PAP smear images from Center for recognition and inspection of Cells from patients aged between 35-65 years, and with diagnoses classified according to the Bethesda system (https://database.cric.com.br/classification) was used for the segmentation procedure. . Liquid cytology images were obtained from the public online repository https://data.mendeley.com/datasets/zddtpgzv63/4 used by Sompawong *et al* [22].

2.2. Cell Segmentation Algorithm

Subsampled images were processed using ImageJ 1.53 software (https://imagej.net/ij/) to obtain single cell patches to train de classification models. The theoretical steps of the algorithm attached in annex 1 are detailed as follows: i) Linear intensity transformation was applied to improve contrast; ii) median filter with a kernel size of 9x9 was applied to eliminate cell noise and debris; iii) default sharpen filter to highlight cell edges; iv) RGB rendering and split to HSB (hue, saturation, brightness) color space; v) 3x copies of saturation channel to apply thresholding methods for binarization (Minimal, Shanbag and Otsu); vi) Watershed on binary results of segmentations then adding segments to ROI (region of interest) manager; vii) using the segments in the ROI manager, the cropping was made in the original RGB image and each of them was saved with the sample identifier and a sequential number; viii) finally, with the tumor cell ROIs generated in Qupath (https://qupath.github.io/), automatic labeling was carried out where 1 corresponded to cells after the tumor and 0 to the healthy cell.

2.3. Malignant Cell Classification AI-Model Based in ResNet 50 Architecture

ResNet50 architecture corresponds to a convolutional neural network (CNN) developed by Microsoft in 2015 and can support very deep neuronal structures with optimal training performance for image classification. ResNet50, named for its 50 layers, is structured in ‘blocks’ that cleverly utilize ‘skip connections’ or ‘residual connections’. Instead of learning only from the previous layer, these connections allow the network to learn from earlier layer outputs as well. This bypasses the vanishing gradient problem often found in very deep networks, enabling efficient training and better feature extraction for improved accuracy in image classification tasks [29,30]. It was programmed in Python 3.10 using TensorFlow an SciKit Learn libraries (links), using four main code blocks: i) Convolution layers to detect the local features of the image (such as more complex edges, textures and patterns); ii) pooling layers used to reduce dimensionality and, at the same time, preserve the most important characteristics of the image; iii) rectified linear unit non-linear activation functions, to introduce non-linearities into the network and allow the network to learn more complex representations; and iv) fully connected layers, responsible for performing the final classification. All input single-cell images were transformed to tensors of 48x48 in dimension and divided and randomized in 70:30 proportion for train:test using train-test-split of SciKit. In the model configuration phase, three hyper-parameters were used: i) Optimizer = ‘Adam’ (with a learning rate of 1x10-5); ii) loss function: Binary cross-entropy (loss= ’binary-crossentropy’); and iii) the AUC metric used for the evaluation of the model in training and validation. Finally, in the training phase of the model, the following hyper-parameters were used: Bach-size = 512 and number of epochs: 100. At least 20 trainings of each of the algorithms were performed, varying the random state of the training:test data to find the optimal possible performance.

2.4. Statistical Analysis

2.4.1. Descriptive

Through Python libraries for the management of dataframes, arrays and graphs, the main descriptors of the data were obtained depending on whether the variable was numerical or categorical. Means, medians, deviations and quartiles were obtained for continuous and discrete variables and the tables of frequencies of these were categorized. Frequencies of all variables were obtained, and distribution graphs were prepared to guide the following steps.

2.4.2. Inferential

Primarily using the Python libraries for statistics and graphs, data normality was determined using Shapiro-Wilk and Kolmogorov-Smirnov tests. Based on the results, either parametric or non-parametric statistical analyses were conducted. Non-parametric inference methods employed were the Chi-squared for one sample, and the Mann-Whitney or Wilcoxon tests for two samples, depending on their independence or pairing. Their parametric counterparts corresponded to ANOVA, t-test, z-test, and LSD multiple comparison tests..

2.4.3. Diagnostic Performance Metrics

Diagnostic performance was assessed using confusion matrices, generated with Python’s scikit-learn library, to quantify true positives, false positives, true negatives and false negatives. From these matrices, metrics including precision, accuracy, sensitivity, specificity, and positive and negative predictive values were calculated. Subsequently, sensitivity, specificity and receiver operating characteristic (ROC) analyses were performed. The latter involved generating comparative performance graphs, calculating area under the curve (AUC) values, and conducting DeLong tests for statistical comparisons between techniques.

3. Results

The segmentation in the sub-images generated from the original slide-scan revealed that the application of multiple binarization thresholds in the purified cell signal, achieves in the best of cases, the selection of poorly homogeneous groups in most sectors of the slide, and the detection of large cell masses in some areas, and in very few events, single-cell segmentation (Figure 1A). Due to the above, the single-cell patches generated in each of the sub-images are in most cases noisy with a low-definition of the cell edges, which do not allow the cell chromatin patterns to be clearly shown (Figure 1B). By comparing the results generated in the Pap images with the segmentations of the liquid cytology (LCyt), it can be qualitatively appreciated how the generated patches clearly have single, well-defined, noise-free cells with evident nuclear patterns (Figure 1C).
The single-cell patches generated in the Pap and LCyt datasets were used to simultaneously train a ResNet50 deep network-based classification model. Training with 9566 Pap patches, of which 3386 were labeled as malignant cell-positive, revealed medium performance, but a tendency to over-adjust when observing the change in the AUC and loss metrics through 100 epochs (Figure 2A, validation loss increase). Likewise, when looking at the confusion matrix of the binary classification, the significant presence of false negative cases can be observed (predicted = 0, true = 1) and false positives (predicted = 1, true = 0, Figure 2B).
From the confusion matrix, the final performance metrics detailed in Table 1 were obtained. In these metrics, the high level of accuracy and specificity of the model stands out (0.8341 and 0.8386), although with a low level of sensitivity (0.6939).
Using the same architecture and hyperparameters of CNN-ResNet50, classifier training was evaluated, this time with single-cell patches from liquid cytology segmentation (LCyt set). Initially, the training was carried out with 50 epochs (data not shown) and as the metrics were close to convergence, the number of iterations was extended to 100 to determine whether the behavior of the model remained stable. As shown in Figure 3A, the AUC and loss metrics throughout the ages have a more stable behavior than the Pap model and with a very low tendency to overfit, that is evidenced by the similarity between the training set and the validation set. In addition, the confusion matrix clearly demonstrated the low incidence of false negative and false positive cases in relation to true positives (Figure 3B).
From the confusion matrix, the final performance metrics detailed in Table 2 were obtained. In these metrics, the high level of accuracy and specificity of the model stands out (0.998 and 0.997), also with high level of precision and sensitivity (0.998 and 0.999). In addition, Table 2 presents the comparison of the two CNN models published with the same dataset of images by Sompawong *et al* [22] and Chen *et al* [31], where all the ResNet50-LCyt metrics are superior.
All figures and tables should be cited in the main text as Figure 1, Table 1, etc.

4. Discussion

The aim of the present study was to evaluate the performance of a deep learning (DL) ResNet 50 model for cervical cell classification using Pap and LBC samples to improve the early detection of CC. The DL model used herein was trained in a binary cell classification, which is essential for the early detection of CC; identifying patients with lesions that will evolve into CC can help determine an adequate treatment strategy and may thus prevent cancer development.
CC [1] remains one of the leading causes of cancer-related mortality in lower-income countries, despite being a highly preventable pathology. In Latin America and the Caribbean, CC is the third most common type of cancer, and HPV infection is present in >99% of cases with a worse prognosis [32]. For decades, the standard method for detecting cervical lesions has been the Pap test [33]. However, screening programs based on this technique in lower-income countries have rarely been successful in reducing the mortality and incidence rates of CC due to the lack of high coverage and the associated high analytical complexity [34]. Traditional cytology begins with the collection of the sample, the analytical phase is reported, followed by colposcopy when a biopsy is necessary, and finally, the treatment of the detected lesions [35]. Along with the lengthy and costly Pap smear process [26], in the majority of cases, conventional smears are difficult to interpret due to the uneven distribution and overlap of cells, and the presence of blood or inflammation [36]. In addition, the test has a high rate of false negatives; 50% of preneoplastic lesions of the cervix are missed with a single test [37]. The occurrence of false-negative reports depends on the morphological quality of the cells and on the abnormal cells being present in the sample in a recognizable form. This is found to be difficult in the Pap smear due to its multilayer pattern distribution. As a result, a number of women with CC have a history of one or more negative cervical cytology reports when they are actually carriers of high-grade lesions. [38]. In addition, the interobserver reproducibility of cervical cytology is very inaccurate, as previously demonstrated by Stoler and Schiffman [39], in a study analyzing the reproducibility of 4,948 monolayer cytologic interpretations. Of the 1,473 original interpretations of atypical squamous cells (ASC-US), the second reviewer only concurred in 43.0% [39].
The low sensitivity of the Pap smear requires repeating the test multiple times over the years for it to be effective. This is very costly and not affordable in the majority of Latin American countries. In this scenario, it is essential to develop new methods with which to detect CC earlier with low-cost and high-accuracy automated screening technologies [40]. The alternative to the Pap smear is LBC, which makes immediate fixation easier and leaves the cells better visualized. LBC allows for a monolayer spreading where the majority of the debris, blood and exudate is removed [38]. The benefits of these liquid-based methods include decreased obscuring materials and hemorrhage on the slide, a decrease in cellular misrepresentation, and an even cell distribution on the slide [35]. In general, conventional screening processes with brightfield microscopes use a high human resource, and in this context, the structure of the cells to be observed is complex, since the nucleus and cytoplasm are difficult to identify due to overlapping cellular areas and undefined boundaries between neighboring cells. However, this issue can be resolved with novel computer strategies that can classify images automatically and rapidly [38].
AI has been progressively applied in recent years for the diagnosis of various pathologies with successful results when the evidence is imaging [41]. AI models can automatically recognize key features of images and can learn how to classify and process data using efficient algorithms [41,42]. Based on the above, the application of AI in the screening and early diagnosis of CHD is very useful for overcoming the current challenges of the technique [26]. Machine learning in AI is based on several computational models, and one of these is the CNN, which is mainly used for image processing and computer vision tasks. Within this, there are also different architectures, such as Visual Geometry Group 16 (hereinafter VGG16) [43], Residual Network 50 (hereinafter ResNet50) [44], and Mobile Network (hereinafter MobileNet) [45]. Previous research has shown that the CNN ResNet50 is the most effective compared with VGG16 and MobileNet; thus, this is considered the optimal CNN architecture [46]. In addition, recent studies have demonstrated that DL models are robust against changes in the aspect ratio of cervical cells on cytological imaging [26,47].
As previously stated, the early detection of cervical cancer is crucial for improving treatment and patient survival [1,35]. AI models, specifically DL, has emerged as a promising tool for the early detection of this disease. The present study investigated the performance of the ResNet 50 DL model for classifying cervical cells in both conventional Pap smear and LBC images, with the aim of improving early cervical cancer detection [22,48,50]. The findings presented herein suggest that the ResNet 50 model exhibits exceptional accuracy in classifying cervical cells from LBC images, achieving a near-perfect sensitivity, specificity, precision, accuracy and F1-score. This result indicates that the model is highly capable of accurately distinguishing between benign and malignant cells when trained on LBC images. However, the study also highlights the inherent challenges of using traditional Pap images for CC detection using AI [16,50]. The precise segmentation of individual cells from Pap images is hampered by cells overlapping, excessive staining and the presence of blood clots and artifacts. These factors make it difficult to accurately identify cell boundaries, leading to errors in cell classification [21,51]. This is reflected in the performance of the ResNet 50 model with Pap images, which, while achieving good accuracy and specificity, had a significantly lower sensitivity in detecting malignant cells. Specifically, when trained with Pap images, the ResNet 50 model achieved a sensitivity of only 69.39%, while achieving a specificity of 83.86% and accuracy of 83.4. This suggests that the model was able to detect 1351 out of 1511 healthy cells, but only 943 out of 1316 malignant cells. By contrast, when trained with LBC images, the ResNet 50 model demonstrated a much higher sensitivity of 99.8%, along with a specificity and accuracy reaching 99.7%. This indicates that the model was able to detect 3,493 out of 3,503 healthy cells and 4,169 out of 4,174 malignant cells. It is important to consider that the performance of the ResNet 50 model was negatively affected by the imbalance in the Pap dataset, with a significantly larger number of benign cell images compared to malignant ones. This imbalance makes it difficult for the model to learn discriminative patterns for identifying malignant cells. Addressing this imbalance in future research is crucial for improving the detection of malignant cells using Pap images.
LBC offers a significant advantage over Pap images due to its ability to better isolate and distribute cells, resulting in cleaner images with less overlapping and fewer artifacts [16,17,20]. The segmentation artifacts in the traditional Pap smear are difficult to optimize, considering the complexity of the segmentation algorithm applied, which unlike a U-Net type algorithm for segmentation, has customized settings based on the specific nature of the sample [52], also considering the low observational quality of the traditional cytology sample, which is not designed for the analysis of single-layer cells, such as liquid cytology [38]. Despite the above, we believe that it could be possible to obtain a better performance in the segmentation procedure, through the direct use of a neural network of the U-Net or Attention-Unet type that then works directly with the classification model, an edge that would be interesting to address and contrast in future studies with our cohorts [21,22].
LBC facilitates the precise segmentation of individual cells and therefore, improves the accuracy of cell classification. Despite the higher cost associated with LBC tests compared to Pap tests, the present study highlights the importance of adopting LBC to improve the accuracy of cervical cell classification and ultimately improve the early detection of CC [53]. When comparing the results of the present study to those of previous research using both Pap and LBC images, it was found that the ResNet 50 model used herein achieved comparable or superior performance in terms of accuracy and specificity compared to previous studies using Pap images [22,31]. However, previous research using Pap images generally achieved a higher sensitivity, specificity and F1 score, suggesting that the performance of ResNet 50 with Pap images may be limited by the inherent challenges of cell segmentation aforementioned. On the other hand, when compared to studies using LBC images, the ResNet 50 model consistently outperformed the reported results across all performance metrics, including sensitivity, specificity, accuracy and F1 score [44,54,55]. This highlights the significant advantage of using LBC images for DL-based cervical cell classification, underscoring the potential of the ResNet 50 model as a powerful tool for improving early detection when combined with LBC.
While the present study highlights promising results with LBC images, certain limitations should be acknowledged. The imbalance of the Pap dataset, with a significantly larger number of benign cell images compared to malignant ones, negatively affected the performance of the model. To address this imbalance, future research is required to focus on techniques, such as data augmentation, using the focal loss function and resampling techniques to improve class balance. On the other hand, the model is only limited to its use in cytology for the investigation of CC; although this is a pathology of high relevance in health systems, it is also important to note that cytopathological studies are also frequently used in the investigation and diagnosis of other tumor pathologies, such as tumors of the renal and urinary tracts, head and neck tumors and airway tumors. The aforementioned is commonly known in pathology services as miscellaneous cytology and in this clinical scenario, of course, the use of segmentation and classification methods by AI has begun to be explored with successful results [56,57]. Despite these limitations, the findings of the present study strongly suggest that the implementation of DL models, particularly with the use of LBC, has immense potential to revolutionize CC screening. The ability of DL models to rapidly analyze massive datasets and identify subtle patterns of abnormality, surpassing human capabilities, presents a significant advantage in combating this disease. This is particularly crucial in public health systems facing a shortage of qualified personnel for analyzing cervical cytology samples.
In conclusion, the present study demonstrates that the ResNet 50 model, when trained on LBC images, is a powerful tool for the accurate classification of cervical cells and has the potential to improve the early detection of CC. The use of LBC offers significant advantages over Pap smear images, and the continued development of DL models, along with strategies to address data imbalance, has the potential to transform CC detection and improve health outcomes for patients.

Author Contributions

• Main design of the proposal and research question: Mariangel Rodríguez, Claudio Córdova and Sebastián San Martín. • Collection, preparation and anonymization of image samples and their data: Isabel Bejumeda. • Design and execution of the image analysis algorithm: Mariangel Rodríguez and Claudio Córdova • Implementation and testing of the machine learning model: Mariangel Rodríguez • Statistical analysis: Claudio Córdova • Figure design: Isabel Benjumeda and Claudio Córdova. • All authors participate in the article writing and revision .

Funding

This study was funded by a national doctoral scholarship from the “Agencia Nacional de Investigación y Desarrollo ANID”, Chile (grant no. 21220332 and no. 21231886), a PhD Program in Health Sciences and Engineering, Universidad de Valparaíso, Chile, and the Carlos Van Buren Hospital (Valparaiso, Chile) basic operational funds.

Institutional Review Board Statement

All the samples used for this study corresponded to databases for public use available on the web, corresponding to the repositories declared in methodology (https://database.cric.com.br/classification, and https://data.mendeley.com/datasets/zddtpgzv63/4). The images collected do not have an identifier associated with the patient and are completely anonymous.

Informed Consent Statement

Informed consent does not apply in the use of these samples corresponding to public databases for classification.

Data Availability Statement

All data, digital images and codes are available for editor access and revision in private link (Microsoft OneDrive or Google Drive).

Acknowledgments

Not applicable.

Conflicts of Interest

The authors declare that they have no competing interests.

Abbreviations

The following abbreviations are used in this manuscript:
CC cervical cancer
ML machine learning
AI artificial intelligence
DL deep learning
CNN convolutional neuronal network
LMICs particularly in low- and middle-income countries
LBC liquid-based cytology

References

  1. Organization, W.H. *New WHO Recommendations on Screening and Treatment to Prevent Cervical Cancer among Women Living with HIV*; Wiley, 2023.
  2. Bustos Marisa, C.M. Especialistas alertan que cáncer de cuello uterino sigue en el Top 10 como causa de muerte en Chile 2023.
  3. Bogdanova, A.; Andrawos, C.; Constantinou, C. Cervical Cancer, Geographical Inequalities, Prevention and Barriers in Resource Depleted Countries (Review. *Oncol. Lett.* **2022**, *23*, 113. [CrossRef]
  4. Lee, Y.-M.; Lee, B.; Cho, N.-H.; Park, J.H. Beyond the Microscope: A Technological Overture for Cervical Cancer Detection. *Diagnostics* **2023**, *13*, 3079. [CrossRef]
  5. Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global Cancer Statistics 2018: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. *CA Cancer J. Clin.* **2018**, *68*, 394–424. [CrossRef]
  6. Almonte, M.; Murillo, R.; Sánchez, G.I.; González, P.; Ferrera, A.; Picconi, M.A.; Wiesner, C.; Cruz-Valdez, A.; Lazcano-Ponce, E.; Jerónimo, J.; et al. Multicentric Study of Cervical Cancer Screening with Human Papillomavirus Testing and Assessment of Triage Methods in Latin America: The ESTAMPA Screening Study Protocol. *BMJ Open* **2020**, *10*, e035796. [CrossRef]
  7. Rezende, L.F.M.; Murata, E.; Giannichi, B.; Tomita, L.Y.; Wagner, G.A.; Sanchez, Z.M.; Celis-Morales, C.; Ferrari, G. Cancer Cases and Deaths Attributable to Lifestyle Risk Factors in Chile. *BMC Cancer* **2020**, *20*, 693. [CrossRef]
  8. Saldivia, L.Z.; Silva, I.V.; Rojas, F.H.; Vidal, B.M. Distribución etaria e incidencia de lesiones preinvasoras y cáncer cérvico uterino, entre los años 2009-2019: revisión de tres zonas geográficas de Chile. *Rev. Conflu* **2022**, *5*, 56–59.
  9. Schiffman, M.; Castle, P.E.; Jeronimo, J.; Rodriguez, A.C.; Wacholder, S. Human Papillomavirus and Cervical Cancer. *Lancet* **2007**, *370*, 890–907.
  10. Fowler, M., Jr; Ev, D.; j., C. Cervical Cancer. In *StatPearls* [Internet]; StatPearls Publishing: Treasure Island (FL), 2024.
  11. Núñez-Troconis, J. Papel del virus del papiloma humano en el desarrollo del cáncer del cuello uterino. *Investigación Clínica* **2023**, *64*, 233–254. [CrossRef]
  12. Shepherd, L.J.; Bryson, S.C.P. Human Papillomavirus—Lessons From History and Challenges for the Future. *J. Obstet. Gynaecol. Can.* **2008**, *30*, 1025–1033,. [CrossRef]
  13. Contreras, R. Papanicolaou y Citología Líquida En Diagnóstico de Cáncer de Cérvix: Hospital Civil de Maracay. 2012. *Comunidad y Salud* **2015**, *13*, 12–22.
  14. Herrera Conza, E.M.; Salazar Torres, Z.K.; Espinosa Martín, L.; Aspiazu Hinostroza, K.A. Detección Oportuna de Cáncer Cérvico-Uterino. *Revista Vive* **2021**, *3*, 264–274. [CrossRef]
  15. Landy, R.; Pesola, F.; Castañón, A.; Sasieni, P. Impact of Cervical Screening on Cervical Cancer Mortality: Estimation Using Stage-Specific Results from a Nested Case-Control Study. *Br. J. Cancer* **2016**, *115*, 1140–1146. [CrossRef]
  16. Patel, N.; Bavikar, R.; Buch, A.; Kulkarni, M.; Dharwadkar, A.; Viswanathan, V.A. Comparison of Conventional Pap Smear and Liquid-Based Cytology for Cervical Cancer Screening. *Gynecol Minim Invasive Ther* **2023**, *18*;12(2):77-82.
  17. Singh, V.B.; Gupta, N.; Nijhawan, R.; Srinivasan, R.; Suri, V.; Rajwanshi, A. Liquid-Based Cytology versus Conventional Cytology for Evaluation of Cervical Pap Smears:Experience from the First 1000 Split Samples. *Indian J. Pathol. Microbiol.* **2015**, *58*, 17–21. [CrossRef]
  18. Gupta, R.; Yadav, R.; Sharda, A.; Kumar, D.; Sandeep, M.R. Comparative Evaluation of Conventional Cytology and a Low-Cost Liquid-Based Cytology Technique, EziPREPTM, for Cervicovaginal Smear Reporting:A Split Sample Study. *Cytojournal* **2019**, *16*. [CrossRef]
  19. Albarrán, M. Citología Líquida: un mejor diagnóstico en Cáncer De Cuello de Útero; Artemedica.Es, 2023.
  20. Krishna, C.; Chandraiah, S.; Krishna, C. Comparison of Conventional Papanicolaou Smear and Liquid-based Cytology: A Study of Cervical Cancer Screening at a Tertiary Care Center in Bengaluru. *International Journal of Reproduction, Contraception, Obstetrics and Gynecology* **2021**, *10*. 3106, 10 18203 2320-1770 20212963.
  21. Zhao, L.; Li, K.; Wang, M.; Yin, J.; Zhu, E.; Wu, C.; Wang, S.; Zhu, C. Automatic Cytoplasm and Nuclei Segmentation for Color Cervical Smear Image Using an Efficient Gap-Search MRF. *Comput. Biol. Med.* **2016**, *71*, 46–56. 56. [CrossRef]
  22. Sompawong, N.; Mopan, J.; Pooprasert, P.; Himakhun, W.; Suwannarurk, K.; Ngamvirojcharoen, J.; Vachiramon, T.; Tantibundhit, C. Automated Pap Smear Cervical Cancer Screening Using Deep Learning. In *Annu Int Conf IEEE Eng Med Biol Soc*; 2019; pp. 7044–7049.
  23. Bao, H.; Sun, X.; Zhang, Y. The Artificial Intelligence-Assisted Cytology Diagnostic System in Large-Scale Cervical Cancer Screening: A Population-Based Cohort Study of 0.7 Million Women. *Cancer Med.* **2020**, *9*, 6896–6906. [CrossRef]
  24. Shafi, S.; Parwani, A.V. Artificial Intelligence in Diagnostic Pathology. *Diagn. Pathol.* **2023**, *18*, 109. [CrossRef]
  25. Silva, H.E.C.D.; Santos, G.N.M.; Leite, A.F.; Mesquita, C.R.M.; Figueiredo, P.T.S.; Stefani, C.M.; Melo, N.S. The Use of Artificial Intelligence Tools in Cancer Detection Compared to the Traditional Diagnostic Imaging Methods: An Overview of the Systematic Reviews. *PLoS One* **2023**,. [CrossRef]
  26. Hou, X.; Shen, G.; Zhou, L.; Li, Y.; Wang, T.; Ma, X. Artificial Intelligence in Cervical Cancer Screening and Diagnosis. *Front. Oncol.* **2022**. [CrossRef]
  27. Kresnauli, P.; Zipora, Y.C. The Application of Artificial Intelligence In Cervical Cancer Screening With Colposcopy Imaging Device. Author content 2023.
  28. Razzak, M.A.; Islam, M.N.; Aadeeb, M.S.; Tasnim, T. Digital Health Interventions for Cervical Cancer Care: A Systematic Review and Future Research Opportunities. *PLoS One* **2023**, *18*, e0296015. [CrossRef]
  29. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. *arXiv* [cs.CV] **2015**.
  30. Shafiq, M.; Gu, Z. Deep Residual Learning for Image Recognition: A Survey. *Appl. Sci. (Basel)* **2022**, *12*, 8972. [CrossRef]
  31. Chen, H.; Liu, J.; Wen, Q.-M.; Zuo, Z.-Q.; Liu, J.-S.; Feng, J.; Pang, B.-C.; Xiao, D. CytoBrain: Cervical Cancer Screening System Based on Deep Learning Technology. *J. Comput. Sci. Technol.* **2021**, *36*, 347–360. [CrossRef]
  32. Ramírez, A.T.; Valls, J.; Baena, A.; Rojas, F.D.; Ramírez, K.; Álvarez, R.; Cristaldo, C.; Henríquez, O.; Moreno, A.; Reynaga, D.C.; et al. Performance of Cervical Cytology and HPV Testing for Primary Cervical Cancer Screening in Latin America: An Analysis within the ESTAMPA Study. *Lancet Reg Health Am* **2023**,. [CrossRef]
  33. Sachan, P.L.; Singh, M.; Patel, M.L.; Sachan, R. A Study on Cervical Cancer Screening Using Pap Smear Test and Clinical Correlation. *Asia-Pacific journal of oncology nursing* **2018**, *5*, 337–341. [CrossRef]
  34. Lozar, T.; Nagvekar, R.; Rohrer, C.; Dube Mandishora, R.S.; Ivanus, U.; Fitzpatrick, M.B. Cervical Cancer Screening Postpandemic: Self-Sampling Opportunities to Accelerate the Elimination of Cervical Cancer. *Int. J. Womens Health* **2021**, 841–859. [CrossRef]
  35. Dasgupta, S. The Efficiency of Cervical Pap and Comparison of Conventional Pap Smear and Liquid-Based Cytology. A Review. *Cureus* **2023**, *15*. [CrossRef]
  36. Zhu, J.; Norman, I.; Elfgren, K.; Gaberi, V.; Hagmar, B.; Hjerpe, A.; Andersson, S. A Comparison of Liquid-Based Cytology and Pap Smear as a Screening Method for Cervical Cancer. *Oncol. Rep.* **2007**, *18*, 157–160. [CrossRef]
  37. Sherman, M.E.; Mango, L.J.; Kelly, D.; Paull, G.; Ludin, V.; Copeland, C.; Schiffman, M.H. PAPNET Analysis of Reportedly Negative Smears Preceding the Diagnosis of a High-Grade Squamous Intraepithelial Lesion or Carcinoma. *Mod. Pathol.* **1994**, *7*, 578–581.
  38. Kanavati, F.; Hirose, N.; Ishii, T.; Fukuda, A.; Ichihara, S.; Tsuneki, M. A Deep Learning Model for Cervical Cancer Screening on Liquid-Based Cytology Specimens in Whole Slide Images. *Cancers* **2022**, *14*, 1159. [CrossRef]
  39. Stoler, M.H.; Schiffman, M. Interobserver Reproducibility of Cervical Cytologic and Histologic Interpretations: Realistic Estimates from the ASCUS-LSIL Triage Study. *JAMA* **2001**, *285*, 1500–1505. [CrossRef]
  40. Mustafa, W.A.; Ismail, S.; Mokhtar, F.S.; Alquran, H.; Al-Issa, Y. Cervical Cancer Detection Techniques: A Chronological Review. *Diagnostics (Basel* **2023**, *17*;13(10):1763. [CrossRef]
  41. Mirbabaie, M.; Stieglitz, S.; Frick, N.R.J. Artificial Intelligence in Disease Diagnostics: A Critical Review and Classification on the Current State of Research Guiding Future Direction. *Health Technol. (Berl.)* **2021**, *11*, 693–731. [CrossRef]
  42. Alowais, S.A.; Alghamdi, S.S.; Alsuhebany, N.; Alqahtani, T.; Alshaya, A.I.; Almohareb, S.N.; Albekairy, A.M. Revolutionizing Healthcare: The Role of Artificial Intelligence in Clinical Practice. *BMC Med. Educ.* **2023**, *23*, 689. [CrossRef]
  43. Chauhan, N.K.; Singh, K.; Kumar, A.; Kolambakar, S.B. HDFCN: A Robust Hybrid Deep Network Based on Feature Concatenation for Cervical Cancer Diagnosis on WSI Pap Smear Slides. *Biomed Res. Int.* **2023**, *2023*, 4214817. [CrossRef]
  44. Wong, L.; Ccopa, A.; Diaz, E.; Valcarcel, S.; Mauricio, D.; Villoslada, V. Deep Learning and Transfer Learning Methods to Effectively Diagnose Cervical Cancer from Liquid-Based Cytology Pap Smear Images. *Int. J. Online Biomed. Eng.* **2023**, *19*, 77–93. [CrossRef]
  45. Chen, W.; Shen, W.; Gao, L.; Li, X. Hybrid Loss-Constrained Lightweight Convolutional Neural Networks for Cervical Cell Classification. *Sensors (Basel)* **2022**, *22*, 3272. [CrossRef]
  46. Enciso Ortiz, S.E. Determinación de la mejor Arquitectura de Redes Neuronales Convolucionales: VGG16, ResNet50 ó MobileNet para detección de la Neumonía 2023 2024.
  47. Frangi, A.; Prince, J.; Sonka, M. *Medical Image Analysis*; Academic Press, 2023; ISBN 9780128136584.
  48. Masoodi, F.; Quasim, M.; Bukhari, S.; Dixit, S.; Alam, S. *Applications of Machine Learning and Deep Learning on Biological Data*; CRC Press, 2023; ISBN 9781000833768.
  49. Sompawong, N.; Mopan, J.; Pooprasert, P.; Himakhun, W.; Suwannarurk, K.; Ngamvirojcharoen, J.; Vachiramon, T.; Tantibundhit, C. Automated Pap Smear Cervical Cancer Screening Using Deep Learning. *Conf. Proc. IEEE Eng. Med. Biol. Soc.* **2019**, *2019*, 7044–7048. [CrossRef]
  50. Alias, N.A.; Mustafa, W.A.; Jamlos, M.A.; Alquran, H.; Hanafi, H.F.; Ismail, S.; Rahman, K.S.A. Pap Smear Images Classification Using Machine Learning: A Literature Matrix. 2022. [Google Scholar] [CrossRef]
  51. Gençtav, A.; Aksoy, S.; Önder, S. Unsupervised Segmentation and Classification of Cervical Cell Images. *Pattern Recognit.* **2012**, *45*, 4151–4168. [CrossRef]
  52. Azad, R.; Aghdam, E.K.; Rauland, A.; Jia, Y.; Avval, A.H.; Bozorgpour, A.; Karimijafarbigloo, S.; Cohen, J.P.; Adeli, E.; Merhof, D. Medical Image Segmentation Review: The Success of U-Net. *IEEE Trans. Pattern Anal. Mach. Intell.* **2024**, PP, 1–20. [CrossRef]
  53. Alvarado-Álvarez, A.M.; Salvador-Fernández, C.L.; Berruz-Alvarado, S.J.; Cañar-Lascano, G.G. Diagnóstico de cáncer cervicouterino: Comparación de la técnica de citología convencional y de base liquida. *RCS* **2023**, *6*, 18–33. [CrossRef]
  54. Zou, J.; Xue, Z.; Brown, G.; Long, R.; Antani, S. Deep Learning for Nuclei Segmentation and Cell Classification in Cervical Liquid Based Cytology. In *Proceedings of the Medical Imaging 2020: Imaging Informatics for Healthcare, Research, and Applications*; SPIE, March 2 2020; Vol. 11318, pp. 268–278.
  55. Mosiichuk, V.; Viana, P.; Oliveira, T.; Rosado, L. Automated Adequacy Assessment of Cervical Cytology Samples Using Deep Learning. In *Proceedings of the Pattern Recognition and Image Analysis*; Springer International Publishing, 2022; pp. 156–170.
  56. Mitra, S.; Das, N.; Dey, S.; Chakraborty, S.; Nasipuri, M.; Naskar, M.K. Cytology Image Analysis Techniques toward Automation. *ACM Comput. Surv.* **2022**, *54*, 1–41. [CrossRef]
  57. McManus, D.T. Miscellaneous Specimens and Ancillary Techniques. In *Histopathology Specimens*; Springer International Publishing: Cham, 2017; pp. 519–531 ISBN 9783319573595.
Figure 1. Segmentation algorithm applied in traditional Pap cytology vs. LCyt liquid cytology. (A) Breakdown of the automatic procedure into its critical steps where the analyzed cell mass and the resulting cell mass of the segments drawn in yellow by the software are observed. (B) Single-cell patches generated by the algorithm in a Pap sample. (C) Single-cell patches generated in an LCyt sample. Yellow arrows indicate zoomed areas where segments are observed that cannot define a single cell due to the diffuse of their boundaries. Pap, Papanicolaou; LCyt, liquid cytology.
Figure 1. Segmentation algorithm applied in traditional Pap cytology vs. LCyt liquid cytology. (A) Breakdown of the automatic procedure into its critical steps where the analyzed cell mass and the resulting cell mass of the segments drawn in yellow by the software are observed. (B) Single-cell patches generated by the algorithm in a Pap sample. (C) Single-cell patches generated in an LCyt sample. Yellow arrows indicate zoomed areas where segments are observed that cannot define a single cell due to the diffuse of their boundaries. Pap, Papanicolaou; LCyt, liquid cytology.
Preprints 120346 g001
Figure 2. Performance of the algorithm in the training and testing phase of ResNet50-Pap model. (A) The AUC (left panel) and loss (right panel) metrics based on the training times in both the training data (blue) and validation/test (orange) are shown. (B) The confusion matrix is shown, where 0 corresponds to negative for malignant cells and 1 to malignant positive cells, where the true/predicted label of the type 0/0 corresponds to true negatives and 1/1 to true positives. Pap, Papanicolaou.
Figure 2. Performance of the algorithm in the training and testing phase of ResNet50-Pap model. (A) The AUC (left panel) and loss (right panel) metrics based on the training times in both the training data (blue) and validation/test (orange) are shown. (B) The confusion matrix is shown, where 0 corresponds to negative for malignant cells and 1 to malignant positive cells, where the true/predicted label of the type 0/0 corresponds to true negatives and 1/1 to true positives. Pap, Papanicolaou.
Preprints 120346 g002
Figure 3. Performance of the algorithm in the training and testing phase of ResNet50-LCyt model. (A) The AUC (left panel) and loss (right panel) metrics based on the training times in both the training data (blue) and validation/test (orange) are shown. (B) The confusion matrix is shown, where 0 corresponds to negative for malignant cells and 1 to malignant positive cells, where the true/predicted label of the type 0/0 corresponds to true negatives and 1/1 to true positives.
Figure 3. Performance of the algorithm in the training and testing phase of ResNet50-LCyt model. (A) The AUC (left panel) and loss (right panel) metrics based on the training times in both the training data (blue) and validation/test (orange) are shown. (B) The confusion matrix is shown, where 0 corresponds to negative for malignant cells and 1 to malignant positive cells, where the true/predicted label of the type 0/0 corresponds to true negatives and 1/1 to true positives.
Preprints 120346 g003
Table 1. Maximum performance values of the ResNet50-Pap classification model.
Table 1. Maximum performance values of the ResNet50-Pap classification model.
Metrics ResNet50-PAP
Accuracy 0.8341
Sensitivity 0.6939
Precision 0.8549
Specificity 0.8386
F-Score 0.8042
The accuracy, sensitivity, precision, specificity and F score were obtained in the validation phase of the model from the construction of the confounding matrix.
Table 2. Performance values of the ResNet50-LCyt vs. CNN models of Sompawong *et al* [22] and Chen *et al* [31].
Table 2. Performance values of the ResNet50-LCyt vs. CNN models of Sompawong *et al* [22] and Chen *et al* [31].
Metrics ResNet50-LCyt R-CNN LCyt Sompawong *et al*. VGG-LCyt Chen *et al*.
Accuracy 0.998 NR NR
Sensitivity 0.999 0.917 0.928
Precision 0.998 0.917 0.822
Specificity 0.997 0.917 0.911
F-Score 0.998 NR NR
The accuracy, sensitivity, precision, specificity and F score were obtained in the validation phase of the model from the construction of the confounding matrix.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Alerts
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2025 MDPI (Basel, Switzerland) unless otherwise stated