1. Introduction
Welding is crucial in sectors like automobile, aerospace, and oil. Rigorous weld monitoring ensures high-quality welds and identifies defects, while robotic systems make real-time adjustments. The complexity of welding processes drives advancements in non-destructive testing (NDT), contributing to improved efficiency, safety, and quality standards across industries [
1,
2].
Advancements in non-destructive testing (NDT), fueled by technologies like AI and machine learning, are transforming materials inspection for heightened precision in defect identification. Techniques such as ultrasonic testing and digital radiography have evolved, ensuring structural integrity across industries. Ongoing scholarly efforts aim to refine NDT methodologies, promising increased sophistication in the non-destructive evaluation and ensuring precision and reliability in various industrial sectors [
3,
4,
5].
Various welding imperfections can compromise the strength of welded joints. The welding method influences Porosity from small voids or gas pockets and can be caused by issues such as impurities in the base metal. Prevention involves thorough cleaning and proper shielding. Additional defects like slag inclusion, incomplete penetration, lack of fusion, undercutting, and cracks may arise due to excessive heat input. Overlapping defects occur when there is an insufficient fusion between adjacent weld beads [
6,
7,
8]. Ensuring well-built weld joints necessitates optimizing welding parameters and using Non-Destructive Testing (NDT) methods for early flaw detection without causing damage [
9,
10,
11].
Various NDT techniques are used for comprehensive inspections. Radiographic testing(RT) uses X-rays or gamma rays to reveal internal structures and defects. In addition, visual inspection (VT) relies on direct observation and is essential for surface-level assessments. Furthermore, ultrasonic testing(UT) uses high-frequency sound waves to detect internal flaws and measure material thickness. Moreover, magnetic particle testing(MT) uses magnetic fields to identify surface and near-surface defects in ferromagnetic materials. Also, Acoustic emission testing (AE) detects structural changes by monitoring the release of stress-induced acoustic waves. Besides, eddy current testing(EC) induces electric currents to evaluate conductivity and detect defects in conductive materials. similarly, infrared thermography (IRT) measures surface temperatures to identify irregularities or anomalies. Each technique uniquely contributes to the quality control process, providing a comprehensive and accurate evaluation of materials and structures in various industries [
12,
13].
X-ray image processing advancements have transformed non-destructive testing (NDT) for welding joints, providing more precise inspections. Technologies like digital radiography and computed tomography offer improved resolution and faster imaging, enabling thorough analyses and enhanced defect detection. The integration of advanced algorithms ensures dependable inspections, contributing to continual improvements in welding quality and safety standards across industries
The integration of artificial intelligence (AI) into non-destructive testing (NDT) for weld flaws is a significant advancement that improves the accuracy and efficiency of flaw detection. AI, with its pattern recognition and data analysis capabilities, automates the detection process and enables real-time analysis of complex weld structures. This not only speeds up inspections but also reduces the likelihood of human error. The continuous adaptation and improvement of AI-driven NDT systems further refine their ability to detect and categorize weld defects over time, signifying a transformative shift in welding technology toward proactive and adaptive quality control measures [
14,
15].
The main contributions of this study are:
Performing a rigorous comparative analysis of the critical previous studies and surveys in Non-Destructive Testing (NDT) of welds while presenting the current landscape and highlighting the advancements in the NDT.
Presenting an in-depth examination of a novel paradigm about incorporating artificial intelligence (AI) assistive X-ray imaging in the NDT of welds. Establishing a solid research foundation through exploring X-ray imaging of weld defects datasets and investigating image processing, feature extraction, and AI techniques provides a comprehensive understanding of available data and processing methods.
Summarizing the practical exploration of AI-Assistive X-ray imaging in various industrial sectors, going beyond theoretical discussions.
The rest of this paper is organised as follows.
Section 2 presents the various weld defects and the NDT approaches.
Section 3 reviews recent survey papers on X-ray image-based inspection of welds, highlighting our contributions through a comparative analysis.
Section 4 provides a comprehensive review of publicly available X-ray imaging weld defects datasets, laying the foundation for subsequent analyses.
Section 5 reviews various image processing techniques, used in X-ray-based welds quality inspection.
Section 6 focuses on feature extraction and selection techniques, contributing to the understanding of key features. In
Section 7, we explore AI-based, including machine-learning, ensemble-learning and deep-learning, classifiers for the automated classification of weld defects using X-ray images.
Section 8 discusses practical applications of AI-assistive X-ray imaging-based weld defect identification.
Section 9 presents the opportunities and challenges and provides a comprehensive summary of key findings and contributions, outlining implications and suggesting future directions.
Section 10 concludes the paper.
3. Comparative Analysis of the Non-Destructive Testing(NDT) Techniques
This section navigates the present NDT landscape by providing a comprehensive, meticulously crafted comparison of key studies. Numerous studies provide extensive insight into various facets of weld quality, additive manufacturing, structural safety, and materials testing. Collectively, these articles address non-destructive testing (NDT) and defect detection in a variety of manufacturing domains. Madhvacharyula et al.(2022) [
13] focuses on improving weld quality through real-time defect detection during welding. They provide a concise overview of various weld defects, common nondestructive testing techniques, and in-situ detection methods, categorized by input signals and algorithms. M.shaloo et al.(2022) [
33] address defects in wire and arc additive manufacturing (WAAM) and fusion welding, highlighting their negative impact on mechanical properties and advocating for effective NDT techniques. J.rao et al.(2023) [
34] review the transformative impact of additive manufacturing (AM), highlighting its benefits while acknowledging its challenges and emphasizing the role of NDT in inspecting for damage. X.Shen et al.(2023) [
35] emphasize the importance of early detection of metal components in safety-critical structures through NDT, providing a comprehensive analysis of NDT technologies for metal crack detection. I.ramirez et al.(2023) [
36] discuss the central role of additive manufacturing in the Fourth Industrial Revolution, emphasizing the need for efficient inspection methods and reviewing NDT practices. Other investigations cover diverse areas such as real-time monitoring of laser welding discussed by W. Cai et al.(2020) [
37], the integration of deep learning for bridge deck assessment explored by DN. Lavadiya et al.(2022) [
38], and the application of automated defect detection in industrial processes explored by M. Amarnath et al. (2023) [
39]. Another focus is on surface defect detection using deep learning methods, as highlighted by Saberironaghi et al. (2023) [
40], while Liu et al.(2023) [
41] provide insights into radiographic image analysis of welding. Zhao et al.(2021) [
42] highlight the vulnerability of ceramic products to defects and discuss non-destructive testing (NDT) methods tailored for ceramics. In addition, Gupta et al.(2022) [
3] delve into the pivotal role of NDT in manufacturing processes, providing a comprehensive overview of various methods and highlighting the expanding applications of NDT. Taken together, these articles enrich our understanding of defect detection and NDT techniques, each contributing valuable insights and perspectives within its specific field.
Various research has been conducted to gain deeper knowledge about the different uses of NDT conventional techniques, such as the acoustic emission(AE) technique. Ramalho et al. [
43] conducted a detailed investigation of the influence of various defects on the sound waves captured by a microphone during the Wire Arc Additive Manufacturing (WAAM) process. Using power spectral density and short-time Fourier transform (STFT) analysis techniques, they successfully identified defects. Luo et al. [
44] investigated the application of AE count statistics, RMS waveform calculation, and power spectrum distribution methods for the analysis of AE signals during pulsed YAG laser welding. Their study asserted that the plasma plume induces recoil force and thermal vibration, which affect the acoustic parameters, especially influenced by the type of shielding gas and wire extension length exceeding 12 mm. In addition, Zhang et al. [
45] used acoustic emission and air-coupled ultrasonic testing for real-time monitoring of burn-through events in gas tungsten arc welding (GTAW).
The infrared thermography (IRT) technique, is valuable for the detection of weld defects. Elkihel et al. [
46] investigated the heat propagation of a weld using an active thermography method. Using inductive heating, they raised the temperature of the weld to 80 °C and observed the heat propagation using a FLIR T440 infrared camera with a resolution of 320 × 240 pixels and a bandwidth of 7.5 to 13 µm. Their results indicate that the heat loss in the weld zone is significantly greater than in the flawed region. In addition, researchers such as Massaro et al. [
47] have integrated image processing and thermography techniques to ensure weld quality. They presented a novel method for identifying weld defects on a welded steel tank (AISI 304/316) using infrared thermography and image processing. By cutting out a sample and applying heat with a heat gun, the heat distribution was captured using a FLIR T 1020 with a resolution of 1024 × 768 pixels. The combination of infrared thermography and various image processing techniques, including line calculation, 2D K-means algorithm, 2D morphology functions, and a Long Short Term Memory (LSTM) artificial neural network, proved to be a powerful tool for real-time identification and classification of weld defects. Ziegler et al. [
48] investigated the use of high-power laser excitation sources in lock-in thermography and claimed that the use of high-power lasers instead of LEDs and halogen lamps has minimal effect on the thermal emission generated by the excited sample since their emission is based on electroluminescence. Consequently, this method can be effectively used in one-sided transient thermography.
Ultrasonic testing is critical for detecting weld defects and providing an accurate assessment of the structural integrity of welded joints. Sun et al. [
49] introduced an innovative hybrid ultrasonic sensing system, called diffuse ultrasonic wave (DUW), designed for the detection of damage in railway tracks using a lead zirconate titanate (PZT) actuator and a fiber Bragg grating (FBG) hybrid sensing system. The experimental results showed that the DUW signals captured by the hybrid sensing system show significant promise for detecting damage in railroad tracks. The use of conventional ultrasonic testing (UT) is limited when it comes to inspecting structures with hard-to-reach areas, such as superstructures or substructures. To overcome this limitation, embedded ultrasonic techniques have been used for damage detection. In the study by Chakraborty et al. [
50], they presented a crack detection methodology based on an advanced signal processing algorithm, which was tested on various reinforced concrete structures and successfully identified cracks between embedded sensors. In an extension of their research, Chakraborty et al. [
51] proposed an active approach to damage detection in multiple structures using embedded ultrasonic sensors. This involved processing raw ultrasonic signals with continuous wavelet transform (CWT) and non-decimated wavelet transform (NDWT) methods to extract features for damage detection. Both studies concluded that embedded ultrasonic sensors better monitor real structures more effectively than conventional techniques.
The eddy current utilizes light reflection to detect weld defects. F.xie et al. [
52] investigate the use of pulsed eddy current (PEC) nondestructive testing to detect weld defects in large pressure vessel cylinders. Using a PEC sensor on a mobile platform, two simulated weld specimens are tested, revealing distinct signal patterns for specimens with and without defects. X-ray testing confirms the method’s feasibility for efficiently detecting subsurface defects in welds. Additionally, T.alvarenga et al. [
53] proposes an embedded system using eddy current for real-time detection and localization, introducing a novel method that uses wavelet transforms and a convolutional neural network to interpret signals. This approach is instrumental in efficiently categorizing and locating anomalies, thus contributing to the optimization of rail maintenance strategies. Field tests successfully classify rail anomalies into three main classes: squids, welds, and joints. Further, R.M. Gansel et al. [
54] highlight the need for a reliable inspection concept to detect fatigue cracks and damage. By evaluating five eddy current sensors, the research focuses on optimizing the signal-to-noise ratio during cyclic fatigue testing and selecting two sensors for semi-automated weld inspection. The effectiveness of the technique in detecting fatigue cracks is highlighted, with air coils arranged parallel to the test surface identified as optimal. The study demonstrates the ability of eddy currents to discriminate groove depths and detect actual fatigue cracks, providing important insights for assessing the structural integrity of wind turbines.
The use of magnetic inspection for weld flaw detection has been the focus of numerous research efforts to improve the reliability and efficiency of weld inspection processes. G.Y. Liu et al. [
32] focuses on improving the visual effectiveness of weld flaw detection through magneto-optical imaging nondestructive testing technology. Finite element analysis and magneto-optical image simulation are used to analyze the detection characteristics under rotating excitation. The study proposes an image fusion method based on pixel standard deviation to improve welding defect detection in magneto-optical imaging. By applying fast guided filtering and pixel standard deviation to fuse multi-frame magneto-optical images, the proposed method improves image quality and ensures efficient nondestructive testing of welding defects with improved visual effects. In addition, F.brauchle et al. [
55] addresses the detection of production defects in lithium-ion cell manufacturing to reduce scrap rates and improve energy efficiency through early detection. The proposed method uses an improved magnetic field imaging (MFI) setup and current reconstruction, building on previous work with anisotropic magnetic resistance (AMR) sensors. The approach involves scanning the magnetic field above the cell in a two-dimensional plane to detect and locate manufacturing defects, such as missing welds, cuts, cracks in the active material and current collector, and blocking elements between cell layers. Moreover, J. Ai et al. [
56] aims to improve the applicability of eddy current magneto-optical imaging nondestructive testing technology for defect detection in carbon fiber reinforced polymers (CFRP). By improving the magnetic field response of the system, especially in the context of CFRP with low conductivity, the research introduces a scanning eddy current magneto-optical imaging device. A novel inspection method known as eddy current magneto-optical phase imaging is proposed for detecting crack defects in CFRP.
The exploration of weld quality and defect detection with the use of X-ray techniques unfolds systematically, with each study building upon the foundation laid by Madhvacharyula et al.(2022) [
13], they concentrate on real-time defect detection during welding, offering a comprehensive overview of weld defects, prevalent nondestructive testing techniques, and in-situ detection methods categorized by input signals and algorithms. This establishes a cohesive starting point for subsequent research. Moreover, Li, Yaping, et al. (2019) [
57] contribute by proposing a deep learning network for X-ray image-based weld flaw detection, introducing a novel approach that simulates visual perception principles. This study directly identifies linear defects, circular defects, or noise without explicit feature extraction, demonstrating feasibility and effectiveness in enhancing efficiency. However, potential challenges, such as significant computational resource requirements and the need for robust model training for diverse defect types, are acknowledged. In addition, S.Sudhagar et al. (2020) [
58] take a numerical approach to evaluate friction stir welding (FSW) quality using X-ray images. Employing image processing, they quantify and correlate defect areas with weld mechanical properties, providing valuable quantitative insights. The Taguchi method is utilized to identify optimal process parameters. However, potential limitations arise from assumptions about the relationship between defect area and mechanical properties, as well as the need for precise control of process parameters. Furthermore, Chen, Ji et al. (2023) [
26] contribute to the sequence by aiming to enhance accuracy and efficiency in pipeline weld defect detection through non-destructive testing (NDT). Their model integrates the Feature Pyramid Network (FPN) and introduces a new visual attention mechanism (SPAM) to address challenges in X-ray image analysis. Improved detection accuracy is evident, although potential drawbacks include challenges in parameter tuning and dataset biases. Besides, J. Kastner et al. (2015) [
59] offers a unique perspective by exploring the application of flat-panel matrix X-ray computed tomography (XCT) as a non-destructive method for characterizing sample structures. Challenges in analyzing generated XCT data are highlighted, but the benefits include the ability to scan and quantify heterogeneities of different sizes and topologies, offering valuable insights that require specialized image processing expertise. Further, A. Bansal et al. (2023) [
60] conclude the sequence by focusing on weld defect detection through radiographic image analysis. They recognize the complexities introduced by diverse defect characteristics and propose computer-based image processing approaches. The study emphasizes the potential for automated detection using a unique image-based approach, with a comparison favoring deep learning networks for higher accuracy. This sequence of studies collectively presents a nuanced exploration of improving weld inspection and defect detection methodologies, with each study contributing progressively to the overarching theme.
Table 1 summarizes the significant contributions and limitations extracted from the relevant literature, providing essential insights into different methodological approaches within nondestructive testing research. It provides a valuable guide for navigating the intricacies of the field and enhances our understanding of both advances and challenges in the discipline.
Based on findings, presented in
Table 1 and
Table 2 outlines the advantages and disadvantages associated with each of the considered NDT technique. Each method has its plus and minus, and the choice depends on the particular inspection requirements, material characteristics, and the nature of the potential defects.
Among the considered NDT methods, the X-ray imaging approach is becoming famous because of its remarkable ability to penetrate materials, provide detailed information on internal structures, and effectively identify subsurface defects within welds. Moreover, it is easily compatible with existing mature image processing, feature extraction, and AI algorithms to attain a precise and automated solution. Additionally, its proven success in various industries makes it a reliable option for inspecting weld quality and categorizing major welding defects. This is why, in the remaining parts of this paper, we focused on the X-ray method for NDT of welds.
4. X-ray Images Datasets for Welding Joints
In the field of NDT, especially in welding, the datasets play an essential role in AI-based automated defect detection. They are crucial tools for analyzing and mining intricate weld details and defect characteristics. Leveraging advanced technologies, researchers use these datasets to investigate weld integrity and defect identification comprehensively. Adopting data-driven AI approaches increases the accuracy and reliability of weld quality assessment, contributing significantly to the advancement of weld inspection practices.
RIAWELC Dataset, [
73], is a novel dataset for automatic weld defect classification. It consists of 24,407 radiographic weld images classified into four classes: lack of penetration, cracks, porosity, and no defect. With an emphasis on authenticity, the dataset accounts for real-world conditions and variations in image quality. The original X-ray weld images, taken in an industrial setting and digitized as JPEGs (2000×8640 pixels, 8-bit), underwent initial processing including bead segmentation and background removal. A custom software routine extracted regions of interest, specifically slices with potential weld defects, using a windowing technique. An optimal compromise for clear visualization of various defects was found with a window size of 80×80, balancing the detection of both small and large defects. Totino. B et al. [
73] introduce the RIAWELC dataset, highlighting its characteristics and its usefulness in classifying weld defects using deep learning models. Then draw a comparison between the RIAWELC dataset and the GDX-ray and WDXI datasets. In addition, S.perri et al. [
74] present a new dataset of 24,407 annotated grayscale images of welding defects, along with a novel CNN model called WelDeNet. This model demonstrates a high accuracy of 99.5% in classifying four defect classes: lack of penetration, cracks, porosity, and no defect.
GDXray, [
75], is a weld x-ray database created by adapting the GRIMA x-ray database. The acquisition of the x-ray images followed the guidelines outlined in the ISO 17636-1 standard, specifically designed for the radiographic examination of metal fusion welds. A Lumisys LS85 SDR scanner was used to digitize the radiographic films. The rescaling process involved converting the original 12-bit data to 8-bit using a linear look-up table (LUT) proportional to the optical density of the film. The resulting radiographs are formatted in TIFF with a pixel size of 630 DPI. Several applications use the GDxray dataset to enhance our efforts in the area of weld joints. Say.D et al. [
76] study proposes an automated approach using GDXray, which combines data augmentation and convolutional neural networks (CNN) to identify multi-class weld defects in X-ray images. Additionally, Sh.naddaf et al. [
8] focus on the use of artificial intelligence, specifically deep learning, for automated defect detection and classification in nondestructive testing of newly created welds. The study involves generating 100,000 X-ray images of various welds, annotating them with NDT experts, and training a convolutional neural network (CNN) with an overall defect detection accuracy of 96%, prioritizing field-quality welds over laboratory welds. Furthermore, A.movafeghi et al. [
77] highlight the importance of identifying defects in industrial pipe welding through radiographic inspection. It introduces the Sparse Coding and Gaussian Scale Mixture (SSC-GSM) method, which uses Gaussian mixture models to enhance image contrast in radiographic images. By effectively eliminating background noise, the method results in a twofold increase in pixel density along analyzed profile lines. SSC-GSM demonstrates improved contrast and defect detection compared to conventional approaches. Also, S.kumaresan et al. [
78] presents a novel approach using a deep learning model trained on a small radiographic dataset. Data augmentation and fine-tuned transfer learning using VGG16 and ResNet50 architectures are employed. The VGG16-based model achieves a high average accuracy of 90%.
The WDXI dataset, [
79], contains 16,950 weld images obtained from equipment manufacturers and quality inspection laboratories. Converted from X-ray films and paper reports, the dataset covers various defect types and includes 13,766 annotated images. Stored in 16-bit TIF format, these images vary in resolution and aspect ratio, providing a diverse representation of real-world welding scenarios. S. Mohana et al. [
80] propose a method for crack detection in weld images using image processing techniques, involving stages of preprocessing, feature extraction, classification, and crack region segmentation. Further, J.zhang et al. [
81] focus on the problem of accurate segmentation of weld defects (WDS) in X-ray images, particularly in distinguishing critical defects such as cracks from the noisy background, is addressed. To address this challenge, the authors present a solution called Boundary Label Smoothing (BLS), which uses Gaussian blur to soften labels near object boundaries, while acknowledging the inherent inaccuracies in ground truth labels. Moreover, Ch.ajmi et al. [
82] discusses the impact of computer-aided weld flaw detection in nondestructive testing, highlighting its ability to overcome the limitations of manual inspection. It emphasizes the importance of overcoming the challenges of visually inspecting X-ray weld databases, especially when dealing with poor-quality data containing small, sticky porosities. The study presents a novel approach using the Faster RCNN architecture and thoroughly validates its effectiveness through parameterization, training, testing, and comparison with other models such as YOLO and DCNN.
The SBD dataset, [
83], is used for weld defect classification, which comprises 100,000 patches extracted from full-sized 13,560 × 1024-pixel images of a welded pipeline. An expert has categorized these patches into two groups: no-defective and defective. Sh. naddaf et al. [
83] describe the importance of continuous monitoring and advanced inspection in modern manufacturing and infrastructure maintenance. It highlights the potential economic risks associated with the growing demand for these processes and the shortage of skilled personnel. The use of artificial intelligence (AI) in advanced inspection is advocated to automate tasks and increase confidence in operations. The focus is on the non-destructive testing (NDT) of newly created welds using radiographic imaging. Existing Assisted Defect Recognition (ADR) tools are criticized for their limitations, leading to the introduction of deep learning for defect detection in newly created welds.
Table 3 concisely overviews various X-ray imaging-based welding datasets. It offers insight into the diversity of data sources, and access links to these datasets are included to facilitate further exploration and use for research purposes.
5. X-ray Image Processing for Welding Defects Enhancement
Digital radiographic images often present challenges, including reduced contrast, the presence of noise, and uneven gray scale distribution. These issues have a significant impact on weld defect detection, especially when dealing with small defects that are easily obscured by noise. Various processing methods have been used to address or mitigate these challenges. It is critical to perform the processing with precision to prevent the loss of critical information. For example, applying image enhancement methods such as normalization and histogram equalization can risk losing the original shape and brightness distribution of defects, which is essential for distinguishing between defect types.
A nuanced image processing pipeline is employed in the area of X-ray-based weld flaw detection, showcasing the fusion of advanced techniques. Notable methods such as histogram equalization [
84,
85] are used in image processing to improve image contrast by redistributing pixel intensities across the dynamic range. The process begins by constructing a histogram that represents the distribution of pixel intensities within the image. A transformation is applied to this histogram to achieve a more uniform distribution. This transformation adjusts and stretches the intensity values to encompass the available range. The result is an image with improved visibility of details and features. This improvement is particularly noticeable in regions that were previously confined to a narrow intensity range and are now more visible due to the increased contrast. In addition, image filtering [
86] is a powerful technique that involves applying convolution or mathematical operations to an image. This process uses filters or convolution matrices designed to enhance or reduce certain features within the image. Filters can be spatial or frequency-based and can include actions such as blurring, sharpening, or edge detection. Convolution, an integral part of this process, involves sliding a filter matrix over the pixel values of the image, performing a weighted sum at each step. This action changes pixel intensities, emphasizing or diminishing certain features. Image filtering is widely used in various image processing contexts for noise reduction, edge enhancement, and feature extraction tasks. Furthermore, the Contrast stretching technique [
85], Contrast stretching is an image enhancement technique that linearly scales pixel intensities across the entire dynamic range, effectively expanding the contrast within the image. By redistributing pixel values, it aims to improve the visibility of features. This method identifies the minimum and maximum pixel values in the original image and applies a linear transformation to enhance the differences between the intensities. Contrast stretching is valuable for improving visibility in images with limited contrast by making darker areas darker and brighter areas brighter. Moreover, the Wavelet Transform [
87,
88] is a mathematical tool that is critical in image processing for performing multi-resolution analysis. Its primary function is to decompose an image into distinct components present at different frequency bands, effectively capturing both intricate and broad details. This complex transformation is accomplished by convolving the image with a series of wavelet functions that vary in scale and location. The resulting coefficients provide a representation of the image’s frequency content, facilitating a thorough examination of its features. Wavelet transforms are particularly useful in a variety of applications such as image compression, noise reduction, and feature extraction because of their ability to provide a nuanced, multi-scale representation of visual data. Also, the thresholding and morphological operations [
89,
90] is a segmentation method used to divide an image into distinguishable regions based on pixel intensity values. The process requires the specification of a threshold, and pixels with intensities above or below the threshold are assigned to different segments. This binary classification streamlines image analysis and facilitates the feature extraction process. Thresholding is widely used in image segmentation, object detection, and image binarization tasks. In particular, adaptive thresholding techniques account for local fluctuations in pixel intensities, making the method more resilient when dealing with images under varying lighting conditions. The illustration in
Figure 9 depicts the assessment of an image processing technique applied to X-ray images.
6. Feature Extraction and Selection Techniques
Feature extraction and dimension reduction techniques are essential processes in data analysis and machine learning to improve model efficiency and reduce computational complexity.
The Non-transformed signal features [
91,
92] in image processing involve extracting features directly from pixel values, including intensity-based features (mean, standard deviation), spatial-based features (coherence, smoothness), statistical measures (entropy), color-based features (histograms), and edge-based features. This approach provides insight into image properties without mathematical transformations, making it suitable for tasks where computational efficiency is a priority or simple pixel analysis is required.
Transformed signal processing [
93] involves applying mathematical transformations such as Fourier or wavelet transforms to analyze pixel values. This process, which transforms images into frequency or spatial domains, reveals hidden patterns and structures. Key features include frequency components, making them valuable for tasks such as compression and texture analysis. Transformed signal characteristics offer efficiency in analyzing complex image content, especially for identifying specific patterns that are not readily apparent in the original pixel space. This technique involves the Principal Component Analysis (PCA) [
94] is a dimensionality reduction technique that transforms the original features into a new set of uncorrelated features called principal components. This process helps retain the essential information in the data while eliminating less important details. In addition, it requires discrete sine transform (DST) and discrete cosine transform (DCT) [
95,
96] are mathematical methods for transforming data from the spatial to the frequency domain. DST is effective for signals with odd symmetry, commonly used in applications such as speech processing. DCT, widely used in image and video compression (e.g., JPEG, MPEG), excels at concentrating signal energy into fewer coefficients, making it a key component in compression algorithms. Both transforms play an essential role in applications where efficient data representation and compression are critical.
Graph descriptors [
97] are quantitative measures derived from graphs that represent networks of nodes and edges. These measures, including degree distributions, clustering coefficients, and centrality metrics, provide insight into networks’ structural and topological properties. Graph descriptors are critical to analyzing complex systems such as social and biological networks, facilitating a systematic understanding of their organization and connectivity.
Structural descriptors in image processing [
98] are quantitative measures that capture spatial relationships, shapes, and patterns within an image. These descriptors, including moments, shape metrics, and spatial relations, provide valuable information for tasks such as object recognition and image segmentation. They play a crucial role in quantifying the structural attributes of digital images, supporting various image processing applications.
In the advanced domain of non-transformed and transformed signal characteristics in image processing [
1], methods such as Krawtchouk moments, Minkowski moments, and Zernike moments provide clear insight into spatial relationships and shape characteristics. These non-transformed techniques serve as powerful descriptors for tasks such as object recognition. At the same time, advanced transformed techniques, including Zernike velocity and the concept of writhe number, contribute to a comprehensive toolkit for image processing. The integration of these diverse methods enhances capabilities in tasks ranging from biomedical imaging to computer vision applications, exemplifying the dynamic environment of research underway in image processing.
In image processing, texture descriptors [
99] are crucial for quantifying visual patterns within an image and play an important role in tasks such as recognition and segmentation. These descriptors include statistical measures such as mean and variance, histogram-based features such as entropy, matrices such as the co-occurrence matrix (GLCM) that capture common pixel occurrences, and transforms such as wavelet transforms that reveal different frequency components. Local binary patterns (LBPs) are used to characterize local patterns. These descriptors play a crucial role in understanding the spatial arrangement of pixel values, contributing significantly to effective image analysis in applications such as medical imaging and computer vision.
Dimension reduction, or feature selection, is a critical step in image processing that uses statistical measures to efficiently select informative features from high-dimensional datasets and streamline computational complexity.
Filtering methods [
100] in dimension reduction for image processing use statistical measures such as variance thresholding, correlation-based selection, mutual information, chi-square tests, and ANOVA. These techniques efficiently identify informative features by evaluating variance, correlation, and statistical differences, making them computationally efficient for handling high-dimensional image data. Furthermore, Wrapper feature selection [
101] methods evaluate the performance of a machine learning model with different subsets of features, optimizing the selection based on the model’s predictive accuracy. In addition, Embedded methods for feature selection [
102] incorporate the process directly into model training, automatically selecting relevant features during learning, offering a streamlined and efficient approach. Manifold learning [
103] is a set of techniques in machine learning and data analysis that aim to uncover the intrinsic, lower-dimensional structure or geometry of high-dimensional data. These methods are instrumental when the data lies on or near a lower-dimensional manifold within a higher-dimensional space. Manifold learning algorithms such as t-distributed Stochastic Neighbor Embedding (t-SNE), Isomap, and Locally Linear Embedding (LLE) attempt to preserve the essential relationships and similarities between data points in the reduced-dimensional space. By revealing the underlying structure, manifold learning enables improved visualization, clustering, and classification of complex data sets.
Figure 10 below illustrates diverse techniques for feature extraction and dimension reduction, including methods for feature selection.