Systems supporting civil infrastructure gradually deteriorate over time. They may be divided into four categories: roads, bridges, structures, and water and sewer networks. The bridge classifications can be categorized into steel, reinforced concrete, and pre-stressed concrete with supporting and structural elements. The bridge elements, such as deck, abutment, foundation, expansion joints, railings, bearings, etc., might all be damaged. The fundamental difficulty in bridge inspection programs is the variety of elements that influence RC bridge degradation and cause various problems. Delamination, scaling, spalling, efflorescence, and cracking are various types of bridge degradation that may cause fatigue deterioration. Determining the kind, quantity, breadth, and length of flaws on bridges reveals the early stages of deterioration and avoids these kinds of incidents. Detecting and evaluating the degree of defect is an important process that may affect the structure’s capacity in its current stage or in the future [
1]. The main form of assessing the physical and functional conditions of civil infrastructure is the manual visual inspection. The main advantage of visual inspection is that it includes an extensive evaluation of the entire bridge and is not restricted to the detection or assessment of a particular type of damage or a component of the bridge. Its cost amount is determined based on the characteristics of the bridge and the level of inspection details and frequency. The main elements of the visual inspection costs are traffic management and labor [
2]. This technique of inspection still mainly relies on human eye observation. It is required that an expert maintenance engineer be on the field to decide whether the maintenance condition is required. It needs a lot of preparation on both inspection planning and expertized identification. Actually, the number of specialists in the field is inadequate compared to the number of bridges to be inspected. Thus, it is suffering from many drawbacks that have a potential threat to personal safety and caused a lot of accidents.
However, current inspection practices depend on visual inspection and basic tools, such as hammer sounding and chain drag, to determine subsurface defects such as delamination [
3]. These techniques suffer from some limitations, such as time-consuming, subjectivity, uncertainty, and the inability to detect all subsurface defects [
4]. Therefore, numerous authorities tend to use computer vision-based methods in the inspection process to evaluate concrete surface structures. This technology improves the inspection process and speed and eliminates the need for traffic disruption or total lane closure. It is not only used for defect detection but also to evaluate their severity [
5]. Recently, image acquisition applications are used by expert assistance and cameras to capture the image of bridge components and then send it to the server of the Department of Highways in order for the expert to verify in the office. Nevertheless, image acquisition applications still need human observation to check images one by one [
6]. It is worth mentioning that crack is one of the concrete defects that requires attention to understand their reasons and the remedial action that should be taken against it. It is a laborious process to manually identify, characterize, and record the cracks due to their enormous and wide variety. The development and propagation of cracks may tend to reduce the effective loading area and eventually lead to the failure of the concrete or other structures. Cracks allow dangerous and corrosive substances to enter a structure, especially one made of concrete, which compromises the structural integrity and aesthetics of the component. Crack detection is not sufficient in the inspection process. It requires extracting crack properties such as length, width, and angles. The information about cracks can be utilized to diagnose problems and choose the best rehabilitation strategy to fix damaged buildings and avoid catastrophic failure [
7,
40,
41,
42,
43]. ACI 201.1R [
8] expressed the severity of cracking as fine, medium, and wide, based on crack width, which is shown in
Table 1.
The manual measurement is a common approach that is conducted by a technician with measuring tools such as crack comparator cards, gauges, and callipers. This comparator card is designed as a ruler marked in inch and millimetre with a range of graded lines, and each line is specified as the width of the crack as shown in
Figure 1a. The limitation about those measuring is low accuracy and difficulties in recording data [
10]. A digital pachymeter (digital calliper) with higher accuracy is another instrument that requires knowledge by the technician to choose and insert the metallic blade in the crack opening. Another problem that can be raised is the uncertainty associated with the handling of the instrument by a technician. These factors required higher repetitive measures to gain more accuracy [
11]. Battery microscope magnification is designed for measuring concrete crack width range (4 mm) in (0.02 mm) divisions as shown in
Figure 1b. It is built into an adjustable light source powered by batteries. It can also be found as an electric microscopic, but it’s more difficult to use than the battery one. These instruments not only suffered from inconsistent measurement but also required the staff member to stay in hazardous condition during the period of inspection [
12].
There is a growing interest in the field of crack identification and characterization of concrete surface distresses, as evidenced by the number of recently published articles in this area. Methods based on visual inspection assist transportation organizations in identifying shortcomings and making decisions that are more accurate and objective [
13]. However, without eliminating various types of noise from images that are associated with diverse sources, such as concrete blebs, stains, uneven contrast, and shading, an image-based technique cannot be useful [
14,
15]. Performing various image processing methods to enhance image inspection makes it simpler to spot problems in inspection images. Many approaches have been applied for crack automated detection and width measurement. Noh et al. [
16] suggested utilizing fuzzy C-means clustering in segmentation to find 0.3 mm fractures in images. In this approach, a series of processes including segmentation, morphology, and filtering is used to improve the visibility of fracture features and eliminate background noise. First, fuzzy C-means is used for image segmentation. Second, morphological dilation is used to reveal fracture characteristics, after which manually calibrated masks for filtering are made. Finally, related noise locations are located and removed using a Grassfire search. In comparison to existing edge detection-based approaches, it is emphasized that the developed method achieves greater recall and precision. Another automated approach is developed by Jain and Sharma [
14] to determine the severity of cracks. The pre-processing of the images includes contrast boosting and histogram equalization. The K-means clustering technique is then used to segment the data. The images are segmented using this approach using various K-means clustering parameters, and it is discovered that a random initialization with Euclidean distance works best. Finally, a crack detection technique using fuzzy inference is utilized to provide a risk score that displays the proportion of dangerous cracks. Zhao et al. [
17] presented an inspection technique for bridge maintenance. The first part was built to determine the type of bridges, e.g., suspension, cable-stayed bridges, by using AlexNet. Second, a Faster-RCNN is trained to classify the bridge components (e.g., tower, deck). Finally, GoogLeNet was used for concrete crack detection. Nevertheless, this proposed method is suffering from a lack of discussions of the connections between the three previous components. A platform for the automatic identification and severity evaluation of spalling in reinforced concrete bridges is introduced by Abdelkader et al. [
18] The first module dealt with the preparation of images. The second module is created to automatically identify spalling. With the use of a single-objective particle swarm optimization (PSO) model that tries to increase the image Tsallis entropy, spalling images are segmented using bi-level thresholding. This module produces a single threshold
T that divides the image pixels into the spalling (foreground) and surface classes (background). The third module is the feature extraction process, which separates the collected images into high-pass and low-pass filters using Daubechie’s discrete wavelet transform. The automatic assessment of spalling severities is the final module. With reference to the artificial neural network, the suggested produced spalling evaluation model (ANN-PSO) decreases the prediction errors by percentages ranging from 71.43% to 76.65%. Additionally, in contrast to the created prediction model, the Otsu algorithm is unable to discern spalling pixels in the image. Li and others [
19] used the fully convolutional network and a Naive Bayes data fusion (NB-FCN) for crack detection, followed by a skeletonization process to extract crack properties: width and length, with a mean error less than 0.03 mm for width and 92.8% for length accuracy. Ong et al. [
20] produce a hybrid method combining the shortest method and the orthogonal projection method to measure pavement crack width with irregular boundaries or high curvature. In comparison to the shortest technique and the orthogonal projection method, the hybrid method yields the highest correlation coefficient (0.956) and the least average absolute deviation (1.769). Cardellicchio et al. [
21] introduce an approach through machine learning for defect detection of reinforced concrete bridge elements. The study divided the defects into seven groups: (1) corroded/oxidized steel reinforcement, (2) cracks, (3) deteriorated concrete, (4) honeycombs, (5) moisture spots, (6) pavement degradation, and (7) shrinkage cracks. The neural network was trained to classify single defect vs. all defect. InceptionV3 and ResNet50V2 as classic models, and DenseNet121, MobileNetV3, and NASNetMobile were used for network training. However, the approach required to increase and improved the data quality and enhance the hyper parameter optimization. Yu [
22] Introduced a proposed model for crack detection based on a Generative Adversarial Network (GAN). The accuracy result was 24.78%, and the recall rate was 19.64%, which was lower than other deep learning types. de León et al. [
23] presented a methodology for crack segmentation based on the theory of minimal path selection combined with a region-based approach obtained through the segmentation of texture features extracted using Gabor filters. An equalization of brightness and shadows is a pre-processing step to improve the detection of local minima. To improve the coverage of the cracks, these local minimal are constrained by a minimum distance between adjacent points. Subsequently, two areas are identified using a region-based segmentation technique, which establishes the threshold values for rejection. Lastly, a geometrical thresholding step is presented, which enables the exclusion of small, isolated cracks and rounded areas. Bae & An [
24] established a computer vision-based crack quantification algorithm incorporated with the deep semantic segmentation network. The crack width is calculated as the range value depending on the statistical confidence interval with a normal distribution of the maximum width. Two cases of crack area were examined to find the maximum width based on the average widths measured at each spotted point along the crack area for each image. The average difference for each case is, respectively, -41.43% and -11.14%. The model fixed the type of structuring element used in morphological operations, which cannot be generalized because of the different patterns of cracks in several images. Kao et al. [
25] used YOLOv4 deep learning, which is the integration of the development architecture YOLOv3 for the crack detection process. The images were taken by UAV from distance 1 m. Canny and morphological edge detectors were applied to extract the crack edges. Then, planar markers and measurement feature points were used to measure the crack width in images with an accuracy of 92%. Tran et al. [
26] applied You Only Look Once version 7 (YOLOv7), a deep learning network, which outperformed both Faster RCNN and RetinaNet with both ResNet50 and ResNet101 in speed and accuracy. Then, the method is used to measure the crack length and width to achieve an average accuracy of 92.38% and 91%, respectively.
Most of the previous studies concentrated on a deep learning model that required computational cost because it needed huge data for training and special resources with large memory. Additionally, the common nature of deep learning is the “black box” that keeps users blind and prevents them from changing any parameters. Also, most studies focused on defect detection, and less of them concentrated on retrieving their properties for safety assessment. Nevertheless, the previous studies have measured crack width in pixels units, and there are still significant limitations with this unit of measurement. This is because it is more accurate to measure the severity of concrete crack width in millimeters rather than pixels; hence, knowing the width of cracks in pixels does not yield useful information for field applications. Therefore, this paper focusses on retrieving crack width measures in millimeters that are associated with estimating condition assessment for concrete structures. The proposed method was applied to different concrete surfaces for crack detection and could be used as a supportive tool for authorities in the inspection process [
27]. As a result, the presented model aims to develop a crack detection model that can detect cracks on various concrete surfaces for measuring the crack width of reinforced concrete bridges. It is built based on image processing techniques that include pre-processing, thresholding, and several morphological operations and is finalized with applying Euclidean distance to measure the crack width. The main characteristics of the proposed model are the transparency of workflow and saving time in crack width measuring. The model required low costs compared with deep learning models that require huge amounts of data for learning and high computational costs. The significance of this model is related to its fast and reliability in measuring the crack width to assess the bridge condition rating. In addition, the presented model overcomes the shortcoming related to human errors caused by applying traditional tools and time-consuming for the inspection process.The research methodology, results, discussions, and limitations are presented in the following sections, followed by the conclusion.