1. Introduction
Identification of individuals represents a crucial area within the discipline of forensic sciences [
1,
2]. The age of the individual in question carries significant weight in the process of identification [
3,
4]. In instances of mass disasters, organized crime, and abuse cases, it is imperative to ascertain the age of suspects and victims to facilitate their identification [
5]. In the extant literature, the estimation of age in human history and forensic sciences has consistently been highlighted as a research topic, with the objective of developing reliable age estimation techniques for both living and deceased individuals [
4].
The term chronological age is used to describe the time that has elapsed since an individual's date of birth. This is calculated by determining the dates of both the individual's birth and death. In the absence of available data regarding the dates of birth and death, the age of an individual can be estimated by examining the biological age of the individual in question. The assessment of biological age is based on the physical development stages of the individual or the changes that occur with aging. This assessment includes the development of various systems such as height, weight, hair, skin, eyes, teeth, bones, and secondary sex characteristics [
6].
Dental tissues are the most durable part of the skeleton. It is resistant to extreme conditions and is preserved longer than bone. For this reason, teeth are often used in forensic science to estimate age [
11]. Even in cases of major disasters, when the victim's body is so mutilated that it cannot be visually identified, the remains of skull bones, jaw bones, and teeth have proven to be the most valuable source of identification [
4,
12]. Radiographic estimation of age and sex from radiographs of the jaw bones is considered more feasible because it is a simple and less destructive method that can be applied to both deceased and living cases [
4,
13].
A variety of methods, including morphological, metric, radio morphological, radio metric, histological, and biochemical approaches, can be utilized for age estimation in dental tissues [
2,
14,
15]. However, further research is necessary to ascertain the applicability and reliability of these methods across diverse populations [
11]. The classic age estimation methods rely on manual measurements and observer subjectivity, which are time-consuming and prone to observer bias, potentially increasing the workload of forensic experts and introducing subjectivity into the estimation process [
16,
17].
AI is defined as the capacity of a machine to imitate the cognitive processes and behaviors observed in humans, enabling the completion of tasks that would otherwise require human input. In recent years, the rapid development of AI has facilitated numerous technological advancements that have enhanced the quality of life for many individuals. Furthermore, AI has advanced rapidly in numerous medical domains, garnering significant interest in recent years, particularly within the radiology community. To date, the implementation of AI in dental radiography has demonstrated considerable potential for a multitude of applications. At this point, it can facilitate critical support for clinicians, novice physicians, and students in the decision-making process [
18].
In recent years, studies based on AI have been conducted with the objective of automating tasks in dentistry. These include caries diagnosis and classification [
19,
20,
21,
22,
23,
24,
25,
26,
27], automatic diagnosis of dental diseases and conditions of the teeth [
28], diagnostic evaluation and segmentation in periapical radiographs [
29], periodontal bone loss [
30,
31], implant planning [
32], detection and classification of periapical pathologies [
33], cephalometric analysis and automatic detection of anatomical landmarks [
34,
35] , detection of impacted mandibular third molars, and detection [
36], as well as to detect and classify impacted maxillary supernumerary teeth [
41]. Additionally, the research aims to detect vertical root fractures [
42,
43], classification of osteoporosis [
44,
45,
46] , improve image quality [
47,
48,
49], identify and classify odontogenic tumors and cysts [
50,
51], diagnose maxillary sinus pathologies [
52,
53,
54], dentification and classification of lymph node metastases [
55,
56], diagnosis in patients with Sjogren syndrome [
57], and detect oral cancer lesions [
58].
AI also offers significant potential in the field of forensic sciences for data analysis and ensuring the proper administration of justice [
59]. These models have the potential to be promising tools when identifying victims of mass disasters and as an additional aid in medico-legal situations. In the literature, AI-based models have been reported to show similar accuracy and precision as trained forensic scientists. It has been stated that these models can be promising tools when identifying victims of mass disasters and as an additional aid in medico-legal situations [
60].
The advent of deep learning represents a significant milestone in the evolution of AI algorithms. Deep learning is a group of algorithms based on the structure of artificial neural networks and defined as a sub-branch of machine learning. Deep learning algorithms offer significant advantages in processing complex data sets and making sense of problems. These algorithms simplify complex problems by classifying data sets in a hierarchical manner through the use of multilayer artificial neural networks. A typical deep learning system consists of an input layer representing the data set of the problem, multiple hidden layers, and an output layer. Hidden layers contain links that help to understand the relationships in the data set. This structure provides more effective results in solving complex problems than classical AI algorithms [
61].
Existing classic age estimation methods have been critiqued in the literature for several shortcomings. Observer subjectivity has been identified as a potential source of error in age estimation. Furthermore, manual measurement-based methods are time-consuming and laborious. Additionally, the feasibility of implementing these methods in routine clinical practice has been questioned due to the limited sample sizes typically used. Considering the considerations, the present research endeavors to employ DM-based methodologies for the processing of (panoramic radiograph) OPG images and patient records, with the objective of estimating the age of individuals with the highest estimation rate. The study considers the use of Deep 2D CNN and Deep 1D CNN for feature extraction, as these have the capability to identify applicable features automatically, without the need for human supervision. Furthermore, the study proposes a MG-RF regressor, which can minimize overfitting, enhancing the detection rate, utilizing genetic indications via chromosomes, and affording numerous optimal solutions. Regarding these advantages, the present work considers this AI-based methodology for age estimation.
The primary objectives of this study are as follows:
To extract the relevant features from OPG images and patient records individually using the proposed Deep 2D CNN and Deep 1D CNN, then concatenate these features to improve the estimation rate by enhancing the model performance.
To achieve the highest coefficient of determination (R²) for age estimation using the proposed MG-RF regressor.
To evaluate the efficiency of the proposed methodology with respect to standard deviation (SD), mean absolute error (MAE), mean square error (MSE), root mean square error (RMSE) and, R².
2. Related Work
In recent years, AI-based methodologies have been developed with the objective of overcoming these limitations and automating the age estimation process. These approaches yield consistent and reproducible results and facilitate reduced processing times. These approaches are deemed capable of detecting attributes that are not recognizable to human observers and are not feasible for manual computation. Moreover, the substantial benefits in identifying intricate relationships between features facilitate more accurate predictions with diminished error rates [
17]. AI-based approaches in the literature are generally categorized as machine learning, deep learning or a combination of these two methodologies [
16]. The objectives of these studies include numerical age regression [
16,
17,
66,
68,
69,
70,
71,
72,
73], staging of teeth [
74,
75,
76], classification of age groups [
77] and legal age classification [
78].
Galibourg et al. [
66] conducted a study with a machine learning approach for numerical age regression. The researchers employed ten machine learning methods including Random Forest (RF), Support Vector Machine (SVM), Decision Tree (DT), Bayesian Ridge Regression (BRR), K-Nearest Neighbors (KNN), AdaBoost (ADAB), Polynomial Regression (POLYREG), Multi-Layer Perceptron (MLP), Stacking (STACK), and Voting (VOTE). The researchers reported that age estimation using machine learning methods yielded superior results compared to manual methods based on radiographic dental staging from childhood to early adulthood.
Similarly, Tao et al. [
69] employed machine learning for numerical age regression. The researchers employed an MLP for this purpose. It was demonstrated that the proposed system exhibited superior performance compared to the reference manual methods across all performance metrics.
Shen et al. [
70] employed machine learning for numerical age regression, utilizing RF, SVM, and linear regression (LR). The findings of the study indicated that age estimation accuracy was superior in machine learning methods compared to traditional methods.
De Tobel [
74] employed a deep learning approach for staging teeth, with transfer learning (Alex-Net) demonstrating the most effective performance for staging.
Boedi et al. [
75] employed the Dense Net machine learning algorithm. The objective of the present study was to ascertain and validate the impact of lower third molar segmentations on automatic tooth development staging. The study's findings led to the conclusion that full tooth segmentation and DenseNet CNN optimization facilitate the accurate allocation of dental stages.
Banar et al [
75] employed a CNN that was similar in structure to the You Only Look Once (YOLO) algorithm, in conjunction with the U-Net CNN and the DenseNet201 CNN. The objective of this study is to automate the staging process in its entirety, utilizing CNN at each stage of the procedure. The results demonstrate that the proposed fully automated approach yields promising outcomes in comparison to manual staging.
In their study, Kim et al. [
77] employed a deep learning approach, ResNet 152, for the classification of age groups. The accuracy of the tooth-wise prediction was found to be between 89.05 and 90.27 percent. The performance accuracy was primarily assessed through the use of the majority voting system and area under the curve (AUC) scores. The AUC scores ranged from 0.94 to 0.98 for all age groups, indicating a superior capability.
Guo et al. [
78] employed the SE-ResNet101 model to ascertain the legal age groups. It was reported that end-to-end CNN models demonstrated superior performance, with accuracy rates of 92.5%, 91.3%, and 91.8% for age thresholds of 14, 16, and 18 years, respectively. To-end CNN models demonstrated superior performance, with accuracy rates of 95.9%, 95.4%, and 95.4% for age thresholds of 14, 16, and 18 years, respectively.
Čular et al. [
71] employed a combination of Deep Learning Active Shape Model (ASM), Active Appearance Model (AAM), and Radial Basis Network algorithms for the purpose of numerical age regression. In this study, the researchers proposed a semi-automated system based on deep learning techniques to predict tooth age by analyzing the mandibular right third molar tooth on OPGs.
De Back et al. [
72] applied deep learning and Bayesian convolutional neural networks methods for numerical age regression. The system achieved a concordance correlation coefficient of ccc = 0.91 in the validation set.
Vila-Blanco et al. [
16] developed and applied CNN models, specifically DANet and DASNet, for the purpose of numerical age regression. The results demonstrated that DASNet exhibited superior performance compared to DANet across all evaluated metrics, with a reduction in the median error (E) and median absolute error (AE) of approximately four months for the entire database. The authors concluded that DASNet is an effective method for automatically estimating the chronological age of a subject, particularly in younger individuals with developing teeth.
Wallraff et al. [
73] employed the ResNet18 algorithm. In this study, a supervised regression-based deep learning method for automatic age estimation of adolescents aged 11 to 20 years was proposed as a means of reducing the estimation error. In an initial investigation, the proposed methodology demonstrated a mean absolute error (MAE) of 1.08 years and an error rate (ER) of 17.52% on the test dataset, exhibiting superior performance compared to the predictions of dental experts.
In their study, Vila-Blanco et al. [
68] employed Rotated R-CNN algorithms to propose a novel, fully automatic methodology for age and gender estimation. The method initially employs a modified CNN to detect teeth and extract oriented bounding boxes for each tooth. These boxes are then fed into a second CNN module, designed to produce probability distributions of age and gender per tooth. Finally, an uncertainty-sensitive approach is used to aggregate these estimated distributions, resulting in an improvement in the absolute error rate.
Milošević et al. [
17] employed transfer learning techniques, utilizing a range of convolutional neural network architectures, including DenseNet201, InceptionResNetV2, ResNet50, VGG16, and VGG19. This study aimed to assess the feasibility of utilising AI techniques for chronological age estimation in older individuals through deep learning. The model was developed using contemporary architectures, exhibiting an average prediction error of 2.95 years for OPG images and 4.68 years for individual teeth. The researchers concluded that these AI approaches can be utilized in forensic odontology for older individuals, and this study provides a foundation for future research in this area.
3. Materials and Methods
The present study included all systemically healthy patients between the ages of 6 and 15 years who had undergone a OPGs in the correct position, free of artifacts and distortions, with clearly evaluable OPGs, without any dental deficiencies and dental restorations. The archive of the Department of Oral, Dental and Maxillofacial Radiology, Faculty of Dentistry, Pamukkale University, was the source of the data. The data set was created between March 1, 2020, and March 1, 2022. The patient's chronological age was calculated by subtracting the date of birth from the date of the OPGs.
This study was approved by the Pamukkale University Non-Interventional Clinical Research Ethics Committee (E-60116787-020-202083). The study was conducted in accordance with the principles set forth in the Declaration of Helsinki.
OPGs were obtained with a digital orthopantomograph (OP200D; Instrumentarium Company, Imaging Unit, Tuusula, Finland) with exposure values between 66 kVp, 2.5 mA, 13.4 s and 60 kVp, 6.3 mA, 14.1 s. The OPGs were subsequently evaluated using AI algorithms.
In this study, feature extraction is individually performed using the proposed Deep 2D CNN and Deep 1D CNN. In addition, MG-RF is proposed for estimating the age. The proposed methodologies estimate age through sequence of process as shown in
Figure 1. Initially, dataset is loaded. Subsequently, pre-processing is undertaken to make the data easier for interpretation and usage. This process also assists in eliminating the duplicates or inconsistencies in a data that could otherwise negatively impact the prediction rate of a model. Pre-processing a data also assists in ensuring that, there aren’t any missing or incorrect data due to bugs or human error. Following this, feature extraction is performed for eliminating the redundant data. Deep 2D CNN is used to extract the features from OPG images. Simultaneously, Deep ID CNN is utilized to extract features from patient records. These features are individually extracted and then concatenated to attain suitable efficient features for better age estimation using MG-RF. Finally, performance metrics are utilized to validate the effectiveness of the proposed system in comparison to existing methods.
3.1. Feature Extraction-Deep two-dimensional Convolution Neural Network and Deep One-dimensional Convolution Neural Network
Recently, DL has gained paramount significance in medical domain as it possesses the capability for handling huge data. Thus, this study employs two Deep CNN models to perform feature extraction. Deep 2D CNN model is applied to extract features from OPG images, while Deep 1D CNN is utilized for extracting features from the patient records. After extracting individual features, these are concatenated and fed into the trained model for age estimation.
3.1.1. Deep 2D CNN: Deep Two-Dimensional Convolutional Neural Network
Deep 2D CNN is a standard CNN and has obtained use in extensive application areas of deep CNN. It has numerous advantages. The CNNs possess the capability to integrate feature extraction and classification process into one learning body. It could also learn to perform feature optimization during training stage from raw input. Deep CNN are sparsely associated with connected weights and could process many inputs with high computational efficacy. These are also immune to trivial conversions in input data inclusive of translation, skewing, scaling and distortion. Deep CNNs could also adapt to varied input sizes. Generally, Deep 2D CNN is employed on images and its overall architecture is shown in
Figure 2 that encompass of numerous layers such as batch normalization, maxpooling, dropout, flatten and dense layer. It is termed 2D CNN as kernel slides with two data dimensions.
Feature extraction from OPG images is performed by Deep 2D CNN based on the below procedure,
1.Input Layer
Features from pre-processed data are declared as input of Deep 2D CNN and is given by equation.1.
In equation.1, represents the feature count per window after computation. To enhance the speed of model’s convergence, min-max normalization is used (as shown in equation.2) by which values in individual data dimension are linearly converted. Then, it is normalized to a range of [0, 1].
In equation.2, represents minimum of individual column, while represents maximum of individual column.
2.Convolution layer
Output from the feature map residing on unit of convolution layer is given by equation.3.
Where represents the bias for feature map, size of kernel is indicated by, indicates the weight of feature map, represents filter-index and represents activation function.
3.Maxpooling layer
Pooling layer descends aggregation measurements for adjacent outcome of that could minimize the dimension as well as the output sensitivity that accomplish scale invariant feature maintenance. Maxpooling is the pooling function utilized in this study that divides convolutional layer’s output features into various partitions and determining maximum in individual partition. Output of this layer is given by equation.4.
In equation.4, indicates pooling stride and represents the pooling size.
4.Training the model
Deep CNN comprise of numerous layers such as Conv2D, batch normalization, maxpooling2D, dropout, flatten and dense layer. Then, input is mapped into feature space of hidden layer. Finally, dense layer integrated varied features of local structure learned from lower layer for performing final prediction. This study uses single pair of convolution layer and maxpooling2D layer. Subsequently, 2D data is flattened into 1D data, in this way, overall neural network is completed with dense layer and is given by equation.5.
In equation.5, indicates class label, represents features of a sample, indicates layer index and indicates the class count. Further, forward-propagations are processed in accordance with equation.3 and equation.5. The information proliferates forward from input layer by hidden to output layer thereby accomplishes output of overall network. Moreover, an iteration of forward propagation will afford network error value. Cross entropy cost function computes error value as given in equation.6.
In equation.6, indicates sample, represents total training samples, indicates actual value, while represents predicted value. The overall algorithm of Deep 2D CNN is given in algorithm-I.
Algorithm-I: Deep 2D CNN |
1 |
Input: OPG Images |
2 |
Output: Features |
3 |
STEP 1: Sliding Window Process |
4 |
STEP 2: sef ← Extract Shadow Features |
5 |
STEP 3: Normalize sef using equation (2) |
6 |
regularization feature data, size = 64 Units - 128 Units |
7 |
repeat: |
8 |
STEP 4: Forward Propagation |
9 |
cdf ← Convolution2D(sef); |
10 |
mp ← Max_pooling(cdf ) ; |
11 |
fc ← Fully_connected(mp); |
12 |
class label ← relu(fc); |
13 |
STEP 5: Backward Propagation |
14 |
conduct backward propagation with Adam; |
15 |
Until wi convergences;// wi: weight |
16 |
STEP 6: Use the trained network to predict the features |
Initially, OPG images are taken as input. Then, sliding window process is performed. Following this, the shadow features are extracted. Then, these features are normalized and regularized using equation.2. Subsequently, forward and backward propagation is performed until the weight converges. Finally, the trained model is used for predicting features.
3.1.2. Deep 1D CNN: Deep One-Dimensional Convolutional Neural Network
Deep 2D CNN is an altered version of Deep 1D CNN which has gained interest in recent times. It requires minimum computational requirements and is suited for low cost and real time applications. Though mainstream models mostly rely on 2D convolution, main idea of 1D CNN is almost same. Main variation is that 2D convolution operates information of the matrix, wherein 1D convolution operates 1D vector information and could be expressed by,
In equation.7, represents the results of and indicates bias.
Non-linear functions are added to fit the actual function in a better way. So, the overall outcome is indicated as,
In equation.8, represents ReLU, activation functions or sigmoid functions.
In CNNs, there exists a significant concept termed receptive fields which represents the region where input data could be viewed by features of CNN. When (3*1) kernel size is utilized, initial hidden layer could view the three characteristic original data values and subsequent hidden layer could view five characteristic original data values. Thus, utilizing two (3*1) convolution kernels instead of single convolution kernel having size (5*1) could minimize the parameters while confirming that similar receptive area is attained. In accordance with this idea, the below block is framed using numerous small kernels for extracting features.
3.1.3. Feature Concatenation
After features are extracted from OPG images and patient records, it is concatenated, and these features are fed into train and test split for age estimation. The total number of OPG images are 622 and total number of records are also 622. Out of 622 OPG images, 60 images are extracted, while, out of 622 patient records, 15 features are extracted. On concatenating, a total of 75 features are obtained.
3.2. Regression-MG-RF (Modified Genetic-Random Forest)
Generally, GA (Genetic Algorithm) is general purpose and optimization search method relying on genetic theory and natural selection of Darwin in the biological systems. In this algorithm, population individual is termed as chromosome that relates to a resolution for specific issue. The individual chromosome indicates a combination of DT (Decision Tree) so length of individual chromosome indicates total DT, when bit possess value as one, DT is retained, while, value 0 represents that DT is denounced. On the other hand, RF includes additional randomness to model while developing trees. Rather than searching significant feature during node split, it searches the suitable feature within random feature subset. This leads to extensive diversity and better model. Due to such advantages this study considers GA and RF as MG-RF to estimate age on specific distinct process as shown in
Figure 3.
Initially, features are normalized wherein the numeric column values in dataset are changed to common form without distorting variations in value ranges. This assists in solving the learning challenges of a model. After this, feature ranking is performed using RF for considering important features. Following this, Compact Genetic Algorithm (cGA) is initialized. Subsequently, multiple RF models are created with chromosomes. Then, fitness value is evaluated using selection, crossover and mutation processes of GA. These operations transform chromosome’s initial population for improving quality. The selection represents the process that chooses parents which mate and then reunite for developing offspring for succeeding generation. It is vital for the convergence of GA based on which it could obtain best solutions. Selection process uses fitness for controlling chromosome evolution. Maximum fitness affords more chance to select optimal solutions. Whereas crossover states the integration of the genetic material as well as the elite position chosen for the crossover operation. Further, mutation is employed to chromosome at individual position. Finally, the fitness value is assessed, and current feature is recorded. Then, feature subset is evaluated. For specific issue, fitness function improves the chromosome quality as solution. Hence, it assists in attaining better age estimation. The overall algorithm of GA is shown in algorithm-II.
Algorithm-II: Genetic algorithm |
1 |
Input: (it, n, GA Parameters) |
2 |
STEP 1: begin |
3 |
STEP 2: Initialize c=0 and i=0, |
4 |
STEP 3: Generation: generate random n solutions; |
5 |
STEP 4: Compute Fitness(s) and Generation c; |
6 |
STEP 5: While fitness not reached compute for i iterations do |
7 |
Generation c+1 evolve(Generation c); |
8 |
STEP 6: fitness computeFitness (s) and Generation c; |
9 |
i =i + 1 |
10 |
end |
11 |
return (solution fitness) |
12 |
end |
|
|
At first, the parameters are initialized. Then, random solutions are generated. Subsequently, fitness and generation are computed. When suitable fitness value is not found, iterations are performed until best fitness solution is attained. After finding this value, best parameters are attained through the use of the obtained best fitness value. The algorithm corresponding to this process is shown in algorithm-III.
Algorithm-III: Modified fitness computation |
1 |
Input: Dataset(D), Chromosome |
2 |
Output: MAE of the Random Forests |
3 |
STEP 1: begin |
4 |
STEP 2: Ds - Dataset; |
5 |
STEP 3: Compute kvalues, num_trees, mtry by decoding (Chromosome); |
6 |
Dc - decompose the set as (Ds, kvalues); |
7 |
STEP 4: Fitnessmodel - RF fit(Dc,num_trees,mtry); |
8 |
STEP 5: Rank the feature using RF Regressor |
9 |
STEP 6: MAE - evaluate(model) |
10 |
STEP 7: return (MAE); |
11 |
end |
Initially, dataset and chromosome are taken as input. Then, are computed by decoding. Following this, the sets are decomposed. Subsequently, fitness model is determined. After this, the features are ranked using RF regressor. Finally, the MAE of a model is assessed. Lastly, the best parameters are attained from GA as shown in
Table 1.
After extraction of the best parameters, optimized solution is obtained based on Algorithm-IV.
Algorithm-IV: Optimized random forest |
1 |
Input: minK, maxK, minNTree, maxNTree, treeIncrement, RF best, RF fit |
|
2 |
Output: Optimized RF |
|
3 |
STEP 1: begin |
|
4 |
STEP 2: Compute computeFitness(s) and Generation c ; |
|
5 |
STEP 3: Evaluate fitness and return fitness, |
|
6 |
STEP 4: MAE (Fit RF best) |
|
7 |
STEP 5: Fit RF best= Optimized RF |
|
8 |
STEP 6: Optimized RF(D)= solution |
|
9 |
end |
|
|
|
|
At first the input is considered. Then, fitness value is computed. Following this, fitness value is evaluated and returned. Subsequently, fitness value is assessed to determine the solution. After this, hyper parameters for RF are attained as shown in
Table 2.
Main stages involved in outcome of the optimized RF is summarized below,
Step 1: MG-RF is trained where fitness value is evaluated with GA. In this phase, the classes obtained are optimized in accordance with GA and this attains input for RF that calculates fitness of individual trees in a forest. Iteration continues until optimized trees are determined.
Step 2: Fitness calculation
In this stage, overall fitness of trees is assessed based on values of through decoding of the chromosomes. Decomposed class obtained from phase 2 is assessed for the fitness value with ranked features corresponding to MAE score.
Step 3: Optimized RF
The optimized RF is attained from integration of GA (with the assistance of selection, crossover and mutation process) and multiple class decompositions.
Step 4: Termination
Operation is terminated after obtaining optimal RF.
Step 5: Outcome
This stage comprises of predicted data to estimate age.
5. Discussion
In the field of forensic science, the determination of an individual's age is of paramount importance. Consequently, the methodology employed must be both reliable and accurate. This indicates that the degree of accuracy should be high and the mean discrepancy between dental age and chronological age should be as minimal as feasible [
66].
The age estimation methods that have been documented in the existing literature are based on the evaluation of specific indicators that assess the stage of dental development that the individual has reached. The earliest known studies on this subject are dated to the 19th century [
17]. One of the earliest methods based on dental indicators is the Schour and Massler method, which was developed in 1941. This method involves the morphological evaluation of tooth development through the use of diagrams that illustrate the expected developmental stages of deciduous and permanent teeth [
62]. Subsequently, Nolla et al. [
63] developed a ten-stage chart for age estimation, which was considered an important milestone in the field of age estimation, prompting the rapid development of new methods [
11,
17,
63,
64,
65]. Currently in literature, the most commonly used methods for age estimation in children are reported to be the Willems and Demirjian methods [
66].
Demirjian et al. [
64] developed a method that can be used as a universal tool for assessing dental maturity and estimating dental age in children. Demirjian's method is the first in the literature to provide visualization of the stages of tooth development, descriptive criteria, radiographic examples of each stage, and selection rules for decision-making at borderline stages. In the Willems method, the stages of tooth development described in the Demirjian method are used to estimate tooth age by means of maturity tables that provide the age in years directly [
11,
64,
65].
The classical methods that have been introduced in the literature have been developed through the analysis of large datasets comprising a large number of participants. However, these methods have several shortcomings, including the fact that results are often population-specific and rely on time-consuming manual procedures that are susceptible to observer subjectivity [
16,
67].
A further limitation of classical manual radiologic tooth age estimation techniques is that they lack the requisite number of stages to enable the closest possible monitoring of the growth process. Another disadvantage is the difficulty in selecting a method that allows researchers to distinguish teeth that are not sufficiently differentiated in terms of developmental stage [
16].
The objective of the research was to estimate the age of patients from a real-time dataset comprising patient records and OPG images using AI-based methodologies. This study employed deep CNN with 1D and 2D architectures for feature extraction and a MG-RF for estimating age. The proposed system was evaluated by examining the discrepancies between the actual and predicted values. The results demonstrated a strong correlation between the two variables. Furthermore, the performance of this system was evaluated in comparison to that of the conventional system in terms of MAE, RMSE, MSE, SD, and R2 score. The analytical outcomes revealed that the proposed system outperformed the conventional system, exhibiting a SD rate of 0.0004, an MAE rate of 0.0079, an MSE rate of 0.00027, an RMSE value of 0.0888, and an R2 value of 0.9999.
The performance of the reference methods and machine learning algorithms utilized in the study conducted by Galibourg et al.[
66] was evaluated with analogous metrics to those employed in our study, namely R², MAE, RMSE, and SD. The researchers reported that age estimation with machine learning methods demonstrated superior performance compared to manual methods based on radiographic tooth staging from childhood to early adulthood. The results of the study are presented in
Table 3, along with a comparison to the findings of our study.
The study of Galibourg et al. was carried out with a methodology based entirely on machine learning. Demirjian and Willems methods were used as the explanatory system in the training of the machine learning system. These methods focus on the left seven mandibular permanent teeth for age estimation.
Our study was carried out with the aim of performing age estimation on OPGs sections containing the left seven mandibular permanent teeth in a fully automatic manner without any explanatory system. From the radiographs and patient records that constitute the dataset of our study, feature extraction was performed using 1D-DCNN and 2D-DCNN architecture from deep learning methods. In the regression step, RF and GA methods were modified and combined and age estimation was performed.
When our study is compared with the study of Galibourg et al. in terms of performance, it is observed that our study has a superior performance than the manual predictions and all machine learning approaches evaluated in the related study. In addition, it was found that the MAE value decreased significantly with the system used in our study. It is thought that the reason for the decrease in the MAE value may be due to the use of deep learning algorithms in the feature extraction step of our study, due to the perception of various age-related indicators that cannot be detected by the human eye by deep learning algorithms and thus more information is transferred to the MG-RF algorithm. In the MG-RF step, the RF algorithm reduces the similarity between individual trees as a methodology. For this reason, it is thought to increase the robustness of the final model by selecting the point of departure from a random subset of the input features at each step in the tree building process. In addition, in our study, it is thought that the integration of four different approaches into the system together with the use of machine learning techniques following deep learning processes results in a significant decrease in the error rate and a significant increase in the performance of the system.
In our study, age estimation is performed completely automatically without using any explanatory reference system. The fact that our study eliminates the disadvantages of exposure to human interpretation and the subjectivity of human observers by automating the age estimation task can be argued as a distinct advantage of our study. In addition, the fact that no explanatory system is used and that it is an automatic method makes our study easy to use, fast and reproducible.
Tao et al. [
69] proposed a machine learning-based approach to improve the accuracy of tooth age estimation in 2020 using a dataset of 1636 OPGs of 787 male and 849 female individuals aged 11 to 19 years. In the study, tooth age estimation is considered as a regression problem.
In the methodology of the study, manual measurements were first performed using the Demirjian method and the Willems method. The attributes were determined by entering the real ages of the patients into the system. Then, the MLP algorithm, which is a feed-forward artificial neural network from machine learning approaches, was trained with these features and experiments were conducted. The performance of the proposed system was evaluated using MAE, MSE, RMSE metrics. It is reported that the proposed system outperforms the reference manual methods in terms of all performance metrics [
69]. The findings of the study and the findings of our study are presented in
Table 4.
The findings of our study, as indicated by the RMSE, MSE, and MAE metrics, demonstrate significantly enhanced performance in comparison to the results obtained for both female and male groups in the study conducted by Tao et al. This discrepancy may be attributed to the fact that the group under examination in our study comprises younger individuals (6-15 years old), and the variations in the age distribution of these individuals. Given that age indicators of growth and development decline with age, it is a well-established fact that studies in the field of age estimation obtain more accurate results with younger study populations [
17].
In our study, it is thought that the combination of deep learning and machine learning methods significantly enhances the performance. The use of deep learning techniques allows for the establishment of connections that are not discernible to the human eye, thereby enabling the inclusion of age indicators that cannot be calculated manually in the system under study. It can be proposed that this may be the source of the observed improvement in the system's performance. Furthermore, our study employs an automated system that is not contingent on any explanatory framework. Therefore, our study has the advantages of being fast, repeatable, and less susceptible to human interpretation.
In their 2021 study, Shen et al. [
79] employed a series of machine learning systems to estimate age. The dataset utilized in the study comprises 748 OPGs of 356 female and 392 male individuals between the ages of 5 and 13. The study employed a methodology based on random forest (RF), support vector machines (SVM), and linear logistic regression (LR). The machine learning models were trained with the manually realized Cameriere method as an explanatory system and gender information. The target value is set to the subject's chronological age. The accuracy of the proposed systems for estimating age is evaluated based on the following metrics: R², ME, RMSE, MSE, and MAE. The results are then compared with those obtained using the European and Chinese formulas of the Cameriere method, which were employed in the training of the system. The findings of the study and the findings of our study are shown in
Table 5. [
79].
The performance of the systems proposed in the study is comparable to that observed in our own study in terms of the R² value. It is, however, noteworthy that the error rates of the systems proposed by Shen et al. are significantly higher than those observed in our study with respect to other performance metrics evaluated. It is thought that this discrepancy may be attributed to the utilization of deep convolutional neural networks in the feature extraction phase of the present study, the incorporation of certain age indicators that cannot be discerned by the human eye, and the combination of numerous techniques.
In 2017, Čular et al. [
71] conducted a study utilizing OPGs of 203 individuals between the ages of 10 and 25. In this study, the researchers proposed a semi-automated system based on deep learning techniques to estimate tooth age by examining the mandibular right third molar on OPGs. The researchers employed two statistical computer vision models, namely the Active Shape Model (ASM) and the Active Appearance Model (AAM), which have been extensively utilized in face recognition, gender estimation, and medical image interpretation and feature extraction in previous studies, to extract the features describing the right mandibular wisdom teeth selected for this investigation. In the training set, the images were manually segmented. In the study, the extracted features were presented as input to an artificial neural network, specifically the Radial Basis Network, and age estimation was performed as the output. The findings of the age estimation performance of the study were evaluated using the mean absolute error (MAE) in years. The findings of that study indicated that the system demonstrated superior performance when AAM feature extraction was employed. Although AGM was reported to perform better in this study, it was reported that the age estimation performance of the system was adversely affected when AAM and ASM were applied together [
71].
The researchers noted that the MAE value of less than 3 years represents a promising preliminary result. Should the prediction error be reduced in future studies, the system proposed in their study may prove a viable option for use within the scope of forensic sciences. Furthermore, the researchers indicated that the system proposed in this study offers two key advantages: minimal user input and the ability to function without the input of an experienced dentist. [
71] The findings of the study and the findings of our study are shown in
Table 6.
A direct comparison between our study and that of Čular et al. [
71] is not feasible due to differences in methodology and age group. As there are a greater number of age indicators of growth and development in younger individuals within the existing literature, the MAE value is typically reported to be lower in studies involving this age group. The present study focuses on individuals between the ages of 6 and 15. Accordingly, the lower MAE value observed in our study relative to that reported by Čular et al. is an anticipated outcome.
In contrast to the present study, the fact that our study does not entail a manual step such as segmentation over radiographs while estimating age can be demonstrated as an advantage of the system utilized in our study. Furthermore, as our study employed OPGs sections encompassing the left seven mandibular permanent teeth, the third molars, which are the most common congenitally missing teeth, were not required, thus conferring another advantage to our study.
In 2021, Wallraff et al. [
73] conducted a study on automatic age estimation to reduce the estimation error, which is a disadvantage of traditional manual age estimation methods. In this study, the researchers proposed a deep learning system based on supervised regression to perform age estimation. The study was conducted on 14000 OPGs of individuals between the ages of 11 and 20. The system proposed in the study uses raw OPGs as input. Images do not need to be pre-processed and cropped. The researchers reported that the focus of this study was 11-20 years of age, as teeth develop in a predictable pattern during the first two decades of life. Radiographs in the dataset that were affected by various external factors such as low-contrast images, diseases and jaw malpositions were not excluded. In this study, age estimation is considered as a regression problem and ResNet18 algorithm is used as the network architecture in the proposed system. The MAE of the proposed system was reported to be 1.08 with an SD of +1.41 and an error rate of 17.52%. The researchers stated that the dataset in their study provided comprehensive data for the age range of 11-20 years and that individuals aged 0-20 years should be included in the proposed system in the future [
73].
As indicated in the literature, age-related indicators tend to decline as growth and development progress, resulting in lower error rates in studies that examine younger age groups. In the study conducted by Wallraff et al. [
73], individuals below the age of 11 were excluded from the analysis. A comparison of our study with that of Wallraff et al. revealed a significantly lower MAE value for our study. It is postulated that this finding is attributable to the fact that our study focused on younger individuals.
In their 2020 study, Vila-Blanco et al. proposed a method based on deep learning algorithms for fully automatic age estimation from OPGs. This method was designed to overcome the limitations of traditional methods, which are affected by observer subjectivity and time-consuming manual operations. The study was conducted on OPGs of 2,289 individuals from the Spanish population, with an age range of 4.5 to 89 years. Only OPGs of the requisite quality were included in the study. In contrast to other studies in the literature, this study did not exclude OPGs with any of the following: orthodontic brackets or appliances, dentures, implants, restorative materials, fillings, endodontic treatment, foreign bodies such as caries, missing teeth, residual tooth roots, or earrings, and external elements such as distorted or blurred images. The images in question have been identified as defective within the system. In this study, convolutional neural networks from deep learning algorithms were employed as the methodology. Two distinct network architectures, namely DANet and DASNet, were devised for the purposes of this study. These architectures were designed and trained specifically for the purpose of age estimation in the context of this study [
16].
The results of the study indicate a strong correlation between the DANet and DASNet systems and age. The coefficient of determination was R²=0.87 for DANet and R²=0.90 for DASNet. In the assessment of the data across all age groups, MAE was found to be 2.84 years. As the age of the individuals decreased, MAE value was found to be 0.78 years for DANet and 0.75 years for DASNet in the group of individuals younger than 15 years.
In contrast to the methodology employed by Vila-Blanco et al., our study utilized a OPG section encompassing seven left mandibular permanent teeth as the input data for the system. In the study conducted by Vila-Blanco et al., the raw OPGs were utilized as the input data, and no exclusion criteria were employed. However, radiographs with inadequate acquisition quality were designated as defective and submitted to the system. The present study was conducted using high-quality radiographs of individuals without any dental or bony pathology. It is believed that this contributed to the success of the system used in our study.
With regard to the age group under examination, our study encompasses a much younger population than that considered by Vila-Blanco et al. Given that developmental age indicators are more prevalent in younger age groups, it may be posited that the superior performance of the Vila-Blanco et al. study can be attributed to the findings of our study. Moreover, the integration of deep learning and machine learning methodologies is believed to be a contributing factor to the enhanced performance of the system in our study.