With the rapid development of computer technology, tensor data increased and extended in many areas such as augmented reality, signal processing, computer vision, medical image analysis, and web data mining. Especially the most famous As the problem of pain detection is not new and much research has been done on this topic, it is very important to have a view of these previous works to have an idea of the approaches used and their limitations as well as the results obtained, so for this, we made a literature review of some articles presented in Table 1. In [
8], Ashraf et al. used the Active Appearance Model (AAM) on videos containing pain expressions based on Support Vector Machine (SVM) based machine learning procedure to classify pain and non-pain. With the advantage of representing dynamic alterations in pain-related actions, the best-performing predictive model yielded a success rate of
. In [
9], Lucey et al. Utilized a classifier SVM to develop a system for detecting image-level pain in two ways on patient images: first directly from facial features, and second by fusing individual action unit (AU) detectors. In addition. extended their work as described to detect pain to use an AAM approach on a frame-by-frame. They showed that fusing all AAM representations using linear logistic regression (LLR) provides notable performance for detecting pain and action units in the image. In [
10], Kaltwang et al. proposed to use a different form of facial landmarks and appearance features, namely discrete cosine transform (DCT), relevance vector regression (RVR), and local binary model (LBP), and then merged these features and thus showed that merging these features leads to a better estimation of pain level compared to the specific estimation of pain intensity, and they manage to achieve an accuracy of
. In ref [
11], authors used AAM to extract the canonical normalized facial appearance. Finally, an SVM classifier is used to detect the pain level on a frame to obtain an accuracy of
. In ref [
12], Sourav and Mrinal utilized Gabor filtering as a contributing step for face feature extraction and dimension reduction using principal component analysis (PCA), which increases the detection rate according to the comparative study. In [
17], authors utilized test datasets to compare the target algorithm. They then use the support vector machine as a classifier and achieve
accuracy on the images, which is the best result in the pain detection problem so far. Still recent research was published in [
13] where the authors proposed a fully automatic pain assessment model including a novel convolutional network fusion framework for assessing different pain intensities from raw facial images that achieved an F1-score of
for pain level classification. In ref [
14] Fat et al. present a framework for pain detection/classification that uses a combination of KNN and Adaboost classifiers to obtain an accuracy of
. Alghamdi et al. in [
18], developed a new system expressions-called (FEAPAS) to notify a staff when a patient suffers pain by activating an alarm. Plus the intensive care patients in ref [
23]. Pedersen et al. [
25] proposed a new approach based on the combination of depth information and descriptors such as RGB values, thermal facial images. Ferroukhi et al. proposed a new coding method video based on bandelet transform algorithm in [
26]. In addition, Chen et al. [
15] detect a pain event and locating in the video. We used CNN for automatic feature extraction from frames in [
24]. Often the pain is expressed verbally, but in some cases, traditional patient self-reporting is not efficient in [
16]. Ghosh et al. proposed method in the fourth component which utilize the scores due to the statistical and deep feature analysis are fused to ameliorate the performance in [
27]. To utilize the prediction models are based on deep feature analysis and statistical procure scores for the facial region’s pain intensities (low-pain, high-pain, and no-pain). X et al. [
20] have extensively used the convolutional neural network (CNN) using a different image classification tasks. Recently authors proposes a new architecture that uses discriminative information, which is based on the exponential discriminant analysis (DIEDA) and the projected histograms for each region are scored using the discriminative metric in refs [
21]. Mimouna et al. simplifed a heterogeneous multimodal dataset for advanced environment in [
42].[
38] Al-Shiha et al. [
39]. We see through our article that we have used, several hand-crafted features in the pain estimation task. In addition, many classifiers are used to discriminate these features. They implemented the proposed approaches to a sufficiently detailed level, for instance, in refs. [Aliradi], the relationship between AUs and pain have been considered.