The general essence of DT is to detect any abnormal conditions in the physical asset before it reaches a malfunction or failure as all physical assets inevitably degrade over time, thus preventing various consequences (economic, environmental, workforce safety). To ensure the reliability of the physical asset, monitoring the results of predictive simulation from the initial stage of degradation will serve as a basis for subsequent maintenance [
77]. According to [
78], there exist three types of prediction methods: data-driven, model-based, and hybrid, which combine the two.
Data-driven methods depend on either historical data alone and identify matching patterns or both real-time data and historical data, to estimate the future operating performance of a physical asset [
79]. Further, an advantage of the data-driven method is all technical information regarding the equipment/system isn’t required to be generated, it only requires data from the many sensors to be analyzed, and the data structure would wholly depend on the user [
80]. Artificial intelligence methods, statistical methods, and reliability functions are some methods utilized in the data-driven approach. For example, [
45] proposed a data-driven DT fault diagnosis learning group, that evaluates the operational conditions of machining tools used in automotive applications through deep transfer learning. To sense the temperature of the sampling tool, a k-type thermocouple is integrated with a cloud data acquisition system over a WiFi module. The DDFD approach achieved 92.33% accuracy, better than DNN-Virtual [
81] and DNN-Physical [
82] models of which got an accuracy of 90.13% and 90.13%, respectively. [
37] provides a data-driven approach to Smart Prognostics and Health Management (SPHM), of specifically a milling machine, using large amounts of data generated from shop floor devices for detecting the presence of a fault, the estimation of Remaining Useful Life (RUL), and highlights the need for a multi-faceted approach or framework with Prognostics and Health Management (PHM) which includes three phases: Setup and Data Acquisition, Data Preparation and Analysis, and SPHM Modeling and Evaluation. These three phases allow the understanding that predictive maintenance is a collection of methods (machine learning, deep learning, reliability, etc.) The SPHM framework’s effectiveness was proven in its fault detection and RUL estimation capabilities. As the data-driven method is heavily reliant on the operation data obtained by numerous sensors (a single sensor cannot detect all desired information) installed in a system, a drawback of this method arises on how sensors sometimes cannot be installed in specific areas or components of interest in the system which makes data acquisition difficult and hinders the creation of a holistic representation of the physical asset [
83].
Model-based method, on the other hand, relies on mathematical models of a physical system that simulates its behavior which can be derived from either first principles or can be developed using data-driven techniques [
84], and have its model parameters updated from measured data [
85]. Furthermore, the model from this method reflects the performance of a system, with degradation dependent on its internal working mechanism, and represents all links between various components within the system [
80] From this, the trend in performance degradation can be predicted. [
51] presented a model-based simulation through 3D finite element method as the computational modeling technique, using parameters of both healthy and broken induction motors, and motor current signature analysis to determine the impact of fault presence within the motors. The outcomes were evaluated in both time and frequency domains, and an artificial neural network was employed to categorize the current health of the motor model. The authors suggested the possibility of creating a parameterized database of healthy and faulty motors which could be used to train fault diagnosis (FD) systems. According to [
65], the model-based DT approach for FD can be a robust and cost-effective method that ensures the dependability and fault tolerance of systems specifically in Photovoltaic (PV) systems that uses a mathematical analysis, simulation study, and experimental validation. Their approach allows the real-time estimation of the outputs characteristic to a PV energy conversion unit (PVECU) and diagnoses faults by generating and evaluating an error residual vector, the difference between the estimated and measured outputs, which showed that the proposed approach is capable of detecting the presence of a fault and classifying the type of fault existing in the PVECU, with fault detection and identification times ranging from less than 290 (micro s) to less than 1.2s. This methodology illustrates greater fault sensitivity compared to existing approaches. The model-based method offers the users the freedom in simulating various scenarios with the operation of the system achieved through a myriad of data sheets, and information with individual components present in the system. A drawback of this method is its complexity and the technical experts to design and generate the model as accurately as possible [
80]. Both approaches have their respective advantages and disadvantages which are often case-specific. Hence, the existence of a third method that fuses the two mentioned methods and adopts their advantages. The
Hybrid-method combines first-principle and operation data. Based on [
86], this method can be divided into three parts: data input, mechanism analysis, and data fitting. The mechanism analysis is deemed the most critical aspect of this approach since it embodies the operation of the model. Simply, the performance information is taken from the operation data (data-driven) and analyzed through a first-principle mechanism (model-based) before the hybrid modeling. This method was employed by [
52] employed this approach to propose a solution that tackles the issue of intelligent instrument FD, which is gaining prominence in the field of manufacturing. The suggested system comprises of three layers: the data layer, control layer, and output layer. The data layer employs Micro-Electro-Mechanical Systems (MEMS) sensors and a Zigbee wireless transmission network to build a data connection between the physical endpoint and the virtual model. Their designed FD and prediction system for the indentation tested yielded an accuracy of 90%. These three methods of prediction have their own merits, the method most appropriate for a prognosis would depend on what the demands of the user would be.