Altmetrics
Downloads
137
Views
25
Comments
0
A peer-reviewed article of this preprint also exists.
This version is not peer-reviewed
Submitted:
22 May 2023
Posted:
23 May 2023
You are already at the latest version
First proposal [1] | Second proposal [2] | |
---|---|---|
Formalization and diversification of knowledge | Incorporates a correcting block based on the use of ANFIS and a specific heuristic algorithm, through which a partial formalization of knowledge is carried out. Considers several AHI threshold levels in the prediction, which facilitates the subsequent prioritization of the patients according to the severity of their condition by different types of health professionals. |
Formalizes knowledge by using of a cascade of expert systems. Considers a single AHI level, which in some way reduces the subsequent performance of the system, and allows only a single threshold to distinguish between patients who suffer from OSA and those who do not. This estimation might be used by the professionals to identify those patients who suffer from the condition. |
Uncertainty management |
Performs uncertainty management based on the use of statistical approaches, but without implicit processing of vagueness. | Uses statistical and non-statistical approaches, and therefore performs a more complete management of uncertainty and vagueness. The definition of the method for the union or aggregation of the system risks uses a utility function, which in some way involves increasing the uncertainty associated with the process. |
MATLAB | |
---|---|
Toolbox | Comments |
App Designer [54] | Facilitates the development of the user graphical interface for the artifact. |
Classification Learner [55] | Allows to perform the training and massive machine learning classification algorithms test. |
Fuzzy Logic Toolbox [56] | Makes possible to implement the fuzzy logic-based inferential engines. |
Python | |
Package | Comments |
Imbalanced-learn library [57] | Provides different tools for addressing classification problems when unbalanced datasets are available. In this case, the SMOTE-NC algorithm is used. |
Threshold | Before SMOTE-NC | After SMOTE-NC | ||||
---|---|---|---|---|---|---|
AHI < Threshold | AHI ≥ Threshold | Total | AHI < Threshold | AHI ≥ Threshold | Total | |
10 | 1,227 | 3,173 | 4,400 | 4,000 | 4,000 | 8,000 |
15 | 1,707 | 2,693 | 4,400 | 4,000 | 4,000 | 8,000 |
20 | 1,726 | 2,274 | 4,400 | 4,000 | 4,000 | 8,000 |
25 | 2,072 | 1,928 | 4,400 | 4,000 | 4,000 | 8,000 |
30 | 2,365 | 1,635 | 4,400 | 4,000 | 4,000 | 8,000 |
Cascade levels | Observations |
---|---|
Level 1 | At the first level, four expert systems are used to process the information related to the symptoms reported by the patient (for more information, see Figure 6 and Figure 7). The first of the expert systems is in charge of processing the set of information related to sleep time and determines the risk indicator R1.a at its output. The second of the expert systems focuses on processing the group of information related to unrefreshing sleep, determining the risk indicator R1.b at its output. The third of the expert systems focuses on the group of information related to complicating sleep factors, determining the risk indicator R2.a at its output. Finally, the fourth of the expert systems focuses on the snores information group, determining the risk indicator R2.b at its output. Each one of the determined risk indicators is related to the group of information used for its determination and represents respectively the hazard level associated with suffering from an OSA case in relation to each group of data. |
Level 2 | Once the risk indicators have been determined at the first level of the cascade of expert systems, they are aggregated into groups of two (R1.a and R1.b, as well as R2.a and R.2b) using two new expert systems working concurrently [1,2,4,5,8,10,11,12] as can be seen in Figure 6 and Figure 7, and at their output, after the defuzzification process, determine two new indicators, the risks R1 and R2. These new risk indicators represent, respectively, the danger of suffering from OSA in relation to the risk indicators of the previous level, through which it was possible to determine each of the indicator. |
Level 3 | Finally, at the last level of the cascade, a final expert system aggregates the risk indicators obtained at the second level (R1 and R2), determining at its output an indicator, called Symbolic Risk, related to the conjoint hazard that a patient has of suffering from an OSA case, after contemplating all the symptoms reported by the patient. |
Fuzzy Structure | Mamdani-type |
---|---|
Defuzzification method | Centroid [48] |
Implication method | Min |
Aggregation method | Max |
Inference system associated to the sleep time data group | |||
---|---|---|---|
Input Data | Range | Output Risk | Range |
Hours of sleep | 0 – 14 hours | R1.a | 0 – 10 |
Minutes until falling asleep |
0 – 240 minutes | Initial configuration | |
Fuzzy structure: Mamdani-type. Membership function type: trapezoidal. Defuzzification method: centroid [48]. Implication method: MIN. Aggregation method: MAX. Number of fuzzy rules: 46 |
|||
Prolonged intra-sleep awakenings |
0 – 10 | Subset as an example of the 46 fuzzy rules | |
1. IF (Hours_of_sleep is Few) AND (Minutes_until_falling_asleep is Few) AND (Prolonged_intra-sleep_awakenings is Never) THEN (R1.a is Low). 2. IF (Hours_of_Sleep is Few) AND (Minutes_until_falling_asleep is Few) AND (Prolonged_intra-sleep_awakenings is Never) THEN (R1.a is Medium). |
|||
Graphical example of fuzzy rules 1 and 2 | |||
Statistical Risk | mfst9 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 |
mfst8 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | |
mfst7 | - | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | |
mfst6 | AR: mf3 | - | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | AR: mf3 | - | |
mfst5 | - | - | - | AR: mf1 | AR: mf3 | AR: mf3 | AR: mf1 | AR: mf3 | AR: mf3 | |
mfst4 | - | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | - | |
mfst3 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | |
mfst2 | - | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | |
mfst1 | - | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | AR: mf1 | |
Apnea Risk (AR) | mfsy1 | mfsy2 | mfsy3 | mfsy4 | mfsy5 | mfsy6 | mfsy7 | mfsy8 | mfsy9 | |
Symbolic Risk |
AHI 10 | AHI 15 | AHI 20 | AHI 25 | AHI 30 | |
---|---|---|---|---|---|
Number of rules | 68 | 71 | 72 | 72 | 71 |
Generated surface |
Sensibility | Specificity | |
---|---|---|
AHI 10 | 0.83 | 0.88 |
AHI 15 | 0.88 | 0.71 |
AHI 20 | 0.93 | 0.62 |
AHI 25 | 0.64 | 0.83 |
AHI 30 | 0.72 | 0.69 |
Data | Value | ||
---|---|---|---|
Objective data | Gender | Man | |
Age | 69 | ||
Weight | 93 kg | ||
Size | 179 cm | ||
BMI | 29.03 | ||
Neck circumference length | 43 cm | ||
Habits | Drinking habits: daily, 30g of alcohol |
||
Drug treatments | - | ||
Illnesses | - | ||
Subjective data | Sleep time subgroup | Hours of sleep | 8h |
Minutes until falling asleep | 30 minutes | ||
Prolonged intra-sleep awakenings | No | ||
Unrefreshing sleep subgroup | Feeling of unrefreshing sleep | Occasionally | |
Daytime tiredness | Occasionally | ||
Morning dullness | No | ||
Complicating sleep factors | Unjustified multiple awakenings | No | |
Nocturia | Occasionally | ||
Breathless awakenings | No | ||
Reported apneas | No | ||
Snores subgroup | Snorer | Yes | |
High-intensity snorer | No | ||
Snore-related awakenings | No | ||
Sleep test | AHI | 23 |
Statistical Risk [0,100] | Symbolic Risk [0,100] | |
---|---|---|
AHI = 10 | 70 | 42.86 |
AHI = 15 | 46.67 | 42.86 |
AHI = 20 | 80 | 42.86 |
AHI = 25 | 33.33 | 42.86 |
AHI = 30 | 43.44 | 42.86 |
Ease of use (Easy / moderate / complex) |
Automated knowledge Acquisition (full / partial) |
Data dependency (full / partial) |
Combination of inference models (yes / no) |
|
---|---|---|---|---|
Ismail Atacak [65]: This paper proposes a malware detection system for the Android operating system. To this end, six classification machine learning algorithms are first used, focusing on the processing of application data. Then, through a voting process, three of the algorithms are selected, whose results are interpreted and aggregated by a Mamdani-type fuzzy inference system, which makes it possible to determine the degree of malware. | Moderate difficulty of use. To use this system, it is essential to define in advance the knowledge base of the fuzzy inference system that will allow the aggregation of the results of the machine learning algorithms. | Partial knowledge acquisition. The system architecture does not have an automatic knowledge acquisition sub-system. The rules of the Mamdani inference system need to be defined manually. | Full data dependency. This is a data dependent approach as it uses supervised machine learning algorithms. | Yes, it consists of the sequential use of a block of machine learning algorithms and a fuzzy inference system, which can be understood as an ensemble, that is, where the fuzzy engine allows the predictions of the preceding algorithms to be aggregated. |
- | - | = | = | |
Melin et al. [66]: In this work, aimed at predicting the COVID-19 time series, it is proposed to use jointly a block of neural networks, more specifically nonlinear autoregressive neural networks and function fitting neural networks, and a fuzzy inference system focused on determining the importance of the predictions of the models, which are aggregated through a weighted summation model. | Moderate difficulty of use. To use it, it is essential to first define the knowledge base that will allow the importance of each model’s predictions to be determined on the basis of its prediction error. | Partial knowledge acquisition. The system integrates a module based on the Mamdani inference system. Its rules are defined manually, as it does not include a knowledge acquisition subsystem. | Full data dependency. This is an unavoidably data-dependent approach where it is necessary to train the neural networks used. | Yes, this paper uses together both statistical inferential approaches and symbolic inferential approaches. |
- | - | = | = | |
Ahmed et al. [67]: This paper proposes a system for the prediction of diabetes condition. To this end, two machine learning algorithms are used that focus on the processing of initial data, whose predictions are then processed by an inference system based on fuzzy logic of the Mamdani type, which is responsible for determining the final prediction. | Moderate difficulty of use. In order to use this system, it is necessary to first define the knowledge base of the Mamdani inference system, which is essential in determining the prediction of the model. | Partial knowledge acquisition. The knowledge base is manually defined, as the system lacks a knowledge acquisition subsystem. | Full data dependency. As it integrates approaches based on the use of classification machine learning algorithms, the availability of data is essential. | Yes, this system uses both statistical and symbolic inferential approaches in a joint and sequential manner. |
- | - | = | = | |
Ragman et al. [68]: This paper proposes a real-time rainfall forecasting system. To achieve this, it uses four classification machine learning algorithms that focus on the processing of sensor data. The predictions of these models are combined by a Mamdani inference system, which is responsible for determining the final prediction. | Moderately difficult to use. In order to use the system, it is essential to define the knowledge base of the fuzzy inference system responsible for aggregating the predictions. | Partial knowledge acquisition. The system does not have a knowledge acquisition subsystem. The knowledge base is manually defined. | Full data dependency. This is a data dependent approach as it incorporates supervised learning approaches into its architecture. | Yes. It uses statistical and symbolic inference approaches. |
- | - | = | = | |
Casal-Guisande et al. [2]: This is an intelligent decision support system applied to the diagnosis of obstructive sleep apnea. Its architecture uses a set of classification machine learning algorithms and a cascade of expert systems, each of which outputs a risk indicator. These indicators are then combined through a utility function that determines a metric associated with suffering from the pathology. | Moderate difficulty of use. In order to use the intelligent system, it is necessary to have defined the knowledge bases of the different expert systems. In this sense, after this milestone, the system could be used without major difficulties other than those related to the revision and improvement of the rules for its use. | Partial knowledge acquisition. The architecture of the intelligent system, more specifically the cascade of expert systems, does not include a specific knowledge acquisition subsystem. This must be manually defined by the expert team. | Full data dependency. The data is required to train the machine learning classification algorithms. | Yes. It uses symbolic and statistical inference approaches in a concurrent mode. Their results are then combined using a specific utility function. |
- | - | = | = | |
Casal-Guisande et al. [5]: This is an intelligent decision support system applied to the diagnosis of breast cancer, focusing on the interpretation of information obtained from mammograms. Its sequential architecture uses a cascade of expert systems, whose output is a set of risk indicators. A set of underlying factors that summarise and represent the risks is then obtained by applying factor analysis approaches. These are processed by a classification machine learning algorithm, which makes it possible to determine a risk metric associated with suffering from the pathology. | Moderate difficulty of use. The use of the intelligent system requires the definition of the knowledge bases of the different expert systems. In this sense, after this milestone, the system could be used without major difficulties other than those related to the revision and improvement of the rules. | Partial knowledge acquisition. The architecture of the intelligent system, and more specifically the cascade of expert systems, does not contemplate a specific subsystem for acquiring knowledge. The knowledge bases must be defined manually by the expert team. | Full data dependency. Due to its sequential nature and the use of a machine learning classification algorithm, this approach requires a data set from the beginning. | Yes. It uses symbolic and statistical inferential approaches. |
- | - | = | = | |
Casal-Guisande et al. [10]: This paper presents an intelligent system applied to the diagnosis of breast cancer, focusing on the interpretation of the information obtained after performing mammograms. Its architecture uses a set of expert systems and a classification machine learning algorithm working concurrently. The output is a set of risk indicators that are combined using a specific analytical function to obtain a risk indicator. In addition, the system incorporates a corrective approach that allows the weighting of the risk obtained through the opinions of the experts, which is reflected in the BI-RADS indicator. | Moderate difficulty of use. For the use of the intelligent system, it is necessary to define the knowledge bases of the different expert systems. In this sense, after this milestone, the system could be used without major difficulties other than those associated with the revision and improvement of the rules. | Partial knowledge acquisition. There is no subsystem for knowledge acquisition. The knowledge bases of expert systems must be defined manually. | Full data dependency. The system is data dependent due to the use of a classification machine learning algorithm. | Yes, the system integrates various inferential approaches, both symbolic and statistic. |
- | - | = | = | |
Our proposal | Very easy to use. Beyond the definition of the knowledge bases of the cascade of expert systems, it is not necessary to define the rules of the Mamdani inference system responsible for risk aggregation. The system can be used from the beginning without major difficulties. | Full and automatic knowledge acquisition. The system integrates an automatic fuzzy rule generation mechanism, the algorithm proposed by Wang and Mendel. | Full data dependency. In addition to defining the classification machine learning algorithms, data is also needed to create the corpus of rules for the aggregation inference system. | Yes, the system integrates a variety of inference approaches, both symbolic and statistical. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 MDPI (Basel, Switzerland) unless otherwise stated