3.2.1. Construction and Composition of the Different Classes
After a meticulous refinement process through iterative adjustments, and following the application of the ascending hierarchical clustering method, the convergence of our algorithm allows us to identify five final superclasses. This convergence is of capital importance in ensuring the consistency of the algorithm and is guaranteed by the design of our parameters, configured to decrease over the iterations. The number of iterations needed for convergence is carefully selected by exploring different parameter setups and monitoring the graphic evolution of points, ensuring the choice of an optimal number for stable and meaningful superclasses.
Thus, the final five superclasses encapsulate a wide range of unique attributes and interrelationships within the dataset, thereby offering valuable perspectives on various aspects of the COVID-19 pandemic.
To clarify the characteristics of these superclasses, we adopt a dual-view approach, as illustrated in
Figure 1 and
Figure 2, thereby providing a detailed exploration of the complex patterns and relationships revealed by our clustering analysis. Presenting the data in separate views aims to provide an exhaustive understanding of the underlying dynamics that leads to the formation of each superclass.
Additionally, in
Figure 3, we introduce an alternative visualization technique – a joint 2D PCA projection – providing a broader overview of the dataset. This method facilitates comparative analyses and identification of trends across all superclasses. Including bubble size as an indicator of the severity index adds additional understanding into the severity of COVID-19 measures implemented in different scenarios.
By exploiting multiple visualization techniques, we improve the interpretability and comprehensiveness of our analysis. While
Figure 1 and
Figure 2 explore specific characteristics and relationships within each superclass,
Figure 3 provides a larger perspective, contributing to a complete understanding of the complex dynamics and implications of the COVID-19 pandemic.
Aiming to provide valuable information regarding the trajectory of the COVID-19 pandemic in different countries, we identify five key categories that illustrate different aspects of the pandemic’s effects. Firstly, the comforting situation (Cf1 - blue) depicts a reassuring scenario with minimal new cases, deaths, and intensive care cases per million people, alongside a low severity index. Secondly, the epidemic control situation (Cf2 - green) represents a state of epidemic control, characterized by a high incidence of new cases per million people, while the number of deaths remains relatively low. Thirdly, the panic situation (Cf3 - red) signifies a scenario inducing panic, marked by a significant increase in deaths and intensive care cases relative to new cases. Next, the preventive situation (Cf4 - purple) portrays a proactive approach to combating COVID-19, with low counts of new cases, deaths, and intensive care cases per million people, despite a high severity index. Lastly, the worrying situation (Cf5 - yellow) indicates a transition to concern, with a high number of deaths and intensive care cases relative to new cases, despite the latter being low. The severity index is substantial, reflecting significant measures in place.
In addition to the findings from
Figure 1,
Figure 2, and
Figure 3,
Figure 4 serves as a complementary visualization, shedding further light on the distinct characteristics of each identified class. By examining the average values of pertinent variables within each class, we gain a deeper understanding of the nuanced differences among them. Upon closer inspection, these average values reaffirm the distinctive traits attributed to each class. For instance, the blue class continues to exemplify a state of relative comfort amidst the pandemic, with consistently low averages across all variables. Conversely, the green class signifies a phase of epidemic control, as evidenced by a notably higher average number of new cases in comparison to new deaths, suggesting effective management measures. Moving forward, the red class embodies a concerning scenario marked by a stark contrast between the average number of new deaths and new cases, indicative of a potential healthcare crisis. Meanwhile, the purple class portrays a proactive stance against the spread of COVID-19, characterized by a relatively high average stringency index alongside low averages for other variables, signaling stringent preventive measures. Lastly, the yellow class denotes a transitioning phase towards heightened concern, where the average number of new deaths slightly surpasses that of new cases, coupled with a sustained adherence to significant measures reflected in the stringency index. By looking into these nuanced averages, we enrich our understanding of the diverse COVID-19 scenarios encapsulated within each class, thereby facilitating more informed decision-making and targeted interventions.
In
Figure 5 and
Figure 6, we present the results obtained by applying our proposed algorithm to the COVID-19 dataset. Each box on the graphs corresponds to the stringency index of a specific country at a particular point in time. The horizontal axis represents the timeline in months, spanning from February 2020 to April 2022. Meanwhile, the vertical axis displays the 15 countries included in the study. In the context of studying the COVID-19 pandemic, "waves" refer to distinct and often recurring periods of increased COVID-19 cases, hospitalizations, and/or deaths within a given region or population. These waves characterize surges in the number of individuals testing positive for COVID-19 or experiencing severe illness and may correspond to the rapid spread of new variants of the virus. The concept of waves describes the fluctuating patterns of COVID-19 transmission over time. Typically, a wave involves a significant increase in cases, reaches a peak, and then shows a decrease in cases. The reasons for these waves vary and may include factors such as changes in public health measures (e.g., lockdowns, mask mandates), the emergence of new variants, vaccination campaigns, and public behavior. It is important to note that the terminology and criteria for defining COVID-19 waves vary between regions and among researchers. Waves are often analyzed to understand the dynamics of the pandemic, assess the effectiveness of public health interventions, and plan for healthcare system capacity. In summary, COVID-19 waves represent recurring surges in cases or outbreaks of the virus, playing a key role in shaping the trajectory and impact of the pandemic.
In our analysis of the COVID-19 pandemic, we focus in
Figure 6 on identifying and observing distinct waves of the virus within the data. To do this effectively, we focus our attention on specific classes, namely the "Worrying situation," "Epidemic control situation," and "Panic situation." The rationale behind this approach lies in the unique characteristics and severity of these classes. These particular classes represent critical phases of the pandemic, where the impact of COVID-19 on a given country or region is most pronounced. In the "Worrying situation" class, the population is deeply affected, either due to a substantial number of new deaths in proportion to new cases or due to a massive spread of the virus with effective control over the number of deaths. Similarly, the "Epidemic control situation" class indicates a situation where there is a significant number of new cases but relatively fewer deaths, signifying effective measures to mitigate the impact. The "Panic situation" class reflects a scenario where there is a substantial number of deaths and patients in intensive care, often exceeding the number of new cases. This situation results from the time delay between COVID-19 infection and mortality, highlighting the urgency of addressing the pandemic’s impact. By focusing on these three classes over time, we trace the trajectory of the pandemic’s waves. These classes encapsulate the periods when the COVID-19 situation is most critical and when waves, characterized by surges in cases and their consequences, are most evident. Our approach enables us to discern and analyze the dynamics of the pandemic with a sharper focus on these impactful phases, providing a clearer perspective on the virus’s behavior and its response to various factors such as public health measures, vaccination efforts, and emerging variants.
For the first wave, the countries divide into two groups: the yellow class and the red class, representing respectively situations of concern and panic where the number of new deaths is high compared to the number of new cases. Additionally, only a small number of countries considered in the study are deeply affected by this first wave, which lasts only about three months, from March 2020 to May 2020. Each of the countries that is part of this first wave stays there for up to two consecutive months, except Belgium and Sweden, which stay there for three months. It appears that during this wave, Switzerland and Luxembourg do not reach the extreme situation, the situation of panic. It is noteworthy that Italy is the only country among the considered countries that is in the most critical class from the start and remains there for two months without transitioning through a worrying class. Moreover, despite their absence during the first three months of the COVID-19 wave, the United States experiences a later first wave, during the months of July and August 2020 when all the European countries considered are in a rest period between the first and the second wave.
In the second wave, the most common group is the red class, indicating the peak in terms of new COVID-19 deaths since the beginning of this crisis. This represents the most critical situation as the number of new deaths is very high and often exceeds the number of new cases. This period spans approximately eight months from October 2020 to May 2021. During this wave, all the considered countries are affected over more or less similar periods, particularly during the first four months. In the United States, Spain, France, and Belgium, the second wave of COVID-19 begins with a number of new cases that is not yet very high. Then, the majority of the countries in this study move directly during November 2020 and December 2020 to a very large number of new deaths compared to new cases, the panic situation. Cyprus is the country least affected by this wave.
For the third and last wave observed on this graph, we observe the emergence of a new cluster, the green group, which is prevalent from January 2022 until March 2022 and indicates good control of deaths due to COVID-19. During this wave, despite a very high number of new cases, the number of deaths remains considerably low. This situation results from the impact of the vaccination against COVID-19, which starts in January 2021, in addition to the possibility of acquisition of global immunity around the world. It is also possible that the variant of COVID-19 present at this time is less severe than the previous forms. During this wave, all the considered countries are affected, and it is especially in January 2022 that the number of new cases peaks in all countries. This period follows the end-of-year celebrations, which could explain this observation. This last wave lasts about nine months, from August 2021 to April 2022. Almost absent from the first and second waves of COVID-19 so far, Cyprus now starts the third wave in July 2021. This wave spreads to Spain, France, and the United States in August 2021. Five months later, in January 2022, this wave of COVID-19 encompasses all the considered countries without exception. An interesting point to note is that France and Spain are at the beginning of each wave and are therefore always among the first countries to feel the effect of a new wave of COVID-19. Another surprising observation is that during this wave, despite the explosion in the number of new cases, the measures taken are quite weak, and this does not affect the small number of deaths.
As for the other periods, we notice that the first period of calm between the first and second waves lasts about four months, from June 2020 to September 2020, while the second period of rest between the second and third waves lasts about one to two months. This situation results from the fact that during the first period of COVID-19, the measures taken concerning quarantine are quite severe and, at a certain point, COVID-19 stops spreading as fast as before. In contrast, especially after the vaccination and the control of the number of new deaths, there is a relaxation of the considered measures.
3.2.2. Evolution of COVID-19 in France, Italy, Sweden, and the United States
Thanks to the results obtained, we can follow the transition of countries from one cluster to another over time. From fig3 and fig4, we notice that France often starts the waves, Italy is one of the first European countries to enter the worst COVID-19 situation, Sweden very rarely, almost never takes severe measures according to the values seen in the boxes, and the United States experiences a delay in its COVID-19 situation compared to Europe. For these reasons, we are interested in observing the evolution of the COVID-19 situations for these three European countries: France, Italy, and Sweden, and comparing them with the United States’ COVID-19 situation. We examine these COVID-19 trajectories in fig5. The colors used represent the same clusters with the same characteristics mentioned earlier in this paper. The curve drawn above each list of colored boxes represents the wave of COVID-19 that the specific country is experiencing. It is drawn according to the following benchmark: if the country passes through a comforting or preventive situation (blue or purple), the curve takes the value 0; if the country is in a worrying situation (yellow), the curve takes the value 0.5; for a passage through an epidemic control situation (green), the curve takes the value 1; and for the panic situation (red), the curve takes the value 2.
In the United States, during the months of February and March 2020, the COVID-19 pandemic does not start yet. In April 2020, the USA moves to the preventive class during which the government establishes several measures and precautions even though the numbers of new cases and new deaths are still very low. The United States remains in this class for three months until June 2020. In July 2020, the health situation in the United States becomes critical and concerning with a significant number of new deaths despite a small number of new cases. This situation remains stable for two months, then the United States sees its number of new deaths and new cases become low but maintains fairly significant measures in September 2020. In October 2020, the number of new deaths increases and the USA returns to the critical class where the number of deaths is high compared to a low number of new cases. In November 2020, the pandemic reaches the panic situation for the first time with a very high number of new deaths compared to the number of new cases, which itself is quite large. It remains there for four months in a row, therefore until February 2021. In March 2021, the number of new cases decreases but the number of new deaths remains quite high compared to the number of new cases until June 2021. In June 2021, the situation becomes less worrying as the United States moves to the preventive cluster (purple) and to a situation where the undertaken measures remain quite significant with a low number of new cases and new deaths. Then, in July 2021, the United States moves to a comforting situation with even weaker measures. Just after July 2021, in August 2021, the United States returns to a worrying class and then remains for two months in the red group, the panic class with a very high number of new deaths compared to the number of new cases. The USA remains in the worrying class for two months then reaches a new peak of panic for two months and finally in April 2022, the USA witnesses an easing of the COVID-19 situation with its entry into a preventive situation with a low number of new cases and new deaths while the stringency index is high. Generally, for the United States, it is noted that the crisis only really begins in July 2020. It is also noted that there are few blue and purple classes since the outbreak of this crisis, meaning that the situations of rest between a wave and another wave are very short or that the situation has never been calm enough until April 2022. The absence of a green class, which constitutes the control situation class where the number of new deaths is limited according to the number of new cases, is also noted.
As for Sweden, during the first two months of February and March 2020, the epidemic does not arrive yet. In April 2020, Sweden moves directly into the red class, the panic class, where the number of new deaths exceeds the number of new cases. Sweden remains in this class for two months and transitions to the yellow class in June 2020, representing a class that is still quite critical but where the number of new cases has decreased. From July 2020, Sweden shifts to the purple group where significant prevention measures are established, but the numbers of new cases and new deaths are quite low. Sweden stays in this cluster for four months until November 2020 when it returns to the yellow class with quite low new cases but a significant number of new deaths. Then, in December 2020 and January 2021, Sweden is in the red group, the panic class, until February 2021 when it moves to the class where the panic is less intense with a smaller number of new cases. Sweden then moves to the purple class, the preventive class, in March 2021, and to the yellow, the worrying class, in April 2021. In May 2021, Sweden is in the preventive situation. After that, there seems to be an eradication of the epidemic in this country as Sweden mainly remains in the blue class until April 2022. Indeed, the only class change from June 2021 occurs in January 2022, when Sweden makes a brief passage through the green cluster which characterizes the control of the number of new deaths compared to the number of new cases, which is quite high. Thus, even when there is a high level of contamination and therefore a large number of new cases of COVID-19, Sweden manages the crisis and limits the damages. What characterizes the evolution of the epidemic in Sweden is that this country quickly limits the damages caused by COVID. From June 2021, the situation concerning COVID-19 is quite calm and the undertaken measures are no longer severe. Even when the number of new cases in January 2022 becomes very large, the number of new deaths remains quite low, indicating that the epidemic is well under control.
Unlike Sweden and the United States, Italy sees the epidemic break out from the second month considered, i.e., March 2020. Italy quickly moves to the red class, the panic class, with a very high number of new deaths compared to the very high number of new cases. Italy remains in this situation for the two months of March and April 2020. In May 2020, the precautions taken are significant, but the numbers of new cases and new deaths weaken, and therefore Italy enters the preventive class and stays there for about six months, until October 2020. In November 2020, Italy reverts to the red group, the panic class, and stays there for three months: November 2020, December 2020, and January 2021. In February 2021, Italy enters the worrying class, the yellow class, with a considerably high number of new deaths compared to the new cases, which are still low. The situation worsens in March and April 2021, with Italy entering the panic situation with a higher number of new deaths than the number of new cases, which is significant. In May 2021, Italy is in the yellow worrying class, i.e., the class where the number of new deaths is high despite the number of new cases being low. Then, the number of new deaths decreases in May 2021, allowing Italy to move in June 2021 to the purple preventive situation where the stringency index is quite high, but the numbers of new cases and new deaths are quite low. Italy stays in this group until December 2021 and then moves to the green class in January 2022 with the very low number of deaths compared to the number of new cases, which is very high. In February and March 2022, Italy returns to the preventive situation where the undertaken measures remain important while the number of new cases and new deaths are low. Finally, in April 2022, Italy moves to the yellow class where the situation starts to be a little worrying again. The particularity of Italy is that it quickly enters into a panic situation from the second month of the crisis: March 2020. Additionally, the measures taken remain quite significant throughout the entire COVID-19 period until April 2022. Since the beginning of the crisis in Italy, there are no comforting situations (blue); the calmest groups are the preventive situations (purple) with significant importance for the conditions and measures taken. We conclude that even during periods of rest, the undertaken measures in Italy remain quite rigid. We also notice that even after passing through a control class in January 2022 where the number of new deaths is limited compared to the number of new cases that is very high, it appears that the number of new deaths starts again to be considerably large compared to the number of new cases in April 2022.
For France, as for Italy, the epidemic becomes visible from March 2020. However, in France, the arrival in the panic situation is gradual as there is a passage through the worrying class in yellow in March 2020. In April 2020, France enters a situation of panic with a very high number of new deaths compared to the number of new cases. In May and June 2020, France controls the situation and is in a preventive situation where the measures taken are significant but the numbers of new cases and new deaths are quite low. In July, August, and September 2020, France remains in a calm, comforting situation where even the measures are relaxed. In October 2020, France returns to the yellow cluster where the number of new deaths is quite high compared to the number of new cases which remains quite low. In November 2020, France reaches the panic situation and stays in this situation for six months in a row until April 2021. The number of new cases then decreases, allowing France to switch to the yellow class during May 2021 and then to the blue class in July 2021 with a comfortable situation where even the measures are weakened. This peaceful situation does not last very long and in August 2021, France returns to the yellow class, a worrying class. However, the situation is quickly taken care of, and France returns to the purple group of prevention with the presence of measures with quite low numbers of new cases and new deaths. France stays in this purple group until November 2021 and then moves into the worrying yellow cluster in December 2021 with a number of new deaths which is considerably large compared to the number of new cases which is quite low. In January 2022, we see the emergence of the green class which indicates a situation of control of the number of new deaths compared to the number of new cases, so even if the number of new cases becomes very important, the number of new deaths remains weak. In February and March 2022, France is in the worrying yellow class but returns to a light blue comforting group in April 2022 where even the measures are abandoned. The evolution of COVID-19 in France from February 2020 to April 2022 is characterized by a long period in a situation of panic which extends from November 2020 until April 2021. During this period, the number of new deaths is very high compared to the number of new cases. In addition, we see that after each yellow period and therefore after each period where the number of new cases is considerably low, France goes directly to a light blue class where the conditions and measures taken are abandoned. We can then say that, unlike Italy which remains vigilant by constantly implementing measures throughout the pandemic, France lifts the conditions as soon as the health situation seems to improve.
By making a general comparison of the evolution of COVID-19 between the countries of Europe and the United States, we see that over the last year considered, from April 2021 to April 2022, the three European countries considered do not enter the panic situation (red) at all times. This red cluster indicates the worst COVID-19 situation with more new deaths than new cases. For Italy and France, even after entering the yellow class, a worrying class, these countries quickly master the COVID-19 situation. However, looking at the evolution of COVID-19 in the United States for the period from April 2021 to April 2022, we notice that the USA remains in the most serious panic situation for four months (September 2021, October 2021, January 2022, February 2022) and in general, after each worrying situation (yellow), there is a transition to the panic group (red), which represents the most serious case. Also, as noted earlier, the United States never enters the epidemic control situation that indicates good control of the number of new deaths relative to the number of new cases. There also appears to be a lag of about three to four months between the start of the COVID-19 pandemic in Europe and the start of the crisis in the United States, which could partly explain the occurrence of serious situations in the USA until April 2022, unlike in Europe.