Preprint
Article

Spatio-Temporal Patterns of the SARS-CoV-2 Epidemic in Germany

Altmetrics

Downloads

129

Views

86

Comments

0

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

21 June 2023

Posted:

22 June 2023

You are already at the latest version

Alerts
Abstract
Results from an explorative study revealing spatio-temporal patterns of the SARS-CoV-2/COVID-19 epidemic in Germany are presented. We dispense with contestable model assumptions and show the intrinsic spatio-temporal patterns of the epidemic dynamics. The analysis is based on COVID-19 incidence data, which are age-stratified and spatially resolved at the county level, provided by the Federal Government’s Public Health Institute of Germany (RKI) for public use. Although the 400 county-related incidence time series show enormous heterogeneity both with respect to temporal features as well as spatial distributions, the counties’ incidence curves organise into well distinguished clusters that coincide with East and West Germany. The analysis is based on dimensionality reduction, multidimensional scaling, network analysis, and diversity measures. Dynamical changes are captured by means of difference-in-difference methods which are related to fold changes of the effective reproduction numbers. The age-related dynamical patterns suggest a considerably stronger impact of children, adolescents and seniors on the epidemic activity than previously expected. Besides these concrete interpretations, the work mainly aims at providing an atlas for spatio-temporal patterns of the epidemic which serves as a basis to be further explored with the expertise from different disciplines, particularly sociology and policy makers. The study should also be understood as a methodological contribution to getting a handle on the unusual complexity of the COVID-19 pandemic.
Keywords: 
Subject: Biology and Life Sciences  -   Other

1. Introduction

For about 3 years, beginning in September 2019, the SARS-CoV-2/COVID-19 pandemic shook the world in an almost unprecedented way ([1]). Three years after the outbreak, it is still not clear whether the pandemic entered an endemic phase and how long a crisis-like situation will persist or re-emerge ([2]). There are still too many unknowns to be able to give clear prognoses, although the flood of COVID-19 related publications is without example ([3,4]). Equally unprecedented is the fact that a high proportion of the literature dealing with the pandemic is meta-scientific and/or meta-bibliographic in nature or belongs to the sociology of behaviour or the sociology of science. Topics include discussions of malicious or accidental miscommunication even within the scientific context ([5]). Contributions address misconduct, but are themselves often characterized by sheer polemic, if not denunciation. This fact may best be summarised that we face a harsh "COVID-19 infodemic" ([6]). No surprise that a number of surveys and studies clearly points to a polarisation and radicalisation in public attitudes and behaviour driven by a polarisation in elite rhetoric that hinders effective responses to the COVID-19 crisis ([7]) and gives rise to an increasing social Darwinism ([8]). Thus, health behaviour is increasingly driven by political ideology ([9,10,11]) such that the resulting epidemic dynamics have become almost unpredictable and uncontrollable.
In this paper, we take a closer look at the spatio-temporal dynamics of the epidemic in Germany. Due to the aforementioned socio-behavioural imponderabilities, the problem is inherently systemic and adaptive in that sense, that preventive measures including their associated compliances and epidemic activity are bidirectionally related via nonlinear feedback loops. It follows, even if containment measures were precisely datable, one could not rule out the possibility that they would ultimately be counteracted and even be changed to the opposite. The notation of a "self-disorganisation" suggests itself. We here take the stance that the recorded, more or less objective incidences, should speak for themselves since most of the aforementioned socio-behavioural determinants defy quantifiability. We aim at presenting suitably quantified spatio-temporal patterns of the German epidemic activity in terms of features of weekly recorded incidence time series at the relatively fine-grained spatial resolution of German districts. In essence, our approach is explorative in that we invert the search direction: striking spatio-temporal incidence patterns provide timestamps and spatial clues that point to external determinants that trigger changes and allow for associations with prevailing social conditions.
"Systems thinking" is a collective term for very different conceptions of complex self-(dis)organised systems ([12]), even if, strictly speaking, the term self-organisation was only used extensively in the context of synergetics ([13]). The scientific nature of systems thinking has been questioned and it has, apparently, been conceived as a hermeneutic process ([14,15]), thus belonging to the context of discovery rather than to the context of justification in terms of Reichenbach’s partition of the epistemic process ([16]). The pressing need to better understand the COVID-19 pandemic, and the sheer lack of convincing and actionable evaluations, encouraged us to place greater emphasis on the benefits of systems thinking. It is due to the very nature of complex systems that no unique optimal tool exists for their analysis. Each analysis tool allows to take a particular look at the system and the combination of such analyses reveals the different facets of the given complexity. Specifically, we derive diversity measures based on entropy calculations, but also apply more recently introduced methods from the field of network analysis. These include methods of dimensionality reduction, multidimensional scaling, as well as cluster analyses. Taken together, we try to provide as comprehensive a picture of the complexity of the pandemic as possible, using only available incidence data. Some facets of the complexity of COVID-19 epidemic have been published recently ([17,18]) and we ask the readers to combine these previous results mainly focusing on temporal features with the new findings presented here, which put more emphasis on spatial patterns.

2. Materials and Methods

Throughout the paper, statistical calculations and creation of graphs were done with R ([19]). Used R program packages are mentioned at the appropriate places.

2.1. General Settings and Nomenclature

In the following, contextually either the 400 German rural/urban districts (counties) or the 16 federal states are labelled by index i = 1 , , 400 or i = 1 , , 16 , respectively. In graphs, however, federal states are represented by the official 2-letter abbreviation (see the list of abbreviations in the appendix). The age-specific incidences,
I i ( a , t ) = c o u n t s i ( a , t ) p o p s i z e i ( a ) ,
given by registered counts, c o u n t s i ( a , t ) per sub-population size, p o p s i z e i ( a ) , at time point t of counties i = 1 , , 400 (or, depending on context, federal states i = 1 , , 16 ) have been calculated from the counts retrieved from the Robert Koch-Institute (RKI) database ([20]) and from the respective age-specific sub-population sizes retrieved from [21]. Depending on the context, age a either refers to the age classes (in years) [ 0 , 5 ) , [ 5 , 12 ) , [ 12 , 18 ) , [ 18 , 30 ) , [ 30 , 40 ) , , [ 70 , 80 ) , 80 + or to k i d s : = [ 0 , 18 ) , a d u l t s : = [ 18 , 60 ) , s e n i o r s : = 60 + , respectively, and time t is given by calendar date in weekly steps from January 2020 through end of August 2022, i.e. we have t = 1 , , 135 time steps. Of note, Berlin is sub-divided into 12 administrative districts. Although COVID-19 counts are separately listed in the RKI database with respect to these districts, we use aggregated data for Berlin constrained by the database structure of the Census Bureau. Berlin as well as Hamburg thus function both as a single county and as a federal state.

2.2. t-sne

The "t-distributed stochastic neighbour embedding," in short t-sne, is a commonly applied dimensionality reduction algorithm ([22]) with a precursor, called "SNE", introduce by [23]. For a brief outline, assume that each of the 400 incidence time series corresponding to the 400 German districts is represented as a point in a 135-dimensional vector space with 135 being the length of the weekly sampled time series. SNE, and likewise t-sne, as applied here, reduces the 135-dimensional to a 2-dimensional vector space while preserving proximity of data points based on a distance or similarity measure. Such a reduction in dimensionality needs to specify a so-called p e r p l e x i t y parameter, which, in a nutshell, reflects the users taste of how quickly similarity should fade out with increasing distance, i.e., which area is conceived as neighbourhood. This trick allows for a visual inspection of clustering patterns in two dimensions, if any. Calculations are based on algorithms provided by [24].

2.3. UMAP and PCA

"Uniform manifold approximation and projection," in short UMAP, has been introduced by [25] as a dimensionality reduction algorithm that out-performs t-sne by means of preserving global structures, at least such is the claim. However, comparative studies do not allow a definitive conclusion. Analogously to t-sne, UMAP necessitates a free parameter, called n n e i g h b o r s , to be set, which defines a custom range of neighbourhood. Both algorithms compete with the well-known method of principal component decomposition/analysis (PCA) ([26]), which seeks for an optimal explanation of variability after the decomposition while preserving the overall variance in the data. We present the results from applications to the COVID-19 time series of all three algorithms side by side and conceive this as a sensitivity analysis. Used algorithms are provided by [27,28]. The PCA algorithm is endowed with the possibility to calculate normal data ellipses around a set of predefined data points, which are assumed to constitute clusters. Therefore, an approximate (pseudo) confidence measure for the separability of clusters is available.

2.4. Correlation Matrix and Hierarchical Clustering

Calculating pairwise correlation coefficients of the 16 federal state-specific incidence time series can be conceived as a reduction to a 1-dimensional space since correlation is just a specific similarity measure. Along with hierarchical clustering, a visualisation of the correlation matrix should essentially yield the same information as a two-dimensional reduction, as long as specific structures that can only be recognised by a specific algorithm are absent. Both Pearson as well as Kendall correlations are calculated since it is common place, that Pearson is good in recognising linear correlations whereas Kendall can also be applied to nonlinearly correlated data vectors. Thus, a comparison of the results after the application of all suggested methods will decidedly bring added value in interpreting the dynamic hallmarks of the epidemic. The correlation plots are created using the R-package provided by [29]. Hierarchical cluster analysis is based on Ward’s minimum variance method, which aims at finding compact, spherical clusters. Specifically, we use the "ward.D2" method, which means that dissimilarities are squared before cluster updating, according the package manual.

2.5. Multidimensional Scaling and Network Graphs

Multidimensional scaling (MDS) is the umbrella term of a family of dimensionality reduction algorithms. MDS aims at preserving distances or, conversely, proximities between data points. Note that this differs from the t-sne neighborhood embedding, which clusters neighboured points tightly in order to clearly visually separate the clusters. Here, we exclusively use non-metric MDS based on spline transformations. In other words, the spline function f transforms dissimilarities δ i j to disparities d ^ i j via d ^ i j = f ( δ i j ) . Corresponding dissimilarities in the low-dimensional space are then found by minimising a so called stress function, which in essence is a function of the difference between disparities in the original and the reduced space. For details cf. [30].
Recently, MDS is found to be combined with graph visualisation with increasing popularity. Based on a scalar measure of proximity between any pair of data points (here time series), as e.g. correlation coefficients, the data points can still be represented in two dimensions in the form of nodes (or vertices), where the scalar similarities determine the strength of the connecting edges. The spatial arrangement of the nodes can then be based on similarities calculated by minimising the corresponding stress function. Other, usually ambiguous arrangements are in use, which can be constrained by the requirement of having non-overlapping nodes. Also popular is the simple arrangement of nodes into a circle, whereby nodes assumed to share a cluster can be arranged such that they are adjacently located on the circumference.
Although this type of presentation in form of a graph visualisation is suggestive to the eye of the beholder and therefore prone to misinterpretation, it can flank the exploratory approach if one is aware of the pitfalls. We present graphs which are based on correlation coefficients or on distances supplied by a PCA, respectively. Moreover, after the application of a Gaussian graphical model using LASSO, a partial correlation network can be derived, which reduces the number and strengths of the edges of the graph to a relevant magnitude. Please cf. [30,31] for a detailed description of the graph visualisation methods used here, including the Gaussian graphical model, referred to as "graphical LASSO."
Network graphs are commonly published together with so called centrality indicators. Network strength, sometimes also referred to as degree centrality, assigns an importance score to each node/vertex, which is, in our application, the sum of pairwise absolute values of correlations of the given to all other nodes. Betweenness centrality measures quantify how strong given nodes build bridges between other pairs of nodes. Closeness centrality scores each node based on their strengths to all other nodes in the network. It is worth of note, that both betweenness and closeness do not differ strongly from degree centrality, if the similarity of the nodes is measured in terms of correlations. However, we will report these centrality measures for the sake of completeness. Expected influence, occasionally called eigen centrality, measures a node’s influence onto the entire network. If correlations are used for the quantification of similarities, the expected influence differs from network strength only if positive and negative correlations are simultaneously present in the network, since expected influence does not use absolute values when summing up the correlations.

2.6. Spatial Heterogeneity

Spatial Shannon entropy at time t is given by
S ( a , t ) = i = 1 400 I i ( a , t ) i = 1 400 I i ( a , t ) ln I i ( a , t ) i = 1 400 I i ( a , t ) ,
from which a measure of diversity (or spatial heterogeneity), given by
D ( a , t ) = e S ( a , t ) 400 , with   0 D ( a , t ) 1 ,
can be calculated. The upper limit of 400 in the summation refers to the number of districts. However, in order to detect possible differences between East and West Germany, the summation will also be restricted to either the 75 East German or to the 325 West German districts. For details on interpreting the diversity function confer [32], and for an analogous application within an epidemiological context see [33]. Briefly, for a given age class a at time point t, equal incidences over all counties gives maximum entropy hence maximum diversity D ( a , t ) = 1 , however, generally D ( a , t ) < 1 since a synchronisation of the epidemic activity across districts appears to be unlikely. Particularly in the beginning of the epidemic with one or a few number of early index cases located within one or a few number of districts, D ( a , t ) will be close to 0. Over the course of time, intervals with a more or less homogeneous distribution of incidences across counties will probably alternate with asynchronous epidemic activities, as a consequence of spatially unequally distributed index cases of new epidemic waves and differences in socio-behavioural conditions. Here, we focus on trends and abstain from presenting statistical significance, i.e., confidence intervals for D ( a , t ) are ignored. In this regard, the reported incidence data may substantially deviate from true incidences such that confidence intervals would give rise to spurious certainty.
The following measure serves to capture changes in the dynamics of the epidemic in district/federal state i,
Δ Δ I i ( t ) = I i ( t ) I i ( t 1 ) I i ( t 1 ) I i ( t 2 ) ,
i.e. fold changes of weekly fold changes in incidence, which has also been used in [34]. Equation 4 goes to show the similarity to the weekly fold change in the effective reproduction number R i ( t ) / R i ( t 1 ) of area i. Taking the logarithm yields
ln Δ Δ I i ( t ) = ln I i ( t ) 2 ln I i ( t 1 ) + ln I i ( t 2 ) ,
which is used in the following due to its favourable symmetry with respect to zero. For convenience, ln R i ( t ) / R i ( t 1 ) and ln Δ Δ I i ( t ) is used interchangeably, whereby the first version can straightforwardly be extended to comparisons of two areas i and j at a given point in time, i.e. ln R i ( t ) / R j ( t ) (for details cf. Equation 7 below).
With the exception of taking logarithms, Equation 5 bears resemblance to the so called difference-in-difference method, which has frequently been used to identify causal effects of COVID-19 non-pharmaceutical interventions (cf. [35] for a review of the method and [36] for a systematic review of applications within the scope of COVID-19). Within the latter context, the counterfactual difference-in-difference method is applied to a setting which is assumed to be quasi-experimental in nature. If I ¯ i p r e and I ¯ i p o s t denote average incidences taken over a period before or after a containment measure has been mandated in district/federal state i, respectively, then
β ^ = I ¯ i p r e I ¯ i p o s t I ¯ j p r e I ¯ j p o s t
measures the effect of the containment action when j refers to a district/federal state without a corresponding mandate. Of note, Equation 6 does yield a reliable result if and only if areas i and j are "structurally" comparable, i.e., if a common trend assumption (constant underlying differences) holds (for details cf. [35]).
Supposedly, the individual counties exhibit individual epidemic dynamics, in particular as a consequence of different (starting and stopping of) containment strategies, but also due to inherent socio-structural conditions, hence creating and amplifying spatial heterogeneity. Equation 5, therefore, serves as an auto-difference-in-difference method to detect dynamical change points. It appears plausible that a district remains structurally "self-similar" over time such that the auto-difference-in-difference is even more valid than the between-counties counterpart.
Specifically, as a consequence of the previous remarks, a special application of Equation 5 reads
ln R i ( t ) R j ( t ) = ln I i ( t ) ln I i ( t 1 ) ln I j ( t ) ln I j ( t 1 ) ,
which goes to show the difference in the dynamics of two distinct areas at a given time point, hence defining a "cross-difference-in-difference." If, e.g., two structurally similar districts i and j both follow exactly the same non-pharmaceutic intervention schedule, Equation 7 should then yield a time series constantly close to zero.
To complete, we use cross-correlation analyses based on Kendall’s correlation coefficient in order to quantify mutual associations of the dynamical patterns of areas (counties or federal states, respectively) expressed via the associated auto-difference-in-difference time series. Hierarchical clustering is applied in "Ward.D2" mode.

3. Results

3.1. German SARS-CoV-2 Epidemic Activity Geographically Clusters Into East and West

3.1.1. Allowing for a Visual Exploration Through Dimensionality Reduction

Dimensionality reduction of the 400 county-specific incidence time series leads to patterns within the 2-dimensional target space as depicted in Figure 1. Remarkably, three different commonly used reduction algorithms basically yield the same pattern: a clear separation into an Eastern and a Western German cluster can be observed. Panels A and C each show a decomposition into two principal components. The two panels differ only in the choice of subsets for which normal data ellipses have been calculated. Panel A shows normal data ellipses for the two subsets of counties that belong to West and East Germany, respectively, whereas panel C does show the ellipses for subsets corresponding to the 16 federal states separately. The locations of the ellipses belonging to the five Eastern German States can clearly be distinguished from the ellipses corresponding to Western German States. Although the clusters slightly overlap, the East-West dichotomy is clearly visible.
This result also applies to the right panels of Figure 1, which show t-sne (panel B) and UMAP (panel D) transformations, respectively. A few data points corresponding to Eastern German counties are located at the periphery of the Western German cluster, however, the two centers of mass are clearly separated.

3.1.2. Canonical Correlation Analysis Provides Added Values to the Findings

The aforementioned results can be confirmed by applying a canonical correlation analysis. Correlation matrices corresponding to Pearson’s and Kendall’s correlation coefficients, respectively, are depicted in Figure 2. Also shown are two clusters for each correlation matrix resulting from Ward’s hierarchical cluster analysis. Application of the very same cluster algorithm does lead to a strict separation of Eastern and Western German States when being applied to Kendall’s correlation matrix (lower panel) in contrast to the application to Pearson’s correlation matrix (upper panel). Specifically, hierarchical cluster analysis following a linear correlation analysis leads to the allocation of the two West German states Bavaria (BY) and Baden-Wuerttemberg (BW) to the cluster otherwise dominated by East German states. As we learned from dimensionality reduction above, points corresponding to counties belonging to BY and BW, respectively, are located at the interface between East and West German clusters (cf. Figure 1). Since the pairwise correlations of incidence time series cannot be expected to be strictly linear, the hierarchical clustering following Kendall’s correlation appears to be more convincing. It is compatible with what we learned from visual inspection of Figure 1.

3.1.3. Consolidation of the Observed Clusters Through Network Visualisation

Remarkably, the previous conclusions can also be confirmed in the form of network visualisations (Figure 3 and Figure 4) when being applied to incidence data aggregated to the federal state level. The network shown in Figure 3 results from an MDS based on Kendall’s correlation coefficients. Specifically, the edges between the vertices (federal states) represent partial (Kendall) correlations derived from graphical LASSO (correlation strength mapped to line width, positive correlations are colored green, negative red, respectively). Spatial arrangement results from similarities also calculated from Kendall’s correlation coefficients, i.e. from the corresponding MDS. Once again, Eastern and Western German states are clearly separated into well-distinct clusters.
Basing the spatial arrangement on principal components instead of correlations yields the network structure depicted in Figure 4. The graph clearly tells us that one principal component would be sufficient to explain the variability of the time series. The nodes belonging to Western German states relatively tightly cluster at one end of this component, whereas the nodes representing the Eastern States extend over a greater length but are still well-separated from the Western German cluster. Thickness and colour of edges obey the same rules as in Figure 3. Since PCA is based on a reduction that optimizes variability, it may be conceived as the most evident result, taking into account, however, that it results from linear modelling constraint by the corresponding assumptions. In summary, the results from different approaches to dimensionality reduction are largely in agreement. Thus far, a striking difference in the epidemic dynamics of East and West Germany can safely be concluded.
To conclude this section, four commonly communicated centrality indicators are presented in Figure 5. Of note, these indicators are identical for the two networks presented in Figure 3 and Figure 4 since both networks are based on the same correlation matrix. All four indicators are similar, which is not unusual for the given case of connections of nodes, which are not strictly binary (connected vs not connected). Strength of a given node is simply the sum of the absolute values of all pairwise correlation coefficients of this node within the entire network. However, the x-axes of all four centrality indicators are transformed to z-scores rather than showing the raw values, which allows for a straightforward comparison. Obviously, expected influence and strength make not much of a difference. Slight differences between strength and expected influence, as can be observed e.g. for NW, stem from a small number of low magnitude negative correlations considered when calculating expected influence. Assume given a particular index node that sits between two nodes, which are not strongly correlated, then a strong betweenness entails strong correlations for the index node with respect to the neighbored nodes. As an example, Saxony-Anhalt (ST) does strongly correlate with both MV and TH, but without a substantial correlation between the latter two federal states (cf. Figure 3). This substantially increases betweenness of ST. However, overall betweenness and strength do not differ substantially for most of the nodes. The same is true for closeness.
In summary, the indices may not contribute very much to the understanding of the prevailing dynamics, nevertheless the reader should not be deprived of the information they provide - according to the motto, no result is also a result.

3.2. Variability of County-Specific Fold Changes in Reproduction Numbers Correlates With Spatial Heterogeneity

SARS-CoV-2 spatio-temporal heterogeneity in Germany is depicted in different facets in Figure 6. To start with, age-independent incidence time series of all 400 German rural/urban districts (counties) (Equation 1 aggregated over all age classes) by and large follow the same wave-like shape as observed for the pan-German incidence curve (Equation 1 aggregated over all age classes and over all counties), however, exhibiting rather wide variations in magnitude (see Figure 6A). Even on the level of the 16 federal states, the individual curves deviate considerably from each other. The ranges depicted in Figure 6A give a vivid expression.
The variations of county-specific incidence curves are the result of continual dynamical changes expressed by a dense series of spikes of weekly fold changes in the instantaneous effective reproduction numbers as calculated from Equation 5 on the pan-German as well as on the county and federal state levels, respectively, as shown in Figure 6B. Hardly surprising, these differences in oscillatory patterns in the time domain show up as spatial heterogeneity expressed via the diversity measure calculated from Equation 3 as shown in Figure 6C. A non-constant heterogeneity over time points to a residual coherence between the auto-difference-in-difference time courses. Thereby, spatial heterogeneity has been calculated and is presented both with and without stratification by three age classes (i.e., kids ( a g e < 18 y ), adults ( 18 y a g e < 60 y ), seniors ( a g e 60 y ), all ages).
As expected, at the outset of the German COVID-19 epidemic for all age classes, spatial diversity starts at a value close to zero, followed by a rather steep increase roughly within the first 4 to 6 epidemic weeks (Figure 6C). For all three age classes, the diversity curves show relatively sharp and short-lived dips that begin shortly before the respective holiday periods of the 3 observation years. The slump is particularly pronounced in 2020, only slightly smaller in 2021 and more moderate in 2022. These dips, at least for 2020 and 2021, coincide with low incidence periods (Figure 6A) but also with periods of high relative changes in reproduction numbers (Figure 6B). This behaviour also holds for some smaller intermittent drops in diversity.
Indeed, the logarithm of the age-independent SARS-CoV-2 incidence time course (Figure 6A) strongly correlates with age-independent diversity (Figure 6C) yielding Kendall coefficient 0.74 ( p < 0.001 ). Likewise, the Kendall coefficient of correlation between the time course of the logarithmised range of the age-independent auto-difference-in-difference taken over the counties (Figure 6B) and age-independent diversity (Figure 6C) amounts to 0.6 with p < 0.001 . Thereby, the first 9 weeks have been removed from the time series due to unreliable diversity estimates at the outset of the epidemic. The activity in 2022 is somewhat out of line overall, although the indicated behaviour continues at least moderately.

3.3. Spatial Homogeneity of Child Incidence But Increased Overall Heterogeneity in the East

Remarkably, the diversity curve corresponding to the youngest age class (children and juveniles), remains on top of the two other curves which correspond to the adults and seniors during the course of time until the curves apparently converge towards the end of 2021. From roughly April 2022 onward, diversity corresponding to the adult age class starts again to drop and, therefore, diverges from the two other curves which remain in-phase at almost identical magnitudes.
If we calculate the diversities for West and East separately, we obtain the same ordering pattern by age group (not explicitly shown). However, the diversity curves for West Germany are larger in magnitude than those for East Germany over almost the entire time course. The difference of diversity between West and East is depicted in Figure 6D, which remains positive most of the time. To summarise the findings so far, increased variability of county-specific fold changes in reproduction numbers correlates with increased spatial heterogeneity and coincides with a drop in incidence. East German COVID-19 incidence exhibits a considerably stronger spatial heterogeneity than observed for the West.

3.4. Decreasing Trend in Fold Changes in Reproduction Numbers

A closer look at the courses of age-independent rural state-specific fold changes in reproduction numbers calculated according to Equation 5 reveals a clear overall trend in decreasing magnitudes (Figure 7). Although later "epidemic waves" have much more pronounced magnitudes than early "waves" (cf. Figure 6A), their rates of change appear to be more moderate. The 16 rural state-specific auto-difference-in-difference curves are depicted in Figure 7A along with a 5-week-windowed envelope. More concrete, at each point in time, t, the envelope is showing maximum and minimum values within the window time [ t 5 , t + 5 ] (in weeks). The evolution of density curves as shown in Figure 7B further clarifies this converging behaviour. The gradual narrowing of the densities, calculated per quarter, is striking.
The district of Heinsberg, located in North Rhein-Westfalia (NW), is known for the first "super-spreading" event which arguably sparked the COVID-19 epidemic in Germany (cf. [37]). Unsurprisingly, after a quick rise of numbers of infected individuals, the first attempts to mitigate the epidemic by means of lockdown orders took effect in NW which at least partially explains the strong acceleration and deceleration during the first weeks of the epidemic. A similar pattern can be observed for the second smallest federal state Saarland (SL) and the second most populous federal state Bavaria (BY). The latter state is known to have seen the first SARS-CoV-2 index case (cf. [38]), although without super-spreading event. Some states, particularly East German states as MV, ST, TH, BB, SN (cf. the list of abbreviations), show moderate de-/accelerations during the first wave but more pronounced changes in reproduction numbers during the second wave. In Baden-Wuerttemberg (BW) the amplitude of fold changes in reproduction number remained strikingly low and approximately constant during the entire epidemic. Apparently, social conduct did not change considerably in BW during the epidemic, although this is speculation.
To summarise, a common trend in the long run over all German federal states of decreasing amplitudes of fold changes in reproduction numbers can be observed, however, there are state-specific differences with respect to intermittent bursts.

3.5. Pronounced Fold Changes in Reproduction Numbers for the Younger and the Elder Cohorts

Throwing a glance onto age-stratified time courses of auto-difference-in-difference reveals striking age-dependent differences. Figure 8 depicts the auto-difference-in-difference curves corresponding to 5 arbitrarily chosen West German federal states separately for the 10 age classes. Likewise, the age-specific auto-difference-in-difference curves for the 5 East German federal states (Berlin excluded) are shown in Figure 9. The younger cohorts up to age 18 y and the elder from age 60 y upwards unveil strong amplitudes of fold changes in reproduction numbers whereas the corresponding amplitudes of the adult (medium aged) cohorts remain moderate throughout the epidemic. During the epidemic, enormous efforts were made to protect the elderly. It is therefore easy to understand that shelter-in-place or isolation orders showed greatest effect for the seniors compared to the employed people. At the same time, the political controversies led to an inconsistent and erratically changing set of rules and regulations. And this is just as true for the youngest cohort, the children and juveniles. School closing orders have been replaced by school opening orders in a discordant and somewhat haphazard fashion - a behaviour which has also been called "flying blind" ([39]). While the impact of these discordant rules on the epidemic activity is of course speculative, it can safely be stated up to this point that the de-/acceleration of this activity, i.e. the fold changes in reproduction speed, is by far more pronounce for both the young and the elder subpopulation, but not so for the medium aged adult cohort. In exactly this sense, kids and juveniles, as well as the seniors, are the driving factors of the epidemic, at least for the West German federal states. For it is the case that a striking difference between the West and the East German states can be observed. The differences between the medium-aged and the other (junior and senior) cohorts are less strong or even absent for the East German states. Within the West German set of states, Schleswig-Holstein (SH) is an exception in that the oscillation of auto-difference-in-difference resembles the corresponding East German patterns.
In summary, both the younger (up to 18 y ) and the elder ( 60 + y ) cohorts show stronger changes in SARS-CoV-2 reproduction numbers when being compared to the medium aged adult subpopulation. In this sense, kids, juveniles, and seniors drive the epidemic stronger than working adults. We hypothesise that containment and isolation measures are less actionable for working people. It is perhaps more difficult to explain the East German dynamic patterns, which appear to be much more similar with respect to age classes and, at the same time, show much more pronounced amplitudes for the fold changes of the reproduction number. This finding is compatible with our results above derived from cluster analyses. Our previous findings in [11] draw us to the conclusion that the well-known political and socio-structural differences between East and West Germany are proper surrogates for the underlying mechanisms.

3.6. Federal States Exhibit Dynamic Dissimilarities

As shown in the previous sections, the rates of change of the reproduction numbers calculated per age class and per federal state in the course of time do produce age-dependent patterns but appear also to exhibit state-dependent dynamical features. In this section, a closer look is taken at the differences and similarities resulting from state-by-state comparisons. Equation 7 is considered an appropriate time-dependent measure that captures mutual dynamic similarity. Indeed, Equation 7 can be conceived as a direct application of the original difference-in-difference concept. The logarithm of the ratio defined in Equation 7 is expected to yield constantly zero if the dynamical features of the two states under comparison are identical. Figure 10 depicts all pairwise comparisons with reference state North Rhine-Westfalia (NW). Obviously, all 15 comparator states have a markedly different dynamical patterns at the outset (roughly the first 3-6 months) of the epidemic, with the most extreme differences observed for SL and BY. For some of the comparator states, particularly HE, NI, and RP, the pronounced initial oscillation fades out to a very small amplitude. A look at Figure 2 reveals that the Kendall correlations of the incidence time series also go in the same direction, namely slightly smaller coefficients for the NW-SL and NW-BY correlations, compared to the other three pair correlations mentioned above. We hypothesise that the corresponding federal states exhibit a more coherent dynamical pattern. The corresponding curves for each of the remaining 15 reference states are shown in supplement S1.
For a more condensed or integral comparison, it is hypothesised that Kendall’s correlation coefficient is another valid quantification that captures dynamical similarity. The resulting correlation plot is shown in Figure 11. The correlation plot is ordered in hierarchical clustering mode using agglomeration method "Ward.D2" as before, constraint by two clusters. Changing the agglomeration method or the chosen number of clusters leads to a considerable fluctuation of clustering patterns (not shown). It turns out that the canonical pairwise correlations of auto-difference-in-difference curves are comparably less sensitive then the method defined by Equation 7 to detect shared dynamical patterns of the federal states via hierarchical clustering.

4. Discussion

Only a couple of months after the pandemic outbreak of SARS-CoV-2/COVID-19, RJ Klement presented a systemic picture including close to hundred components to build complex functional relations ([40]). Although Klement’s network of "causal" relations is of rather wide scope and included interactions within and between levels of organisation (i.e. macro-micro interactions), it is still preliminary and far from exhaustive, as the author confirms. Klement thus contributed to a hermeneutic discourse on the nature of the pandemic, i.e., to its better understanding, but the question remains how such a complex picture can be operationalised. In practice, science is thrown back on ambiguous reductionist views of a few interacting components. Unprecedentedly, the conception of countermeasures based on both political as well as scientific criteria, indeed the entire culture of communication, has been overshadowed by an intra-scientific scramble for interpretive sovereignty as a consequence of this ambiguity ([41]). This completely derailed communication culture undoubtedly itself added quite considerably to the pandemic-influencing mechanisms: an infodemic ([6]). At least this important cluster of nodes should be added to Klement’s "causal" network.
The strategy followed here can be described as an attempt to largely dispense with model assumptions and simply let the data speak for themselves. In this sense, the approach is non-parametric and largely descriptive. The limitation of this approach is at the same time its strength. No causal relationships, or even functional dependencies, are shown. An important note on this: causality is an a priori, i.e., a fundamental principle that cannot be derived from empiricism - not even from experimental empiricism. At best, a randomised controlled trial (RCT) provides evidence for functional dependencies under very specific conditions. Less evidence is attributed to results of regressions in the context of observational studies, even if the result thus obtained is highly self-evident. But be that as it may, at the end of the day, the conclusion and, in our opinion, already the choice of study design, including RCTs, is always already value-based (cf. [42]).
The semi-quantitative and broadly descriptive way of proceeding to assess age- and county-specific COVID-19 incidence time series recorded in Germany reveal informative patterns. Most striking is the fact, that the time series can be allocated to two geographic similarity classes or clusters which coincide with the geopolitical division in East and West Germany. This confirms findings based on regression analysis ([11]). The dependence of this clustering on the chosen measure of similarity and reduction algorithm could give rise to a possible critique. However, this criticism can be countered by the fact that 4 different methods produce the same results.
Spatial heterogeneity of COVID-19 incidence is waxing and waning in the course of time for each age class, however, remains lowest for the youngest age class (children, adolescents) during the entire time course. To interpret this phenomenon as a spatially homogeneous background incidence of the youngest age group may be going a bit too far, but a tendency in this direction is indicated. This view is supported by findings of increased seroprevalence among school-aged children (cf. e.g. [43,44,45,46] and references therein). Note, however, that a comparably lower virus load in children may lead to underdetection which in turn might also bias the calculation of spatial diversity. Furthermore, in East Germany we find a considerably larger spatial heterogeneity when being compared to West Germany. Seen in the light of marked geopolitical differences between East and West (cf. [10]), this result may appear less puzzling. In addition, it is in agreement with other spatio-temporal characteristics as discussed in the following.
Temporal acceleration patterns do differ both geographically as well as age-related. Generally, for all federal states the dynamical changes in terms of fold changes of reproduction number gradually fade out in the course of time, although some intermittent bursts can be observed. Particularly striking is the fact that the dynamics for Baden-Wuerttemberg (BW) are relatively flat over the entire observation period. In sharp contrast, Saarland (SL), a very small federal state, does show the largest variability in fold-changes of the reproduction number. The cross-difference-in-difference method applied to all pairs of state-dependent incidence time series reveals patters of similarities and dissimilarites that have to be discussed from the perspective of geopolitical differences, which is beyond the scope of this contribution. However, we point to the observation that North Rhine-Westphalia (NW) and Hesse (HE) do not only exhibit similarity both in terms of cross-difference-in-difference and Kendall correlation, but these two federal states are also located in the center of the Western German cluster derived from a network analysis (Figure 3). In this sense, NW and HE are "average states" with respect to the SARS-CoV-2/COVID-19 epidemic. The reader is encouraged to throw a glance on the series of figures in appendix S1. Among many other interesting patterns, Saarland (SL) exclusively shows very pronounced mutual cross-difference-in-difference curves and, therefore, makes SL unique in a certain sense.
In West Germany, children and juveniles as well as seniors do contribute more intensively to de-/acceleration of the epidemic spread when being compared to the middle-aged class. Such a difference cannot be observed in East Germany where all age classes equally strongly contribute to fold-changes of the reproduction number. This result is in agreement with what has been found in [18] using a different analytical approach which gives rise to the interpretation that children have a significantly greater influence as a driver of the pandemic than previously suspected.
A possible explanation is the relatively unsteady dynamics of introduction and withdrawal of containment measures for kids, particularly in schools and daycare facilities. The pronounced impact of adults in the East on dynamical changes when being compared with the West German population may be due to carelessness or differing socio-political attitudes. And, indeed, evidence is mounting on how influential sociocultural aspects and personal beliefs are in relation to epidemic activity. For a more profound discussion of this issue cf. [10,11,47,48,49,50,51]. Thus, with due caution we conclude that varying containment measures and their compliance, as well as regular occurrences like school vacations are much more instrumental to change the behaviour of non-adult people relevant for the control of the epidemic reproduction number. We hypothesise, however, that the effects of school openings are strongly confounded by the current local epidemic conditions, i.e. the current effective reproduction numbers, and, even more important, by prevailing preparedness, facilities and reliance at the schools and accountable local authorities. Thus, on the one hand, school opening can contribute to combat the epidemic in case of a quick detection of an infection and a proximate shelter-in-place order. On the other hand, school opening can worsen the situation in case of overwhelming infectivity and concurrent lack of preparedness. The pronounced irregularity observed for the epidemic dynamics corresponding to the young population teaches to shift the focus regarding control measures toward children and adolescents. Further research is needed to better understand the causes behind the observed irregularities.
Whether the observed local differences can be attributed to real differences in incidences or rather to locally different frequencies of testing, i.e. different numbers of undetected cases, is unclear (cf. [52]). This is an unavoidable and probably the strongest limitation of evaluations that refer to publicly available registered case numbers. The observed spatio-temporal intermittency, i.e. the irregular alternation of phases, are presumably to some extent caused by the time-dependent detection ratio. Moreover, the existence of a depensation effect leading to a "detection threshold" cannot be ruled out. In other words, during a low incidence period, disposition to test might be particularly low, which might in turn entail unrealistically many "zero events." To put this limitation in a slightly better light, we point out that the interpretations given here are only suggestions anyway. The observed dynamic patterns are objective, but the associated potential explanations, including changing vaccination rates and the like, are not at all.
Finally, the lack of reliable data on all types of contact regulations does definitely limit the explanatory power of our analysis. However, we are able to report intrinsic spatio-temporal patterns of the epidemic that now can be linked to all types of socio-cultural occurrences that are suspected to influence the transmission dynamics. We presented the results from an explorative study and want to conclude by highlighting the captivating advantage of such a study: we let the data speak and did not use contestable model assumptions.

5. Conclusions

We presented the results of an observational study of the German COVID-19 epidemic with all the intrinsic limitations of such a secondary data analysis. With due caution, we conclude that children and juveniles are the sub-populations which are predestined and most susceptible for successful applications of protective measures. This can be inferred from the age-dependent spatio-temporal patterns spotted in the COVID-19 epidemic activity. Furthermore, also inferred from the corresponding spatio-temporal analysis, there exists a clear East-West difference in the epidemiological dynamics in agreement with socio-structural differences reported elsewhere. Broadly speaking, the work provides a kind of atlas for spatio-temporal patterns of the epidemic, which now need to be interpreted with expertise from different disciplines. We encourage readers to combine the results presented here with those in [17] and [18] to obtain an even more comprehensive picture on COVID-19 pandemic dynamics. In particular, we urge sociologists and policy makers to associate the observed processes of change with both sociocultural characteristics of individual regions and local policy-making processes. Finally, we hope that our study can also make a methodological contribution to getting a handle on the unusual complexity of the COVID-19 pandemic.

Funding

This research received no external funding. We acknowledge support by the DFG Open Access Publication Funds of the Ruhr-Universität Bochum.

Institutional Review Board Statement

Not applicable

Informed Consent Statement

Not applicable

Data Availability Statement

Publicly available data provided by [20,21] have been used exclusively.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BB Brandenburg
BE Berlin
BW Baden Württemberg (Baden-Wuerttemberg)
BY Bayern (Bavaria)
HB Hansestadt Bremen (Hanseatic City of Bremen)
HE Hessen (Hesse)
HH Hansestadt Hamburg (Hanseatic City of Hamburg)
MV Mecklenburg-Vorpommern (Mecklenburg-Western Pomerania)
NI Niedersachsen (Lower Saxony)
NW Nord-Rhein-Westfalen (North Rhine-Westphalia)
RP Rheinland-Pfalz (Rhineland-Palatinate)
SH Schleswig-Holstein (Schleswig-Holstein)
SL Saarland
SN Sachsen (Saxony)
ST Sachsen-Anhalt (Saxony-Anhalt)
TH Thüringen (Thuringia)
Counties Land-/Stadtkreise (rural/urban districts),
local administrative districts (subdivisions of the federal states) in Germany
DE Deutschland (Germany)
EW East/West, used to label the categorical variable with values Eastern and Western
Germany, where East comprises the federal states BB, MV, SN, ST, TH.
Western Germany accounts for the remaining federal states.
FS, Fed. State Bundesland (federal state)
agegrp age group or age class
RKI Robert Koch Institute (Federal Government’s Public Health Institute of Germany)
MDS Multidimensional Scaling
PCA Principal Component Analysis

References

  1. Wang, C.; Horby, P.W.; Hayden, F.G.; Gao, G.F. A novel coronavirus outbreak of global health concern. The Lancet 2020, 395, 470–473. [Google Scholar] [CrossRef]
  2. Abu El Kheir-Mataria, W.; H., E.F.; Chun, S. Global health governance performance during COVID-19, what needs to be changed? A Delphi survey study. Global Health 2023, 19. [Google Scholar] [CrossRef]
  3. Brainard, J. Scientists are drowning in COVID-19 papers. Can new tools keep them afloat? Science Mag News 2020. [Google Scholar] [CrossRef]
  4. Ahmad, S.J.; Degiannis, K.; Borucki, J.; Pouwels, S.; Rawaf, D.L.; Head, M.; Li, C.H.; Archid, R.; Ahmed, A.R.; Lala, A.; Raza, W.; Mellor, K.; Wichmann, D.; Exadaktylos, A. The most influential COVID-19 articles: A systematic review. New Microbes and New Infections 2023, 52, 101094. [Google Scholar] [CrossRef] [PubMed]
  5. Maier, C.; Ankermann, T. Studienrückrufe: Fake News in Fachzeitschriften. Dtsch Arztebl International 2022, 119, A–116. [Google Scholar]
  6. The Lancet Infectious Diseases. The COVID-19 infodemic. The Lancet Infectious Diseases 2020, 20, 875. [Google Scholar] [CrossRef]
  7. Green, J.; Edgerton, J.; Naftel, D.; Shoub, K.; Cranmer, S.J. Elusive consensus: Polarization in elite communication on the COVID-19 pandemic. Science Advances 2020, 6, eabc2717. [Google Scholar] [CrossRef]
  8. Nachtwey, P.; Walther, E. Survival of the fittest in the pandemic age: Introducing disease-related social Darwinism. PLOS ONE 2023, 18, 1–20. [Google Scholar] [CrossRef]
  9. Geana, M.V.; Rabb, N.; Sloman, S. Walking the party line: The growing role of political ideology in shaping health behavior in the United States. SSM - Population Health 2021, 16, 100950. [Google Scholar] [CrossRef]
  10. Richter, C.; Wächter, M.; Reinecke, J.; Salheiser, A.; Quent, M.; Wjst, M. Politische Raumkultur als Verstärker der Corona-Pandemie? Einflussfaktoren auf die regionale Inzidenzentwicklung in Deutschland in der ersten und zweiten Pandemiewelle 2020. ZRex–Zeitschrift für Rechtsextremismusforschung 2021, pp. 191–211. [CrossRef]
  11. Qamar, A.I.; Gronwald, L.; Timmesfeld, N.; Diebner, H.H. Local socio-structural predictors of COVID-19 incidence in Germany. Frontiers in Public Health 2022, 10. [Google Scholar] [CrossRef]
  12. Emery, F. (Ed.) Systems Thinking; Penguin Books: Harmondsworth, 1969. [Google Scholar]
  13. Haken, H. Information and Self-Organization – A Macroscopic Approach to Complex System; Springer: Berlin, Heidelberg, 2006. [Google Scholar] [CrossRef]
  14. Diebner, H.H. Bilder sind komplexe Systeme und deren Interpretationen noch viel komplexer - Über die Verwandtschaft von Hermeneutik und Systemtheorie. In The Picture´s Image. Wissenschaftliche Visualisierung als Komposit; Hinterwaldner, I., Buschhaus, M., Eds.; Fink-Verlag: München, 2006; pp. 282–299. [Google Scholar]
  15. Diebner, H.H., Performative Science—Transgressions from Scientific to Artistic Practices and Reverse. In Complexity and Synergetics; Müller, S.C.; Plath, P.J.; Radons, G.; Fuchs, A., Eds.; Springer International Publishing: Cham, 2018; pp. 373–381. [CrossRef]
  16. Seo, M.; Chang, H., Context of Discovery and Context of Justification. In Encyclopedia of Science Education; Gunstone, R., Ed.; Springer Netherlands: Dordrecht, 2015; pp. 229–232. [CrossRef]
  17. Diebner, H.H.; Timmesfeld, N. Exploring COVID-19 Daily Records of Diagnosed Cases and Fatalities Based on Simple Nonparametric Methods. Infectious disease reports 2021, 13, 302–328. [Google Scholar] [CrossRef]
  18. Diebner, H.H. Phase Shift Between Age-Specific COVID-19 Incidence Curves Points to a Potential Epidemic Driver Function of Kids and Juveniles in Germany. medRxiv 2021. [Google Scholar] [CrossRef]
  19. R Core Team, R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023.
  20. Robert Koch-Institut. SurvStat@RKI 2.0, URL survstat.rki.de, accessed on 01 Sep 2022, 2022.
  21. Federal Statistical Office of Germany. German Census Data. URL: www.destatis.de, accessed on 02 Nov 2022, 2022.
  22. van der Maaten, L.; Hinton, G. Visualizing Data using t-SNE. Journal of Machine Learning Research 2008, 9, 2579–2605. [Google Scholar]
  23. Hinton, G.; Roweis, S. Stochastic Neighbor Embedding. In Advances in Neural Information Processing Systems; MIT press: Cambridge, MA, USA, 2002; Vol. 15, pp. 833–840. [Google Scholar]
  24. Krijthe, J.H. Rtsne: T-Distributed Stochastic Neighbor Embedding using Barnes-Hut Implementation, 2015. R package version 0.16.
  25. McInnes, L.; Healy, J.; Melville, J. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv 2018. [Google Scholar] [CrossRef]
  26. Hotelling, H. Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology 1933, 24, 417–441. [Google Scholar] [CrossRef]
  27. Konopka, T. umap: Uniform Manifold Approximation and Projection, 2023. R package version 0.2.10.0.
  28. William Revelle. psych: Procedures for Psychological, Psychometric, and Personality Research. Northwestern University, Evanston, Illinois, 2023. R package version 2.3.3.
  29. Wei, T.; Simko, V. R package ’corrplot’: Visualization of a Correlation Matrix, 2021. (Version 0.92).
  30. Mair, P.; Groenen, P.J.F.; de Leeuw, J. More on Multidimensional Scaling and Unfolding in R: smacof Version 2. Journal of Statistical Software 2022, 102, 1–47. [Google Scholar] [CrossRef]
  31. Epskamp, S.; Cramer, A.O.J.; Waldorp, L.J.; Schmittmann, V.D.; Borsboom, D. qgraph: Network Visualizations of Relationships in Psychometric Data. Journal of Statistical Software 2012, 48, 1–18. [Google Scholar] [CrossRef]
  32. Tuomisto, H. A consistent terminology for quantifying species diversity? Yes, it does exist. Oecologia 2010, 164, 853–860. [Google Scholar] [CrossRef]
  33. Diebner, H.H.; Kather, A.; Roeder, I.; de With, K. Mathematical basis for the assessment of antibiotic resistance and administrative counter-strategies. PloS one 2020, 15, e0238692. [Google Scholar] [CrossRef]
  34. Berger, U.; Fritz, C.; Kauermann, G. Schulschließungen oder Schulöffnung mit Testpflicht? Epidemiologisch-statistische Aspekte sprechen für Schulöffnungen mit verpflichtenden Tests. Report CODAG Bericht Nr. 14 vom 30.04.2021, Uni München, München, 2021.
  35. Goodman-Bacon, A.; Marcus, J. Using Difference-in-Differences to Identify Causal Effects of COVID-19 Policies. Survey Research Methods 2020, 14, 153–158. [Google Scholar] [CrossRef]
  36. Herby, J.; Jonung, L.; Hanke, S.H. A Literature Review and Meta-Analysis of the Effects of Lockdowns on COVID-19 Mortality. Technical report, Institute for Applied Economics, Global Health, and the Study of Business Enterprise at the Johns Hopkins University, Baltimore, 2022.
  37. Korencak, M.; Sivalingam, S.; Sahu, A.; Dressen, D.; Schmidt, A.; Brand, F.; Krawitz, P.; Hart, L.; Maria Eis-Hübinger, A.; Buness, A.; Streeck, H. Reconstruction of the origin of the first major SARS-CoV-2 outbreak in Germany. Computational and Structural Biotechnology Journal 2022, 20, 2292–2296. [Google Scholar] [CrossRef]
  38. Böhmer, M.M.; Buchholz, U.; Corman, V.M.; Hoch, M.; Katz, K.; Marosevic, D.V.; Böhm, S.; Woudenberg, T.; Ackermann, N.; Konrad, R.; Eberle, U.; Treis, B.; Dangel, A.; Bengs, K.; Fingerle, V.; Berger, A.; Hörmansdorfer, S.; Ippisch, S.; Wicklein, B.; Grahl, A.; Pörtner, K.; Muller, N.; Zeitlmann, N.; Boender, T.S.; Cai, W.; Reich, A.; an der Heiden, M.; Rexroth, U.; Hamouda, O.; Schneider, J.; Veith, T.; Mühlemann, B.; Wölfel, R.; Antwerpen, M.; Walter, M.; Protzer, U.; Liebl, B.; Haas, W.; Sing, A.; Drosten, C.; Zapf, A. Investigation of a COVID-19 outbreak in Germany resulting from a single travel-associated primary case: a case series. The Lancet Infectious Diseases 2020. [Google Scholar] [CrossRef]
  39. Häussler, B. Pandemie-Meldewesen: Deutschland im Corona-Blindflug. ÄrzteZeitung 2021, 15.01.2021. [Google Scholar]
  40. Klement, R.J. Systems Thinking About SARS-CoV-2. Front. Public Health 2020, 8, 585229. [Google Scholar] [CrossRef]
  41. Müller, B. Zur Modellierung der Corona-Pandemie - Eine Streitschrift. Monitor Versorgungsforschung 2021, 14. [Google Scholar] [CrossRef]
  42. Heath, A.; Hunink, M.G.M.; Krijkamp, E.; Pechlivanoglou, P. Prioritisation and design of clinical trials. Eur J Epidemiol 2021, 36, 1111–1121. [Google Scholar] [CrossRef]
  43. Ott, R.; Achenbach, P.; Ewald, D.A.; Friedl, N.; Gemulla, G.; Hubmann, M.; Kordonouri, O.; Loff, A.; Marquardt, E.; Sifft, P.; Sporreiter, M.; Zapardiel-Gonzalo, J.; Ziegler, A.G. SARS-CoV-2 seroprevalence in preschool and school-age children. Dtsch Arztebl International 2022, 119, 765–770. [Google Scholar] [CrossRef]
  44. Brinkmann, F.; Diebner, H.H.; Matenar, C.; Schlegtendal, A.; Spiecker, J.; Eitner, L.; Timmesfeld, N.; Maier, C.; Lücke, T. Longitudinal Rise in Seroprevalence of SARS-CoV-2 Infections in Children in Western Germany–A Blind Spot in Epidemiology? Infectious Disease Reports 2021, 13, 957–964. [Google Scholar] [CrossRef]
  45. Brinkmann, F.; Schlegtendal, A.; Hoffmann, A.; Theile, K.; Hippert, F.; Strodka, R.; Timmesfeld, N.; Diebner, H.H.; Lücke, T.; Maier, C. SARS-CoV-2 Infections Among Children and Adolescents With Acute Infections in the Ruhr Region. Dtsch Arztebl International 2021, 118, 363–364. [Google Scholar] [CrossRef]
  46. Brinkmann, F.; Diebner, H.H.; Matenar, C.; Schlegtendal, A.; Eitner, L.; Timmesfeld, N.; Maier, C.; Lücke, T. Seroconversion rate and socio-economic and ethnic risk factors for SARS-CoV-2 infection in children in a population-based cohort, Germany, June 2020 to February 2021. Euro Surveill. 2022, 27. [Google Scholar] [CrossRef]
  47. Nachtwey, O.; Schäfer, R.; Frei, N. Politische Soziologie der Corona-Proteste. SocArXiv 2020. [Google Scholar] [CrossRef]
  48. Nachtwey, O.; Frei, N.; Markwardt, N. “Querdenken”: Die erste wirklich postmoderne Bewegung. Oliver Nachtwey und Nadine Frei, im Interview mit Nils Markwardt. Philosophie Magazin Online 2021. Accessed on Dec 20, 2021. [Google Scholar]
  49. Wachtler, B.; Hoebel, J. Soziale Ungleichheit und COVID-19: Sozialepidemiologische Perspektiven auf die Pandemie. Gesundheitswesen 2020, 82, 670–675. [Google Scholar] [CrossRef]
  50. Hoebel, J.; Michalski, N.; Wachtler, B.; Diercke, M.; Neuhauser, H.; Wieler, L.H.; Hövener, C. Socioeconomic Differences in the Risk of Infection During the Second SARS-CoV-2 Wave in Germany. Dtsch Arztebl International 2021, 118, 269–270. [Google Scholar] [CrossRef]
  51. Maftei, A.; Petroi, C.E. "I’m luckier than everybody else!”: Optimistic bias, COVID-19 conspiracy beliefs, vaccination status, and the link with the time spent online, anticipated regret, and the perceived threat. Front. Public Health 2022, 10, 1019298. [Google Scholar] [CrossRef]
  52. Fuhrmann, J.; Barbarossa, M.V. The significance of case detection ratios for predictions on the outcome of an epidemic - a message from mathematical modelers. Arch Public Health 2020, 78, 63. [Google Scholar] [CrossRef]
Figure 1. County-specific total incidence time series after reduction to two dimensions. A) and C) Principle component analysis (PCA) along with normal data ellipses embracing East and West Germany (A) and the 16 federal states (C), respectively. X- and y-axes labels contain percentages of explained variability by the corresponding component. B) Dimensionality reduction using t-sne with p e r p l e x i t y = 30 . Full circles point to incidence time series observed in East German counties, whereas circled crosses refer to West German counties. D) Dimensionality reduction using UMAP with n n e i g h b o r s = 10 . Usage of markers as in B.
Figure 1. County-specific total incidence time series after reduction to two dimensions. A) and C) Principle component analysis (PCA) along with normal data ellipses embracing East and West Germany (A) and the 16 federal states (C), respectively. X- and y-axes labels contain percentages of explained variability by the corresponding component. B) Dimensionality reduction using t-sne with p e r p l e x i t y = 30 . Full circles point to incidence time series observed in East German counties, whereas circled crosses refer to West German counties. D) Dimensionality reduction using UMAP with n n e i g h b o r s = 10 . Usage of markers as in B.
Preprints 77319 g001
Figure 2. Correlation matrix corresponding to the incidence time series with hierarchical clustering. Upper panel: Heatmap-like depiction of correlation coefficients resulting from pairwise Pearson correlations of federal state-specific incidence time series along with hierarchical clustering restricted to two clusters. Lower panel: Correlation matrix as in the upper panel, however, calculated on the basis of the Kendall’s correlation coefficients.
Figure 2. Correlation matrix corresponding to the incidence time series with hierarchical clustering. Upper panel: Heatmap-like depiction of correlation coefficients resulting from pairwise Pearson correlations of federal state-specific incidence time series along with hierarchical clustering restricted to two clusters. Lower panel: Correlation matrix as in the upper panel, however, calculated on the basis of the Kendall’s correlation coefficients.
Preprints 77319 g002
Figure 3. Network visualisation of the federal state-specific incidence time series based on MDS. For the spatial arrangement, similarities are calculated from Kendall’s correlation coefficients. Colour (green corresponds to positive and red to negative correlations, respectively) and strengths of edges are likewise derived from these coefficients.
Figure 3. Network visualisation of the federal state-specific incidence time series based on MDS. For the spatial arrangement, similarities are calculated from Kendall’s correlation coefficients. Colour (green corresponds to positive and red to negative correlations, respectively) and strengths of edges are likewise derived from these coefficients.
Preprints 77319 g003
Figure 4. Network visualisation of the federal state-specific incidence time series based on MDS. For the spatial arrangement, similarities are calculated from a PCA. Colour (green corresponds to positive and red to negative correlations, respectively) and strengths of edges are likewise derived from these coefficients.
Figure 4. Network visualisation of the federal state-specific incidence time series based on MDS. For the spatial arrangement, similarities are calculated from a PCA. Colour (green corresponds to positive and red to negative correlations, respectively) and strengths of edges are likewise derived from these coefficients.
Preprints 77319 g004
Figure 5. Centrality indicators of the network depicted in Figure 3. X-axes are scaled as z-score.
Figure 5. Centrality indicators of the network depicted in Figure 3. X-axes are scaled as z-score.
Preprints 77319 g005
Figure 6. Spatio-temporal heterogeneity of SARS-CoV-2 incidence. A) Pan-German incidence, i.e. weekly new cases by population size (black line), range spanned by state-specific incidence curves (dark green area), and range spanned by the county-specific incidence curves (light green area). B) Auto-difference-in-difference time series for the pan-German incidence (red bars), the 16 federal states (black needles, slightly displaced for better visibility), and the 400 German counties (maximum to minimum range). C) Age-stratified time courses of the spatial heterogeneity of incidences over 400 German counties given by Shannon’s diversity measure. D) West-East difference of diversity. A) and C) Red and green rectangles show the first (MV or NW, resp.) and the last (BY or BW, resp.) summer vacation in 2020, 2021, and 2022, respectively.
Figure 6. Spatio-temporal heterogeneity of SARS-CoV-2 incidence. A) Pan-German incidence, i.e. weekly new cases by population size (black line), range spanned by state-specific incidence curves (dark green area), and range spanned by the county-specific incidence curves (light green area). B) Auto-difference-in-difference time series for the pan-German incidence (red bars), the 16 federal states (black needles, slightly displaced for better visibility), and the 400 German counties (maximum to minimum range). C) Age-stratified time courses of the spatial heterogeneity of incidences over 400 German counties given by Shannon’s diversity measure. D) West-East difference of diversity. A) and C) Red and green rectangles show the first (MV or NW, resp.) and the last (BY or BW, resp.) summer vacation in 2020, 2021, and 2022, respectively.
Preprints 77319 g006
Figure 7. Auto-difference-in-difference per federal state. A) Time series of the auto-difference-in-difference according to Equation 5 calculated per federal state (age- and sex-aggregated) along with the corresponding summer vacations periods in 2020, 2021, and 2022 (green shaded areas) in descending order of variance. The curves are enclosed by a 5-week-windowed envelope for visualising trends. B) Evolution of corresponding density functions calculated per quarter for each auto-difference-in-difference curve.
Figure 7. Auto-difference-in-difference per federal state. A) Time series of the auto-difference-in-difference according to Equation 5 calculated per federal state (age- and sex-aggregated) along with the corresponding summer vacations periods in 2020, 2021, and 2022 (green shaded areas) in descending order of variance. The curves are enclosed by a 5-week-windowed envelope for visualising trends. B) Evolution of corresponding density functions calculated per quarter for each auto-difference-in-difference curve.
Preprints 77319 g007
Figure 8. Time series of age-dependent (age classes ordered from top to bottom) auto-difference-in-difference curves according to Equation 7 shown for 5 arbitrarily chosen Western German federal states (horizontally arranged).
Figure 8. Time series of age-dependent (age classes ordered from top to bottom) auto-difference-in-difference curves according to Equation 7 shown for 5 arbitrarily chosen Western German federal states (horizontally arranged).
Preprints 77319 g008
Figure 9. Time series of age-dependent (age classes vertically ordered) auto-difference-in-difference curves according to Equation 7 shown for the 5 Eastern German (excl. Berlin) federal states (horizontally arranged).
Figure 9. Time series of age-dependent (age classes vertically ordered) auto-difference-in-difference curves according to Equation 7 shown for the 5 Eastern German (excl. Berlin) federal states (horizontally arranged).
Preprints 77319 g009
Figure 10. Time series of the cross-difference-in-difference according to Equation 7 calculated for NW versus all 16 federal states. The empty NW-panel is kept to easily spot NW as the reference federal state.
Figure 10. Time series of the cross-difference-in-difference according to Equation 7 calculated for NW versus all 16 federal states. The empty NW-panel is kept to easily spot NW as the reference federal state.
Preprints 77319 g010
Figure 11. Correlation matrix showing coefficients (in %) of pairwise Kendall’s cross-correlations of auto-difference-in-difference time series ln Δ Δ I i ( t ) (Equation 5) corresponding to the pair of federal states as indicated by the row and column labels, depicted in hierarchical (Ward.D2) clustering mode.
Figure 11. Correlation matrix showing coefficients (in %) of pairwise Kendall’s cross-correlations of auto-difference-in-difference time series ln Δ Δ I i ( t ) (Equation 5) corresponding to the pair of federal states as indicated by the row and column labels, depicted in hierarchical (Ward.D2) clustering mode.
Preprints 77319 g011
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated