5.2. Descriptive Statistics
According to sources [
34], Descriptive statistics are short descriptive coefficient that summarizes a given set of data, which may be a representation of the total population or a sample of the population. As we know Descriptive Statistics describes the characteristics of the given data, and it provides very important data to understand the basic behavior of the data, it consists of two basic categories of measures: measures of central tendency and measures of variability (or spread). As we already described Descriptive statistics are used to describe or summarize aspects of a sample or set of data, such as the definition of variance, standard deviation, or frequency. Inferential statistics can help us understand the collected features of sample data elements. Knowing sample meanings, variations, and distribution of variables can help us understand data very clearly to find the validity of our work in this paper. Descriptive statistics can be useful for two purposes:
- 1)
To provide basic information about variables in a dataset and
- 2)
To highlight potential relationships between variables.
The three most common descriptive statistics can be displayed graphically or pictorially and are measures of:
Now here we have taken the data from December 1, 2021, to January 05, 2022, to continue our work on Covid 19 in Bangladesh so we, here, apply Descriptive Statistics on the data dated above in
Table 2 and find all the necessary fields in Excel below,
Table 4.
Descriptive statistics on Data from 01/12/21 to 05/01/22.
Table 4.
Descriptive statistics on Data from 01/12/21 to 05/01/22.
| Statistical analysis (Descriptive statistics) |
| Dead |
Infected |
| |
|
|
|
| Mean |
3.06 |
Mean |
349.74 |
| Standard Error |
0.33 |
Standard Error |
28.51 |
| Median |
3.00 |
Median |
291.00 |
| Mode |
1.00 |
Mode |
277.00 |
| Standard Deviation |
1.94 |
Standard Deviation |
168.65 |
| Sample Variance |
3.76 |
Sample Variance |
28443.67 |
| Kurtosis |
-1.00 |
Kurtosis |
2.81 |
| Skewness |
0.38 |
Skewness |
1.63 |
| Range |
7.00 |
Range |
770.00 |
| Minimum |
0.00 |
Minimum |
122.00 |
| Maximum |
7.00 |
Maximum |
892.00 |
| Sum |
107.00 |
Sum |
12241.00 |
| Count |
35.00 |
Count |
35.00 |
As we see in
Table 6 all statistical properties are normal for the death column but recovered and infected columns are showing significantly high magnitude where the number of recovered is good for us but at the same time, the number of infected people is the problem here which shows the number of infected people is increasing which is quite like our result in
Figure 10. Now here, the mode of Infected people is 277 and mean and median are respectively 349.77 and 291 so we can say that Mean > Median > Mode which shows that the graph will be positively skewed on the other hand in Death column as it is showing Mean > Median > Mode so this graph will also Positively Skewed.
So, in terms of our figure, we find similar patterns with our statistical analysis. Now we will discuss these in brief. Here, another property called skewness is also matters. As we know few conditions exist in statistics for Skewness which are [
35],
If skewness is less than -1 or greater than 1, the distribution is highly skewed.
If skewness is between -1 and -0.5 or between 0.5 and 1, the distribution is moderately skewed.
If skewness is between -0.5 and 0.5, the distribution is approximately symmetric.
Now in our case, the skewness for Death and Infected are respectively 0.38 and 1.63 so with satisfying the above conditions we can say that the curve of death is approximately symmetric and for infected one it is highly skewed. And from the relation of Mean and Median, The data skewed right.
Now lets the data from October 01, 2021, to January 05, 2022, and we find,
Table 5.
Descriptive statistics on Data from 01/10/21 to 05/01/22.
Table 5.
Descriptive statistics on Data from 01/10/21 to 05/01/22.
| October 01, 2021 to January 05, 2022 |
| Death |
Infected |
| |
|
|
|
| Mean |
5.82 |
Mean |
333.84 |
| Standard Error |
0.53 |
Standard Error |
16.56 |
| Median |
4.50 |
Median |
276.50 |
| Mode |
6.00 |
Mode |
243.00 |
| Standard Deviation |
5.20 |
Standard Deviation |
162.28 |
| Sample Variance |
26.99 |
Sample Variance |
26333.71 |
| Kurtosis |
2.85 |
Kurtosis |
1.56 |
| Skewness |
1.70 |
Skewness |
1.45 |
| Range |
24.00 |
Range |
770.00 |
| Minimum |
0.00 |
Minimum |
122.00 |
| Maximum |
24.00 |
Maximum |
892.00 |
| Sum |
559.00 |
Sum |
32049.00 |
| Count |
96.00 |
Count |
96.00 |
Now if we compare
Table 6 and 7 with the condition, we can see that the skewness for the death curve has decreased from 1.70 to 0.38 which denotes it is shifting from highly skewed to approximate symmetrically and on the other hand skewness of Infected has increased from 1.45 to 1.63 which is showing that with time the curve of Infected is skewed highly rapidly and remain left-skewed. For more validity we take data from April 01, 2021, to January 05, 2022, to get a clear view.
Table 6.
Descriptive statistics on Data from 01/04/21 to 05/01/22.
Table 6.
Descriptive statistics on Data from 01/04/21 to 05/01/22.
| April 01, 2021 to January 05, 2022 |
| Death |
Infected |
| |
|
|
|
| Mean |
68.05 |
Mean |
3480.44 |
| Standard Error |
4.44 |
Standard Error |
241.77 |
| Median |
39.00 |
Median |
1682.00 |
| Mode |
6.00 |
Mode |
261.00 |
| Standard Deviation |
74.23 |
Standard Deviation |
4038.40 |
| Sample Variance |
5509.63 |
Sample Variance |
16308645.90 |
| Kurtosis |
0.28 |
Kurtosis |
1.08 |
| Skewness |
1.20 |
Skewness |
1.41 |
| Range |
264.00 |
Range |
16108.00 |
| Minimum |
0.00 |
Minimum |
122.00 |
| Maximum |
264.00 |
Maximum |
16230.00 |
| Sum |
18985.00 |
Sum |
971043.00 |
| Count |
279.00 |
Count |
279.00 |
Now if we compare tables 6, 7, and 8 with the condition we can see that the skewness for the death curve has Increased from 1.20 to 1.70 which denotes it is shifting upward, and then after October 01, 2021, it changes from highly skewed to approximate symmetrically and on the other hand skewness of Infected has increased from 1.41 to 1.45 which is showing that with time the curve of Infected is skewed highly rapidly and remain left-skewed. Now we analyze
Table 6 with our work and find with passing time death curve has been come to asymmetry form so we can say the death curve became and remain stable but for the curve of infected people the curve is being skewed and if we take the month February 2022, we will see the curve of infected will be skewed higher which will be a sign of the next wave of Covid 19 is coming.
Now we will take data from December 01, 2021, to February 28, 2022, and apply statistical analysis on that data where we will find the validity to our result of the work.
Table 7.
Descriptive statistics on Data from 01/12/21 to 28/02/22.
Table 7.
Descriptive statistics on Data from 01/12/21 to 28/02/22.
| December 01, 2021 - February 28, 2022 |
| Death |
Infected |
| |
|
|
|
| Mean |
11.84 |
Mean |
4123.72 |
| Standard Error |
1.23 |
Standard Error |
508.94 |
| Median |
7.00 |
Median |
1516.00 |
| Mode |
1.00 |
Mode |
277.00 |
| Standard Deviation |
11.58 |
Standard Deviation |
4801.29 |
| Sample Variance |
134.04 |
Sample Variance |
23052357.75 |
| Kurtosis |
0.06 |
Kurtosis |
-0.07 |
| Skewness |
1.10 |
Skewness |
1.10 |
| Range |
43.00 |
Range |
15911.00 |
| Minimum |
0.00 |
Minimum |
122.00 |
| Maximum |
43.00 |
Maximum |
16033.00 |
| Sum |
1054.00 |
Sum |
367011.00 |
| Count |
89.00 |
Count |
89.00 |
Here we can see the skewness for the Death curve has risen compared to the
Table 6 and converted to highly skewed on the other hand the skewness for the infected curve has decreased from 1.63 to 1.10 which is showing the deflection of our curve within the month of February which means our result from
Figure 10 is accurate because comparing
Table 6,
Table 7,
Table 8 and
Table 9 we can say that the curve of infection raised till February 2022 and then it started decreasing which states our result to be accurate. Now if we take a look at
Figure 10 we will find that our curve is also showing the same behavior, as we find with the passing of time the curve of infected is skewing high so in our
Figure 10, we see that the curve is highly skewed with being left-skewed. If we take any random example of a curve being highly skewed, we can see some similarity with
Figure 10 as here is a basic example of a highly right-skewed graph.
Generally, A histogram is a graphical representation that organizes a group of data points into user-specified ranges. In our paper we have focused on the infection rates so we have used Histogram because we know, In trading, the MACD histogram is used by technical analysts to indicate changes in momentum and we are here to find these changes in Infection rates in our data from the
Table 2 by using this feature of the histogram.
5.2. Histogram
Now here we tried to find the number of infected people in different ranges, and we find,
As we get the data, we have also constructed the histogram which is,
In
Figure 15, As we can see in the Histogram, till 05 January 2022 almost 50% of daily infected people are between the number of 200 and 300, and no daily infection under 100. But if we look at the histogram from December 01, 2021, to February 06, 2022, we get,
ere we find a completely different histogram which containing 32 daily infected people’s number is greater than 1000 as below,
Here we can see for the month of January the curve goes higher for the number of daily infected people of 1000+ infect highest. So, histogram is also showing the increase of the infection rate in one month from January 05, 2022, which makes our model more valid.