Category: COVID-19

  • COVID-19 Update: Data is Awful – Deaths are now the Best Metric

    At this point, most of the data aggregators (including Johns Hopkins) have failed to keep track of the myriad of reporting organizations and have settled for less comprehensive data collections. So I have had to go off and decide which metrics are the most meaningful and aggregate them myself.

    Because confirmed cases are so dependent on whether a country/state is testing and also because cases range from no symptoms all the way to stays in the ICU and even death, I have decided that confirmed cases are just a curiosity. In the chart above, you can see that NY has just over 2 confirmed cases per 1000 people. However, the number of deaths per 1000 people in NY is 0.02, 100x lower. Since hospitals must be required to report deaths (and since death is a fairly straightforward metric, whereas a confirmed case requires a positive test), I think this is a much better way to evaluate the severity of the outbreaks around the world.

    The above chart makes the case (that I have been trying to document) that the ratio of deaths to cases is quite different in different parts of the world. However, there’s one more important measure that can also reflect on the severity, and that is the velocity and acceleration of this normalized death rate. The velocity over a one day period is easy to calculate. It’s just today’s normalized deaths minus yesterdays. The acceleration is the more useful number to understand, however, as it relates to how rapidly the problem is going to get out of control (or under control). This is because all of these outbreaks across the world appear to be increasing non-linearly. To understand what a non-linear process might look like, think about the old saying, “The Rich get Richer.” Because they’re Rich, they get Richer, and because now they’re Richer, they get even more Richer!

    So here is a table showing the slope of the non-linear curve for today and the change in this slope over the last two days. Any of these countries or states with a positively changing slope is accelerating. Any with a negatively changing slope is decelerating (what we want!)

    The countries that are accelerating the fastest (inst_roc_delta) are poor little Andorra, San Marino, Iceland, Spain, and then the trio of states NY, Vermont, and Louisiana. As you can see, even a small number of deaths in a country or state with a low population, can contribute to a big change in the instantaneous slope.

    So… takeaway. Deaths in New York, Vermont, and Louisiana are increasing quickly, but are still well below the rates per 1000 people that we’re seeing in Spain, Italy, France, Iran, and even Switzerland (who is also well below the rates of those other hard-hit countries). The point may well be that things will eventually equalize across the world and these numbers will align. Or it may well be that some regions will remain lucky and will be up to 10x less hit with deaths than neighboring countries. It’s hard to tell right now, so we’ll keep watching.

  • COVID-19 Update: More on the Impact of Latitude

    Still wondering if the Latitude effect on COVID-19 is real or just coincidental. I have shown the below chart a few times. In it, you can see that over 90% of the world’s cases have occurred in the band between 30 and 60 N. Latitude.

    I have noticed, however, another effect that is interesting. The cases in the more Northern region of this band (between 50 and 60 N. Latitude) seem to have similar numbers of cases, but with far fewer deaths. I looked through the data and built tables and charts to see if this observation was true. Below, you can see a simple bar chart with these results. All the numbers are normalized by 1000 population, which takes into account the fact that the population in the range from 30 to 40 degrees N. Latitude is just over 3x larger than the population in the range from 40 to 50 degrees. Therefore, this chart shows us that per 1000 people, the number of cases in both of these ranges is nearly equal (about .23 cases per 1000 people). HOWEVER, the normalized death rate in 30 to 40 is 3x larger than it is in 40 to 50 (or any other 10 degree range of latitudes for that matter). See the bar chart below and the table in the next block down if you prefer looking at numbers. I think these numbers tell us that — for now at least — this trend is real.

    Why might this effect exist? It could be just coincidence… large numbers of countries with aged populations between 30 and 40 N. Latitude? Other factors that amplify the outbreak? Societal factors in the Northern latitudes that enable better social distancing? I have no idea, but am very interested to see if this trend holds.

  • COVID-19 Update: Comparison of Iceland and San Marino Hospitalization Rates

    Today’s numbers are out and we see a new spiking (small) country, San Marino. San Marino (SMR) is a small micro-country fully surrounded by Italy. It shares much of it’s culture with Italy, and is also near the Northern part of Italy where the COVID-19 outbreak started.

    It is a very small country, with a population just north of 33,000. To date, they have 187 cases and 21 deaths. This results in nearly 5 active cases per 1000 people and 0.6 deaths per thousand. Their death to confirmed case ratio is 12%, which is extremely high.

    The above chart also brings to mind the case of Iceland (ISL), who has one of the larger numbers for COVID-19 cases per 1000. Part of this is because Iceland has created a strategic sampling approach for COVID-19 so they can find out where it is hitting hardest and triage quickly. This article gives us more of the story too…

    Fifty-six individuals have recovered from COVID-19, 9,013 individuals are currently in quarantine (681 are in isolation), 2,096 individuals have completed quarantine, and 15 individuals are currently in the hospital diagnosed with COVID-19, two in intensive care, and one in a ventilator. A total of 11,727 tests have been administered. Of those diagnosed with COVID-19 in Iceland, two have died.

    This is interesting in light of the chart below, which shows that Iceland’s Confirmed Cases are equal to 70% of the number of hospital beds in the country. But from the data above, only 15 of the ~700 diagnosed people are in the hospital. Because Iceland was randomly testing, they seem to have been able to keep people in quarantine and out of the hospital. This is basically the opposite of the Italy and San Marino experiences. According to the San Marino ministry of health (big kudos to Googles Italian to English translator!), 41% of their confirmed cases are hospitalized (0.2% of their entire population). For comparison, Iceland’s rate of hospitalization is just 2.3% of their confirmed cases and 0.004% of their total population.

    Conclusion: This might confirm the strategy of America’s CDC (at least their recent strategy) of “flattening the curve” to spread the infection out and keep the numbers low in the hospitals. But it also makes the STRONG CASE for randomized testing to identify and mitigate more quickly.

  • COVID-19 Analysis – Why do Italy, Spain, and Switzerland have essentially the same infection rate per 1000 people but very different Death Rates??

    See article on LinkedIn with significant analysis and chart material.

    Note how Switzerland, Italy, and Spain all have the highest infection rates per 1000 people, but Switzerland has 1/10th of the deaths. Why?
  • COVID-19: Data Normalized by Population 3/23/2020

    Much of the results I’ve been showing in the last few days has been total counts of cases, deaths, etc., usually by country or state. Yesterday i decided that there’s probably enough data at this point to evaluate whether all the counts should be normalized by the country’s population.

    Reasons for doing this are probably obvious… we saw a large spike in cases quickly in Hubei Province in China and then later saw a huge spike in cases in Italy. At some point last week Italy passed China in number of counts of cases, but was that really the most interesting measure? Since Italy has a much smaller population than China, the outbreak in Italy per 1000 people living there was much, much higher.

    Here are the rankings (minimum population of 5 million) after being normalized for population.

    Now comparisons can be made that are more relevant. Note that Italy and Switzerland have nearly the same number of Active Cases per 1000 population but Italy has about 10x the number of deaths. Why is this? I look across the blended statistics for each country that I’m using to do COVID severity correlations and I see only two things that are really different. 1) Switzerland has over 1 more hospital bed per 1000 people than Italy and 2) Switzerland has a significantly lower number of people over 65 than Italy. Both of these together might make sense.

    Here’s my updated Correlation Chart – Now the Slope of the Confirmed Cases is done on Confirmed Cases normalized by population.

    FactorCorrelation with Slope (normalized by pop.)
    femaleSmokingRate26.93%
    Over 65 Population – %22.38%
    Mean_BMI_male17.00%
    Population Density8.70%
    totalSmokingRate6.40%
    STD_BMI_female3.88%
    IncomeGroup-1.06%
    hospital_beds-1.56%
    STD_BMI_male-1.96%
    Mean_BMI_female-3.09%
    2019 Diabetes – % of pop.-5.89%
    HIV – % of pop.-7.64%
    Diabetes growth in last 10 years-8.02%
    Population Growth Rate-8.76%
    2019 Population-9.42%
    Tuberculosis – % of pop.-12.21%
    maleSmokingRate-12.50%
    Area of Country – sq-ft-13.11%
    2020 Population-13.59%

    How do I capture this slope? I fit a linear regression line to the data after a country hits 50 cases (arbitrary – I’ve seen others use 100). I then take the slope of the fitted line and project that as the rate of increase of confirmed cases for that country. Of course, most of these outbreaks are non-linear, but it’s simpler to fit a line than a curve, so I believe it makes sense to approach the problem this way. Regardless, this slope is a measure of how severe the outbreak is. Now that I’m using normalized data, I think it’s more relevant and better reflects the amount of pressure COVID-19 is putting on a specific country and its population.

    Lots of discussion could be had on the correlated factors and why they’re positively or negatively correlated with this COVID-19 pressure. Feel free to weigh in.

  • COVID-19: Cases and Deaths by Latitude

    I saw a chart a while back that predicted the cases/deaths by latitude. They plotted it over the world map and had colorbands describing COVID-19 potential risk. Something like that… But based on the amount of data we have, that’s a pretty wild, probably overfit, prediction.

    Here’s what we DO know about cases by latitude. This chart shows total confirmed cases and deaths stacked on one bar (light blue+red) and current (yesterday) reported cases and deaths stacked on a second bar (green+dark orange).

    Takeaway: Most of the cases have occurred between 30 and 60 degrees latitude. Around 40-45% of the world’s population does live in this belt, of course. But we have 92.5% of the cases in this range!

    Any ideas why??

  • COVID-19 Update 3/21/20 – New Spikes and (small) Indicators of Recovery

    I’m entering this late on 3/20 because most of the world’s data is already in. Some interesting things to discuss. First off, we saw a huge spike in Active cases in NY on 3/20. I can only hope that this spike is an anomaly and that we won’t see more like it in the US. You can see this in the bar chart showing change in the last 24 hours, but also in the time series chart immediately following. While we may have hopes that Washington and New Jerseys outbreaks MIGHT be slowing, it is clear that NY continues to accelerate.

    As for the rest of the world, you can see in the chart below that Italy continues to accelerate as well. They announced around 6K new cases yesterday and over 600 new deaths. France and Spain also announced ~200 new deaths. One bright spot as an American, though, is that New York’s death rate is extremely small compared to the number of cases. When normalized by the total number of cases, it is far below that of the large European countries and even smaller ones like Switzerland.

    Finally, here’s something interesting to watch. I’m looking for any country at all that might be recovering from their initial outbreak. I’ll be watching Japan for the next week or so to see if their recovery cycle-time matches the ~20 days that we saw in the Hubei province data. That might become really interesting data. Note that around the 200 case mark, we see the horizontal distance between the Confirmed line and the Recovery line is now measurable (looks like roughly 22 days. If the recoveries accelerate, then we may see a consistent 20-25 day cycle-time eventually.

  • COVID-19 Update – 3/19/2020

    There’s not a lot new in today’s data, other than continued increases in confirmed cases and deaths (primarily in Europe). I’ll update the maps and one or two of the charts here. I’m also continuing to work on the correlation effort I posted about in a previous entry. That will improve as the data improves (i.e., more testing).

    The world as the virus continues.  Italy continues to have the most active cases (and deaths), while Europe and New York state are catching up
    The world as the virus continues. Italy continues to have the most active cases (and deaths), while Europe and New York state are catching up
    The United states map showing overall number of active cases and new active cases in the last 24 hrs
    The United states map showing overall number of active cases and new active cases in the last 24 hrs
  • COVID-19 Severity Correlation – Early Results

    I have blended smoking rates for both males and females by country as well as the 2020 population for the countries with the data. I’m curious about how correlated these features are with the severity “slope” of confirmed cases.

    Results: See correlation plot below. The severity slope is most correlated with the country’s population, then with male smoking rates. Female smoking rate is slightly negatively correlated with severity, but that could be anomalous due to the granularity of the smoking and population data (I have it by country, not by state or county).

    So, what does this tell us? Smoking rates have an impact (particularly male smoking) on the severity, but not near as much as the overall country’s population (and therefore local population density is probably even more correlated).

    Correlation Visualization – Best way to read is to look down (or across) the “slope” column/row. The more yellow colors are high correlation with the severity and the darker blue colors are less (or negatively) correlated. The zero color looks a whole lot like the color of “totalSmokingRate’.
    Correlation Results Below
    slope
    slope 1.000000
    pop2020 0.141106
    maleSmokingRate 0.069965
    totalSmokingRate 0.028203
    femaleSmokingRate -0.052708
  • COVID-19 World Update – 3/18/2020

    In the data from the rest of the world, we continue to see some of the same countries continue to increase their active cases at a high rate. Germany outpaced Italy for the first time. New York State (the first ‘US’ on the chart below) also continues to increase their case load.