I’ve been fascinated throughout this outbreak at how it breaks across latitude ranges. Here’s the latest info on Latitude.
COVID-19 Cases by Latitude Range – Cumulative values of Cases and Deaths per 1000 – 6/23/20
The above chart shows cumulative numbers normalized by the population for each region. This shows the upper North latitudes continue to lead in total cumulative cases and deaths per 1000 persons but a couple of the South latitude ranges are starting to catch up (Brazil, Ecuador, Chile, Argentina, South Africa).
Notably, the middle latitude ranges are still far less affected cumulatively.
COVID-19 Instantaneous Rates of Change for Cases and Deaths by Latitude Range – 6/23/20
Now above we see the instantaneous rates of change across the latitude bands. This chart shows us where today’s hot spots are. Note that the US latitudes are very low because overall, US cases are low compared to population. Most of the US cases right now are happening in the 30-40 N Latitude range (Arizona, Texas, California), but the rates in these locations are not large enough to show a significant spike in instantaneous rates (which are spiking for Africa and South America across the board right now). Note that 20-30 N. Latitude are starting to show increased slope in cases and deaths. This is largely due to a growing number of cases in India and Bangladesh.
Hospitalization data right now seems to be one of the most critical signals that a COVID-19 outbreak in a region is getting serious. However, hospitalization data is really hard to analyze for a number of reasons:
Hospitals don’t like to share data. In many cases in the United States (including Arizona) there was no hospitalization data during the first part of COVID-19. This was not the case with European countries. I can make a number of guesses about this including Health Information Privacy (HIPAA), inconsistent data collection, and even a sense of unwillingness by private and public hospitals alike to reveal too much about their business. However, with COVID-19 there seems to be a renewed sense that hospitals are a public good and need to be more transparent. Arizona has a new executive order (23-2020) governing the reporting of COVID-19 data.
There doesn’t seem to be a strong central governance around hospitalization data. Before COVID-19 this was always what I assumed that CDC did, but now I think it’s just a function of the state’s Health Department. I think that if CDC created guidelines and reporting rules, we would have much richer and much more predictive data sets around the health of people in the US. Until then, however, it requires someone to clean data, hand-build datasets, etc., to extract useful information.
Hospital data tends towards the anecdotal side. I have had many forwards from people on LinkedIn or Facebook that came from the cousin of their sister-in-law, who is a surgeon in New York explaining how overwhelmed the ICU is there in whatever town they serve. Then an hour later I get another forward from some connection in the very next county in New York explaining why their hospital has no COVID cases. This is very, very common. I think some of this is due to the above lack of transparency in hospitals, where data is even hidden from employees. I’ve had more than one person who works in a hospital in Arizona tell me at some point during the COVID-19 outbreak that there is only around XX people in the COVID ward right now — “But don’t tell anyone”. I don’t understand the perceived secrecy of this data, but due to the secrecy and poor data reporting, the ancecdote tends to carry the day. Until the next day when the opposite story comes out.
Hospitalizations classified as COVID-19 may not have initially sought treatment for COVID. Florida is starting to run into a new kind of COVID-asymptomatic hospital patient who seeks care for an unrelated issue (broken leg, etc.) and then is tested and found to have COVID-19. This is challenging. Does the patient need to go to the COVID ward? Initially, it seems that yes, they were, but now the state is starting to handle these patients differently (and save the COVID ward for those with COVID symptoms). This is unlikely to be affecting ICU bed numbers, of course, but is possibly affecting inpatient bed counts (which are already reaching maximums as well).
The hospital business and processes are not well understood by the layman, even by the hospital employee at times. This results in lack of understanding of the real meaning behind a data visualization.
Overview of Arizona ICU Bed Management during COVID-19
One thing that is very interesting to me is ICU bed management. Obviously hospitals want to leverage their invested ICU bed capital to make money. This would seem to require limited excess capacity in the ICU — i.e., most beds full most of the time. During COVID-19 one of the earlier stories was how COVID would overrun the ICU’s at most hospitals. I believe this did happen to an extent in New York, but it hasn’t happened yet in Arizona. COVID-19 patients are still less than 40% of all the ICU beds occupied in Arizona, but the number has been growing. See the chart below which compares the percentage of all occupied ICU beds that have a COVID-19 patient in them.
Thoughts:
It seems like the COVID patients peaked as a percentage of the total ICU bed population in mid-April and then gradually tapered off until the lockdown easing was fairly much complete. We then see acceleration in cases drive up the percentage of COVID-19 patients to near 40%. Note there is still 15+% unoccupied ICU beds (though I’m not sure if they’re in the right places). But clearly, whoever the non-COVID patients in the ICU are, they’re decreasing. There may be an element of elective surgeries in the non-COVID ICU population, but I don’t think they’re as many as usual.
I can’t fully get my arms around what this chart is telling me, other than perhaps it shows that hospitals know how to manage their ICU bed resources. The total percentage of ICU beds filled in the state has gone from 74% around 5/21 to about 85% today (total about 130 beds). They have done this while COVID-19 cases in the ICU have increased by about 230 people (hence, the now-higher percentage of COVID patients in ICU beds). I don’t know how they made up those extra 100 people, but they did it somehow. They have some margin to work with, I suppose, because even today, 60+% of all inhabitants of the ICU are non-COVID.
This management is why the increase in hospital bed numbers has been linear while the COVID-19 case growth has been exponential. Here’s a view of hospitalization just compared with the numbers of 65+ COVID-19 cases. Note that the 65+ group which looks pretty linear when compared to the 20-44 age group cases still looks exponential when compared to the hospitalization (especially the ICU) rates.
Comparison of AZ Confirmed Cases over 65 with hospitalization Rates (Maricopa County Data)
The state AZHS Dashboard provides a download and a map of COVID-19 Cases by Zip code. I was playing with the data and noticed that most of the regions with higher cases per 1000 were areas that were known to have lower median incomes. This intrigued me, because we really don’t know much about who is involved in this current wave of COVID-19 infections (other than the age demographics that I presented in previous articles).
COVID-19 Outbreak Info by the richest and poorest zip codes in Arizona
First caveat… due to tribal regulations, I have zero data for tribal regions, many of which would qualify as areas with very low median incomes. This is too bad and if anyone from these regions is interested in having their data analyzed securely, please contact me.
COVID-19 Cases per 1K by AZ Zip Code compared to Median Income – 6/20/20
My thoughts on this chart:
It is pretty clear that COVID-19 outbreaks are much higher in Zip Codes with lower median incomes. The yellow trend line on the chart shows a R2 score of just under 0.5, which indicates that the trend is a pretty solid fit considering this is real-world data. The average number of cases per 1000 in the poorest 20 zip codes is over 9 and the average of cases per 1000 for the richest 20 zip codes is 1.9. Even if you subtract the Yuma and Nogales zip codes that have some of the highest case rates in the country, the average for the 20 poorest zip codes is 6.5. I posted this chart on Facebook looking for theories on why the situation is this pronounced. There was a lot of good discussion about this, and it is clear that whatever is causal for this disparity, it is comprised of multiple cultural and economic variables.
Lots of businesses that were shut down during the state’s COVID-19 lockdown (gyms, restaurants, and even churches) probably have a much higher representation in the wealthier zip codes, where very large gyms, restaurants, and churches thrive. This would be an interesting study. It does seem like the current outbreak probably has much less to do with these kinds of businesses than we would have guessed.
Some of these regions that have much higher cases per 1000 people are agricultural areas. Most of these particular regions also have a very low death rate. Perhaps there’s something they’re doing that makes them more likely to get infected but less likely to be badly affected.
I also suspect that one influential variable is mask usage. My observation in Tucson is that the Foothills region has been much more diligent around face coverings than other parts of Tucson (and certainly rural Arizona). This may be one reason the Tucson Foothills zip code COVID-19 cases per 1000 are extremely low. This may also apply to regions in Phoenix that are similar to the Tucson Foothills.
Now that Pima and Maricopa county are mandating face coverings in public, we have a great opportunity for a natural experiment on the value of Face Coverings. My guess would be that we’ll see the case count flatten out in about two weeks. The question is whether this would have happened anyway. Hopefully we can compare mask vs. no mask regions afterwards.
Zip Codes with the largest Percent Increases in COVID-19 Cases
Arizona had a few really big case numbers in the days since I posted my first chart comparing cases with median income. Below are the zip codes that had the highest percent increase in the last 2 days. As you can see, these areas of fastest increase are generally in lower-income areas.
COVID-19 Cases by Zip Codes Sorted by Median Age
Unsurprisingly, the zip codes that trend younger are also showing a higher case count in general. This aligns with data where we see the 20-44 age group far outpacing the others in new cases. Interesting trivia: Based off the data from usa.com (came from the American Community Survey of 2010-1014), Colorado City’s median age is 15.
COVID-19 Cases and sorted by Median Age for AZ Zip Codes – 6/20/20
COVID-19 Cases Plotted Against Population Density
Following a similar approach, I also put together a scatter plot showing COVID-19 Cases per 1K people plotted against the Population Density of a Zip Code. The trend is one of the strongest yet (the R2 Coefficient is .29 which is usually pretty decent with non-laboratory data). Not a real surprise, but I imagine that density might be a good proxy for large apartment complexes. I’m thinking about ventilation, etc., when I wonder if people in apartment complexes (perhaps less expensive ones have poorer filtration?) have a higher risk of becoming infected with COVID-19.
COVID-19 Cases per 1K plotted against Population Density by Zip Codes in AZ – 6/20/20
I continue to focus a bit on my home state, Arizona, due to the attention the large increase in post-easing cases has brought nationally. Over the last week most of the trends have continued.
Summary:
Cases continue growing, but it looks like the rate over the last week might be linear. I’ll show data from the two largest counties, Maricopa (Phoenix) and Pima (Tucson).
Border cases continue to grow. Santa Cruz county now has a case rate of over 20 cases per 1000 people. This is larger than the case rate for the state of New York. Yuma County’s case rate is 14 per 1000 people.
Arizona’s case rate as a whole is under 5 per 1000 people. There have been misleading stories that appear to indicate that Arizona has passed New York in normalized cases. This is only true when one considers the number of cases per person over the last week. Cumulatively, New York’s 19 cases per 1000 people is nearly 4x that of Arizona. Arizona’s case rate puts it at #23 in the nation, whereas New York’s is #1.
Arizona’s death rate as a whole is 0.166 deaths per 1000 people. For comparison, New York’s is 1.57 deaths per 1000. Arizona’s death rate is also 23rd in the nation (strange). Our counties range from 0.78 in both Navajo and Apache Counties down to 0.03 in both Graham and Yavapai Counties. Maricopa and Pima are 0.12 and 0.21 respectively.
Hospitalization has increased across the board in the state. It’s hard to pin down where exactly the hospitals are filling and where they aren’t due to the way the state shares data. Statewide, ICU bed usage is at 82%, which represents 1,347 ICU beds in use. This is up from 1,192 last week, a difference of 155 patients. Of these, 74, or about 1/2 are COVID-19 (or suspected) patients. When I add up all the hospital bed increases (not necessarily due to COVID patients) over the last month I get a 69% increase. The increase in hospitalization due just to COVID-19 patients was 72%. This sounded very high to me, so I went to the CDC Influenza Tracker site for comparison. The flu season in 2020 saw a one-month 34% increase in all hospital beds across the country (only due to flu). I then looked at what I knew was a bad flu year (2017-2018) and saw a 186% increase in hospitalization due to flu alone that year. One caveat is that one can’t determine what KIND of hospital beds were occupied during this flu season, so hard to compare perfectly to our situation. However, our 72% over the last month is about double the worst flu month this year and less than half of the worst month during a bad flu year. I’ll keep watching the numbers and hopefully it doesn’t keep increasing.
Normalized Cases by Age Group
What I’ve done below is figure out the populations of each Arizona age group below and then normalize the numbers of cases per age group by their population. This allows me to see for a given age group the number of cases per 1000 members of that group. I do the same for Deaths and Hospitalizations.
Interesting things to note:
Other than the <20 group, the number of cases per 1000 is pretty close for all other groups. This seems to confirm that the <20 group is much less likely to get infected by COVID. This group has been less tested than the other groups, but the number of tests seems to be a reasonable sample to give a decent estimate for the population.
The other metrics (deaths, hospitalizations) differ widely across the different age groups, as one might expect.
What this shows is that even at this stage, this disease is still pretty rare (at least per the formal counts). Other than the <20 group, somewhere under 1% of the population has been recorded to be infected. For the Arizona 20-44 age population that is driving most of the new cases, only .04% (of the whole group) has been hospitalized and .003% has died. 4.85% of the confirmed cases in this group has required hospitalization and 0.35% of the confirmed cases has died. Hopefully that explanation helps you read the table. Clearly, getting COVID-19 if you’re over 65 is really concerning (even though only .83% of that population has gotten it yet) due to the 28% hospitalization rate and 16% death rate for people who actually get the virus.
Update on Cases by Demographic
Note that age demographic case trends have continued at pretty much the same rates over the last week. Pima shows a one-day slowdown, but I don’t read much into that yet because it is most likely to be a data collection issue if it doesn’t continue for 3-4 days.
Hospitalizations Compared to Cases
The hospitalization rate (though I showed that it was increasing up above) still looks flat when compared to the case growth. Even though we know that hospitals have limited number of beds, I take this as good because the increase in hospitalization is much, much lower than the increase in cases. In Maricopa we saw about 200 new hospitalizations (only about 40 were ICU beds) to about 6000 new cases over the last week, about 3% of all cases. And looking at the chart, it appears that if trends continue, we’ll be seeing even lower percentages of all cases in the future. Again, this points to a boom in recording less serious cases. It is most likely that this is largely due to the greatly increased number of tests the state has been conducting. These asymptomatic cases may have been around all along but only now are we measuring them. This is why I think looking at hospitalization rates is the best metric now.
After posting part one of this overview, I had an idea of how to gather data on the thing that I was most curious about, the age groupings of the new cases. I was fortunate that Fox10 in Phoenix had been taking screenshots of the daily cases and breakdowns from Maricopa county, so I went through their blog and pulled the numbers out by hand. It turns out that this was a very valuable activity, so I scrounged around on the internet until I found a similar situation for Pima County. Combined, these account for most of the population of the state, so their numbers should be representative of what’s happening.
Case Breakdown by Age Groups
Case Demographics from Maricopa County since 4/14Case Demographics from Pima County since 4/09
These two plots tell a very interesting story since they contain data from the state lockdown (started easing on 5/1) as well as data from the post-lockdown. There are some obvious things to see:
Exponential case growth starts around 5/28 in all but one age demographic. Case acceleration is largest in both counties by far in the 20-44 year group. There is slight acceleration in the other groups (note that the Pima chart separates the 45-54 and 55-64 brackets whereas Maricopa does not).
The one exception to the above is the 65+ bracket, which seems to have had the same case slope since 4/9. This is interesting, because it makes the case that the accelerating case growth in the state since 5/28 has not impacted the most vulnerable population (~80% of deaths and ~70% of hospitalizations).
The lockdown seems to have succeeded in maintaining linear growth in cases for all age groups. The lockdown started gradually easing in about 1 week increments on 5/1. The exponential growth started about 3 weeks after easing started. Hard to take much away from this other than the first steps of easing (opening up some restaurants, etc.) appears to not be responsible for the exponential growth by themselves.
Conclusion from this data
The main conclusion I take away is that the factors driving 65+ case growth do not seem to have been affected by the state lockdown. Why do I make this conclusion? The 65+ group is the ONLY age group with linear growth in cases at a constant slope on this graph. The lockdown did not affect the slope and the easing has not affected the slope. This needs to be studied more, but it makes the case that the state lockdown in AZ was not effective in achieving the goal of protecting the most vulnerable age group. Since this group drives the hospitalization numbers (which we are all concerned about, of course), it would seem that whatever approaches we take to COVID-19 should be targeted at decreasing their COVID-19 infection slope.
To the above point, evidence continues to grow that superspreading events are responsible for most of the case growth. See this preprint for the latest research on this effect. According to this research from Hong Kong, 20% of the cases were responsible for 80% of the COVID-19 transmission and 70% of infected people never transmit the disease at all. They also claimed that social events were more responsible for transmission than family or work (unless you work in a meat packing plant, I guess…). This does seem to give some weight to the notion that better managing the events that might be associated with superspreading (including the oversight of workers in group homes, etc.) would have more effect on the hospitalization and death rate than economic lockdowns, which appear to have little to no effect on over 65 case growth rates.
Hospitalization Data
I was fortunate that Maricopa county is also collecting data on hospitalization by day and was able to build a time series plot there as well. This data is presented for the current date in the state’s Tableau Dashboard, but I haven’t found anyone yet that has captured that data every day so that trends could be analyzed. So all I have is Maricopa County, but what it shows is interesting.
What do we see here? First, we note that as the number of cases started going exponential (accelerating) on 5/28, hospitalization just took a small tick up and then stayed flat. There may be a few reasons for this:
At some point during the lockdown, hospitals stopped doing elective surgeries. Sometimes these surgeries end up populating the ICU. This is bad for the hospitals because elective surgeries are fairly lucrative economically for the hospital. Therefore, as soon as it made sense after easing started hospitals went back to elective surgeries. My guess is that they’re quite good at managing their hospital bed usage (ICU or otherwise) due to the economic effects of being good at doing so. This might explain how flat the hospitalization growth is in the chart. It would seem that once COVID-19 cases started growing again on 5/28 the hospitals saw a bump up in their numbers as COVID patients started flowing again into ICU’s which had been collecting patients from elective surgeries, etc., during the easing. Once absorbed, the rates have flattened off again (at a higher number, but perhaps that’s for the time being). I’m curious about the unusually high hospitalization cycle time for COVID (I hear 14 days…) and how that impacts the hospitals’ abilities to manage flow through their ICU beds.
Since hospitalization rates aren’t really growing (and certainly not at an exponential rate, which is what everyone has feared), the chart above makes the case that all the concern about AZ case growth rates is misplaced (though I suppose it makes for a good story on a slow news day) as nearly all of the cases each day appear to be either asymptomatic or at least low enough symptoms to not require hospital admission. This is good and probably is exactly what the case and hospitalization data would look like in flu season if we were to track it this closely.
We know that ~70% of COVID-19 hospitalizations are in the 65+ group, so I think this strengthens the case that most of the new cases in the younger demographics are not overly symptomatic. Add this to the testing results I discovered in part 1 of this series and it seems clear that probably close to all of the new cases that are contributing to the growth were discovered during the huge testing blitzes that have been happening and are not reasons for concern.
Conclusion
The media has been excessively focused on case growth in Arizona, but I believe the data indicates clearly that there is no emergency happening.
Most of the new cases are in younger demographics, are not requiring hospitalization, and were uncovered through testing. This is good and is part of the reality of having a novel virus in the environment.
Hospitalization growth is not a concern and reflects something that hospitals manage every year during outbreaks of flu or other pathogens. They are very good at managing their case loads (but I’ve heard they need to manage their beds at somewhere near 80% capacity to make money anyway). Once hospital bed occupancy hit 75% the growth slowed to a crawl and now seems to be flat.
Case growth in the most susceptible age group does not seem to have been affected by the economic shutdown. Additionally, the growth of mask wearing in Pima County that started in May also appears to have had no impact on the 65+ growth rate. This makes me wonder if other approaches may be more effective at ultimately limiting deaths and hospitalizations that overwhelmingly come from this group. This is an area that obviously bears more study.
Questions I have
After writing this I still have some questions that are interesting to me. I will use this space to capture a few:
I note above that the lockdown seem to have had no effect on growth of cases in the 65 plus group. My evidence for this is that post-easing, we’re seeing exponential growth in all other groups but not 65+. I have a few questions that I can’t answer about this:
Does the incredible growth in testing invalidate the assertion that case growth is higher after easing than before? I.e., are we catching cases now due to testing that we would not have caught before testing started? If true, perhaps there is no case acceleration, we just changed the way we measure in mid-stream. Also, if true, there may be less indication that case growth in the other demographics doesn’t affect case growth in over 65.
Is the shallow, linear slope of 65+ case growth due to their increased likelihood at wearing face coverings? I have no data on this, but it would stand to reason that the group who’s lives and health is most threatened by this virus may take greater precautions. I suspect that testing in this group hasn’t increased nearly as much as it has in other groups because I’m pretty confident that most tests conducted prior to the testing blitz were only given to people who had a strong prior for being infected. That would mean that most tests would have been given to people with symptoms (and a very high percentage of the other age groups are asymptomatic)
I also have questions about the efficacy of face coverings on asymptomatic people as source control. I understand the logic that it is easier to mandate that everyone do something that appears to have benefits, but I’m not sure if the evidence indicates that there is a compelling reason to deprive liberty in this way. I keep evaluating this…
Evidence continues to mount that wearing face coverings is more effective than extreme social distancing. A very recent paper published in the proceedings of the National Academy of Sciences (LINK) makes the case that it is airborne transmission via aerosols that is the dominant route for transmission of COVID and that a facemask as source control on an infected person is the most effective mechanism to prevent atomization of the virus-containing droplets. If the virus is atomized, six feet of distance is no longer protective, and indeed, some evidence indicates that cloth masks on passers-by do not filter out these aerosols. My question, however, is whether this kind of atomization can occur in a pre-symptomatic person who is not coughing? Also, can atomization be caused by a pre-symptomatic person talking loud? I have seen evidence of atomization of flu and other viruses occurring through coughing, but that is not pre-symptomatic behavior.
Arizona is about three weeks after reopening and there has been a lot of increase in case growth. Headlines like this are catching attention with the leader:
PHOENIX – Experts around the country and in Arizona are raising alarms about the state’s COVID-19 situation because cases and hospitalizations have increased for the past two weeks. The increase in cases can’t solely be attributed to increased testing in Arizona, experts say.
Lets look at the data and see if it doesn’t give us some clues. There’s a good chance the “experts” haven’t done this…
Here is data on all the counties in Arizona from 6/10/2020. I have sorted these by the case growth rate, which as you might remember is the slope of the tangent to the curve (involves calculus, just trust me, this is how you find the change rate of a curve).
Data on Counties in Arizona sorted by Instantaneous Rate of Change of New Confirmed Cases (normalized by population). 6/10/2020
What do we see here?
The highest two counties by instantaneous rate of change (IROC_confirmed) are Santa Cruz and Yuma. Both of these happen to have highly traveled border crossings. The Nogales port of entry in Santa Cruz County sees around 10M people cross this border legally per year. The crossing near Yuma sees about 7M people cross. The Calexico crossing in California an hour or so west of Yuma sees about 12M people cross per year (and yes, Imperial County, CA, is seeing similar numbers to Yuma and Santa Cruz counties. So I have to assume that there is some contagion between Mexico’s cases and these two port of entry counties. Depending on how the state’s case growth is being counted, the numbers from these two counties could be significant. The high acceleration in case growth in these counties (dIROC_confirmed) indicates that they are seeing exponential growth. This is concerning and there’s a possibility cases may spread northward in Arizona from these counties. Both are fairly isolated, however, so perhaps that will minimize it.
Maricopa county continues to see the majority of the actual cases in the state. This isn’t surprising due to their large population (about 4x larger than Pima county). We are clearly seeing exponential growth in cases — but not deaths. See time series charts later in this post. I’d guess that in this large, wealthy county most of the case growth is due to increased testing and social distancing lapses. There’s no data to indicate that protests are causing any of these cases. Note however that the acceleration of case growth is very small still, so most likely we’re not seeing a major exponential growth situation like we saw in New York City.
Apache, Navajo, and Coconino counties all have Native American reservation outbreaks. It does seem like these are slowing down (see time series charts).
Deaths per 1000 persons in the state is still very small compared to most other states.
Testing
comparison of AZ Testing Blitz with increase in cases.
As you can see in the chart above, AZ started a testing blitz a bit before they took the first phase of opening the economy. They report that they’ve conducted 416K tests so far. It does appear that the lions share of these testa have occurred after May 1. Assuming an average of 9K tests per day and over around 25 days (maybe some days they took off), we get at least 225,000 tests since May 1. The state’s advertised average positive test rate is 6.4%. Multiplying these, we get 14,400 positive tests that came from the increased testing. This accounts for nearly all the cases shown on the AZ Case curve above. Since it is widely reported that 50% of people with COVID-19 never know that they have it, lets assume only 7K of these new cases are symptomatic. Compare this to the 8-9K of cases before May 1, which one might assume are nearly all symptomatic (because testing was only given to symptomatic people), then you may see that symptomatic cases are flattening out. Not sure if this is true, but it seems likely that the graph now is showing at least 7K cases which would not have appeared before May 1.
Arizona’s Rank in the US per COVID-19 Cases
There’s a misleading chart that NPR has put out (it’s an experimental graph that Johns Hopkins is showing) in this article that makes a comparison between Arizona and New York. Both axes are logarithmic, largely because the New York data is so much greater than any other state, but the X axis is measuring total cases since January. The y-axis is averaging the last seven days’ worth of cases. The intent is to highlight case growth late in the outbreak cycle. In theory, states who had their outbreaks early will look as if their cases are dropping and states who did not have early outbreaks will show increases as large spikes. I think the whole format is not informative and is very misleading when comparing two states. Also, the cases aren’t normalized (which would allow them to not use logarithmic axes). Anyway, very bad plot which tells us nothing. It certainly doesn’t tell us that AZ is in crisis. See below for more measured data that tells us to be cautious, but not that we’re in difficult straits. First, I show a chart of the top states ranked by the Rate of Change of Confirmed Cases per 1000 persons.
Data on US States sorted by Instantaneous Rate of Change of New Confirmed Cases (normalized by population). 6/10/2020
Note that Arizona has moved up to 4th on the list. This (IROC_confirmed) is a volatile metric as the case growth curves are not a cumulative measure and can change to represent recent trends. But it is far more accurate and informative than a 7 day case average! To demonstrate the volatility, Arizona was 10-15 places further down the list earlier in the week. Here we see a bit of a resurgence in Louisiana along with some spillover cases from DC (which had a very steep bloom in cases a few weeks back). I was told by a friend in Virginia that the bars in DC are far more accessible right now than those in VA or MD due to more extreme social distancing in those states. So this might explain the VA and MD cases. Things to note:
Note that there is a really legitimate difference between Deaths per 1000 and Cases per 1000 between Arizona (and Alabama) and the other states at the top of this list.
Also, note that the COVID-19 deaths that occurred on this day in these states (Delta_Deaths) is fairly low. This has been the trend ever since probably mid-May. For an idea of the magnitude of these deaths, see below:
Table showing comparison of COVID-19 daily deaths with expected Deaths per Day of the top causes of death (data collected from https://www.cdc.gov/nchs/fastats/deaths.htm)
I put this table together using CDC US-wide stats and assuming uniform distribution of these deaths across all states (may not be a perfect assumption, but I imagine it’s good enough for demonstration). The table shows the COVID-19 deaths on 6/9 compared to the expected deaths per day of the top causes of death in the US (2017 numbers — the CDC always seems to be behind). As you can see COVID-19 deaths, even in a period of lower death rates ARE STILL A MAJOR CAUSE OF DEATH. However, they are not the leading cause of death in a majority of these states (and in California, they are well below the top 2). The point is that numbers like this, while troubling, do not pose a threat to our existence. Hopefully that is helpful. Keep this table handy as the numbers change. If deaths go up or down, this will remain a handy reference to risks that we are already capable of understanding and accepting.
Hospitals
This is a different issue, as there’s not clear data on the hospital situation. Headlines like this from Tucson’s newspaper are very concerning. The assertion is that ICU space is running out in Maricopa and Pima counties. Arizona doesn’t collect data very well (although they have improved quite a bit since the crisis started) and therefore we’re lacking good access to hospitalization data and recovery data. We know from the above table that there were 595 new active cases in the state yesterday. How many of those will eventually wind up in a hospital? This is hard to predict, as it depends on who is getting infected. If 300 of the 595 are younger people who have lapsed in social distancing then the number could be small. We don’t know this, because the state doesn’t break down new cases by demographics (see AZDHS data dashboard). All the state tells us is the number of hospitalizations per day (and it appears there’s about a 1 week lag on this data) and some data on the numbers of different kinds of hospital beds in use. In the chart below, we see that the percentage of ICU beds that are in use right now has gone up from about 62% of capacity (the chart tells me this equals 741 beds) in early March (before COVID hit AZ) to 79% (1302 beds) yesterday. This seems very significant, but I wonder how many of these 1302 beds are in use due to COVID-19? Fortunately, we have that data from the state (assuming it is correct and there’s no lag) and I have pasted it below. It does seem like there’s some average number of beds that were in use during the early COVID-19 period from looking at the first couple of months and averaging… (ignore the first day, that looks like bad data collection). So it would seem that before our recent surge in cases there were around 300 beds in use on average due to COVID. So our number today (413) minus 300 would seem to be the growth in ICU usage due to the recent surge in cases. From the top chart, I infer that there must be around 1650 ICU beds in the state. Therefore, since the lockdown was ended, Arizona has seen COVID-19 cases consume 25% of its total ICU bed capacity (413/1650), compared to 18% (300/1650) during the lockdown period.
AZ ICU Beds in use by % of total Capacity.AZ ICU beds in use by positive or suspected COVID-19 patients (6/9/20
Ventilators
From the chart below, it would seem like ventilator capacity is still fairly significant. From a different chart that the state puts out (again, assuming all this data is accurate), it would appear like ventilator usage is up about 70 from the lockdown period. What we don’t know is if these ventilators are in the right places or not… There are other charts on the AZDHS data dashboard that show details about different kinds of hospital beds, etc. It would be very useful if they could show cumulative hospital admissions and recoveries over time, because if that data was available, it would be easier to understand the cycle time of this disease on the hospital system.
Ventilators in use by % of total capacity, 6/9/20
Time Series Plots by Arizona County
This is for the Data Nerds. Here are a few of today’s time series plots of cases by county (and deaths for a couple of counties). Note that most of these counties are seeing acceleration of case growth (i.e., exponential growth) but the two largest are showing potential deceleration in their death rates.
Conclusion
Essentially, yes, cases are growing but deaths are decreasing. We don’t know enough to understand how much concern to have. I continue keeping in mind that most people are going to get this virus. I read that it binds 10x better to the ACE2 receptor than SARS, so while SARS may not have been a threat to most of us, COVID-19 will likely hit us all… Probably no reason to be panicked, but certainly reason to take caution.
Yuma County cumulative confirmed casesSanta Cruz County Cumulative Confirmed CasesMaricopa County Cumulative Confirmed CasesMaricopa County Cumulative DeathsNavajo County Cumulative Confirmed CasesApache County Cumulative Confirmed CasesPima County Cumulative Confirmed CasesPima County Cumulative Deaths
We’re seeing Case Growth in various regions and this is being noticed by news media, etc. It isn’t clear yet what is causing the new case growth, but states exiting lockdowns are probably a part of the reason at least. Here are some visualizations about one week apart to give kind of a time-lapse effect on case growth. Note that these diagrams are straight counts of cases and deaths (not normalized by population). This may give the effect that a region with a large population is being hard hit when they may not be… Also, these snapshots show one day’s numbers, not cumulative counts.
The Color represents the number of new cases, the diameter of the bubble represents the number of new deaths.
Data from 6/3/2020
Case and Death numbers from 6/3/2020
Data from 5/27/2020
Case and Death numbers from 5/27/2020
Data from 5/20/2020
Case and Death numbers from 5/20/2020
Data from 5/13/2020
Case and Death numbers from 5/13/2020
Data from 5/6/2020
Case and Death numbers from 5/6/2020
Conclusion
Take a look at the images above. We have seen lots of different phases of this disease in the last month and clearly things change fast. A few things to take away:
Deaths are significantly down. It seems like this is the case across the country, especially in the Northeast region.
Cases are up in certain regions. California and Illinois are seeing new cases and both are seeing deaths. However, there are numerous regions with cases and zero deaths that don’t show up because the diameter of the bubble (deaths) is so small. Perhaps this is good and is a picture of where we are re: treating this virus. Or maybe the deaths will start increasing in these regions in the next few days.
Overall, Cases are down. Note that the legend on the colorbar tops out at ~3500 on 5/6 but on the most recent chart it’s about ~1000. This is because case counts were higher overall a month ago.
Most regions are still relatively untouched by COVID-19.
Here’s a bit of an update on cases/deaths per 1000 population and the growth rates of cases/deaths per 1000 by latitude range. Below are the cumulative numbers of cases and deaths per 1000 summed up by latitude. It’s no surprise that the 40-45 band has been by far the hardest hit region. Now we’re seeing things slow down in these regions.
US Cases and Deaths per 1000 population by latitude ranges
The next chart show the current Case and Death per 1000 rates. You can think of these as the current “hot spots”. These hot spots may have high rates for a while or they may decrease sharply tomorrow. This is just an indicator of where cases and deaths are happening right now.
Growth Rates of US Cases and Deaths per 1000 population by latitude ranges
Overview
The blue-salmon chart at the top is relatively fixed and probably looks the same as it did last time you saw it. The 40-45 latitude is still by far the hardest hit (all the Northeast states, Michigan, Washington state, Illinois are in this band) and it still has a pretty high growth rate. However, we see the 35-40 region (DC, Maryland, Las Vegas) having the highest growth rates at the moment. Does this mean the virus is shifting to other regions? I’m not sure, but things seem to be changing. Rates may still be positive and some rates are increasing, but nothing like they did during the early phases of the outbreak. Deaths, likewise, seem to have slowed significantly in all regions. If this is due to better awareness by the population and better understanding of the disease by the medical world, then it does seem like the lockdown did buy us some time.
I was on vacation for a week. Here is what has happened.
Data from 5/29: This table and the one further below (from 5/22) show the top few states sorted by their COVID-19 Case Growth. Number of new cases is a metric that I’m seeing a lot of news articles show, but it’s not a good one. This metric shows the slope of their case growth, and therefore smooths out unusual spikes (like those due to irregular data collection approaches). STILL, I’m not a big fan of counting cases… it might be fun to do, but it’s not revealing much. Why? 1) We’re testing more now, so clearly cases should be growing. That should not be news. 2) I believe cases will continue to grow for the near term (and might start again when the weather cools off). This is a new disease for mankind that seems to spread quickly like the common cold or like influenza and we ought to get used to it. What we should really care about is that we can manage the cases well and keep people out of hospitals. Imagine if we reported flu cases the way we’ve been reporting COVID-19 cases?!?
What is interesting in the data below? First, Illinois continues to lead the country in the steepness of it’s case growth. However, their growth has slowed from .2217 cases per 1000 per day to .1871. Still growing, but not growing as quickly. Possibly due to their case growth, Illinois slowed their reopening and I believe today is the day they move into Phase 3 of reopening. I note that Nebraska’s numbers are falling fast, which makes sense to me because they were only being driven by one large county (which may have been related to a meat packing plant outbreak in a neighboring county). Illinois’ cases, however, were spread across a number of counties, and therefore may take longer to process through. The good news, though, is that the numbers of deaths are slowing significantly. Case growth with low deaths starts to look manageable, like influenza.
State Data Table from 5/29/2020 – sorted by case growth ratesState Data Table from 5/22/2020 – sorted by case growth rates
Arizona Latest Data sorted by Case Growth Rates
Since I live in Arizona, and since it has been an interesting state for COVID-19, I’ll show the latest for my state. Note that the top counties in case growth are Apache and Navajo, two neighboring counties that contain multiple Native American reservations (including the Navajo). Many have heard about how severe the outbreaks have been in the Navajo Nation. You can see this in the data… cases are spreading in these smaller counties very rapidly, especially when compared to the states’ tables above. Santa Cruz county is the county with the fastest accelerating rate and Yuma County is right behind. As both border Mexico and have heavily traveled border crossings, I suspect that there is a relationship between their new cases and the large number of new cases across Mexico. The two largest counties, Maricopa and Pima, have both slowed down a bit and Pima county’s growth rate is decelerating slightly. Fortunately, again, deaths are very low across Arizona (the row below with NaN for county is the results for the state as a whole).
Below is a really poor graphic put together by NBC (find HERE). All I can figure is that a young data scientist somewhere within the bowels of NBC wanted to present a story about the dangers of not locking down a state and therefore consciously or unconsciously laid out a graph that would confirm their intuition. My hope is that presenting a comparison of how NBC approached this data with my more careful representation of the same data will help people recognize particularly manipulative data presentation.
Below is what NBC presented:
Poorly presented data from NBC (https://www.nbcnews.com/health/health-news/here-are-stay-home-orders-across-country-n1168736)
Maybe the problems with this are obvious. Perhaps it’s true that states that never issued a lockdown order are especially hard hit right now, but this chart does not make that case, as 1) it just compares these states to each other, 2) it uses narrow, one-month time scales, an approach that makes the curves look very alarming, and 3) it doesn’t normalize the cases by population. Lots of cases in a very large region doesn’t equal a few less cases in a very small region. Obviously that second, smaller, region is going to be having a more difficult time. See below where I have taken this same data, normalized by population, and compared to a range of other states of different sizes with Governors of either party. I leave NY off intentionally so as not to dwarf the other states.
Here is How I Present the Same Data
Tod’s presentation of the “non-lockdown” state cases normalized by population and compared to other lockdown states for reference.
Conclusion
What do we see in the second chart? When we compare the “non-lockdown” states to other states, we see that a couple of them (Nebraska, SD) are comparable to the relatively-hard-hit Illinois, but the others are more comparable to less-hard-hit states like Arizona, Kentucky, and Texas. A quick google shows me that the cases in Nebraska and SD are largely driven by super-spreader activities at single meat-packing plants in each state. Perhaps a lockdown would have prevented this, but that bears more research. Looking at Nebraska vs. Illinois, we see two very different approaches to COVID-19. Nebraska is a small state that did no lockdown. Illinois is a larger state with big cities that is still on lockdown as of today’s date. As you can see, both of these states have a very high case growth slope (while New Jersey and South Dakota seem to be decelerating). Hopefully I’m making the point that being honest with the data and taking a scientific approach to presentatin of data is better for all of us. It is less sensational and more representative of what is actually happening. Plus, it doesn’t drive any false narratives.