Why Use This Data Source In Your Models?
The COVID‑19 pandemic is an ongoing global pandemic of coronavirus disease 2019 (COVID‑19), caused by severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2). From the business focused data scientist there are three main measures to inform the economic impact- cases diagnosed, available hospital beds and recovered cases.
Seasonal Impact
Source:
The COVID Tracking Project
Release:
COVID-19 Currently On Ventilator (State)
Units:
Persons
Frequency:
Daily
Available Through:
03/07/2021
Suggested Treatment:
The data shows auto correlation and a non-normal distribution. The data should be differenced. While the Order Norm transformation, provides the best normality, the Boxcox variable will also perform well.
Grain Transformation:
Data is unable to be distributed by time or geography. The roll up method used is Max.
Data shows auto correlation indicating a need for differencing
The ACF indicates 2 order differencing is appropriate.
Further differencing is reccommended
The Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test, KPSS Trend = 0.51 p-value = 0.01 indicates that the data is not stationary.
The Shapiro-Wilk test returned W = 0.93 with a p-value =0.00 indicating the data does not follow a normal distribution.
A skewness score of -0.56 indicates the data are moderately skewed.
Hartigan's dip test score of 0.03 with a p-value of 0.86 inidcates the data is unimodal
Auto Correlation Function
Auto Correlation Function After Differencing
Partial Auto Correlation Function
Seasonal and Trend Decompostion
Data Notes:
As of March 7, 2021 The COVID Tracking Project are no longer collecting new data
Citation:
The COVID Tracking Project, Data API; https://covidtracking.com/data/api, Retrieved daily.
Use our platform to aggregate, normalize, and profile open source and premium control data. Spend less time finding and wrangling data, and more time building efficient and feature-rich machine learning data pipelines.
Instantly apply industry-standard
data science treatments and transformations, including (but not limited to) Differencing, Lead/Lag, Box Cox. Easily manipulate data across different time and geographic grains.
Our Patent Pending iterative testing engine allows you to upload your target variable, and the platform will test for possible statistical relationships across all available data sources. Saving you time and removing analyst bias.
Easily integrate your Ready Signal data to the data science platform of your choice. Connect directly to Ready Signal through our API or using one of our pre-built data connectors or download directly in Excel or CSV format.