The results below are likely only meaningful to subject matter experts because the source dataset employs abbreviations, jargon and/or otherwise non-obvious labels. You may get in touch to help improve the source data, or you may browse Analyst-2 to find more accessible datasets.

COVID-19 Coronavirus Dataset

Coronavirus Data released by John Hopkins (vignesh1694/covid19-coronavirus)   []

Context

A SARS-like virus outbreak originating in Wuhan, China, is spreading into neighboring Asian countries, and as far afield as Australia, the US a and Europe.

On 31 December 2019, the Chinese authorities reported a case of pneumonia with an unknown cause in Wuhan, Hubei province, to the World Health Organisation (WHO)’s China Office. As more and more cases emerged, totaling 44 by 3 January, the country’s National Health Commission isolated the virus causing fever and flu-like symptoms and identified it as a novel coronavirus, now known to the WHO as 2019-nCoV.

The following dataset shows the numbers of spreading coronavirus across the globe.

Content

Sno - Serial number Date - Date of the observation Province / State - Province or state of the observation Country - Country of observation Last Update - Recent update (not accurate in terms of time) Confirmed - Number of confirmed cases Deaths - Number of death cases Recovered - Number of recovered cases

Acknowledgements

Thanks to John Hopkins CSSE for the live updates on Coronavirus and data streaming. Source: https://github.com/CSSEGISandData/COVID-19 Dashboard: https://public.tableau.com/profile/vignesh.coumarane#!/vizhome/DashboardToupload/Dashboard12

Inspiration

Inspired by the following work: https://gisanddata.maps.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6

Data summary

  • File '2019_nCoV_data.csv'

    • Table ‘2019 nCoV data’ consists of 24,709 data rows along eight dimensions: ‘Sno’, ‘Date’, ‘Province/State’, ‘Country’, ‘Last Update’, ‘Confirmed’, ‘Deaths’ and ‘Recovered’
  • File 'time_series_covid19_confirmed.xlsx'

    • Table ‘time series covid19 confirmed’ consists of 248 data rows along 69 dimensions: ‘Province/State’, ‘Country/Region’, ‘Lat’, ‘Long’, ‘Wed Jan 22 00:00:00 UTC 2020’, ‘Thu Jan 23 00:00:00 UTC 2020’, ‘Fri Jan 24 00:00:00 UTC 2020’, ‘Sat Jan 25 00:00:00 UTC 2020’, ‘Sun Jan 26 00:00:00 UTC 2020’, ‘Mon Jan 27 00:00:00 UTC 2020’ and 59 other dimensions
  • File 'time_series_covid19_deaths.xlsx'

    • Table ‘time series covid19 deaths’ consists of 248 data rows along 69 dimensions: ‘Province/State’, ‘Country/Region’, ‘Lat’, ‘Long’, ‘Wed Jan 22 00:00:00 UTC 2020’, ‘Thu Jan 23 00:00:00 UTC 2020’, ‘Fri Jan 24 00:00:00 UTC 2020’, ‘Sat Jan 25 00:00:00 UTC 2020’, ‘Sun Jan 26 00:00:00 UTC 2020’, ‘Mon Jan 27 00:00:00 UTC 2020’ and 59 other dimensions
  • File 'time_series_covid19_recovered.xlsx'

    • Table ‘time series covid19 recovered’ consists of 234 data rows along 69 dimensions: ‘Province/State’, ‘Country/Region’, ‘Lat’, ‘Long’, ‘Wed Jan 22 00:00:00 UTC 2020’, ‘Thu Jan 23 00:00:00 UTC 2020’, ‘Fri Jan 24 00:00:00 UTC 2020’, ‘Sat Jan 25 00:00:00 UTC 2020’, ‘Sun Jan 26 00:00:00 UTC 2020’, ‘Mon Jan 27 00:00:00 UTC 2020’ and 59 other dimensions

Size: 353.8 KBSource: KaggleLast updated: 2022-02-14 09:40

  • COVID-19 Coronavirus Dataset - 29 statistical outliers identified in distribution of values in column ‘Wed Mar 25 00:00:00 UTC 2020’ in table ‘time_series_covid19_confirmed’

Analyst-2 explores entire data repositories and data lakes, autonomously analyzing each dataset using the Inspirient Automated Analytics Engine.

If you would like Analyst-2 to surface insights in your company's data repository or data lake, please get in touch!

Creative Commons License

These analysis results by Inspirient GmbH are licensed under a Creative Commons Attribution 4.0 International License in conjunction with the licence of the source dataset.