Tuesday, March 10, 2020

COVID-19 data -2-

After my first post about COVID-19 data i did find another interesting data source
Nextstrain is an open-source project to harness the scientific and public health potential of pathogen genome data. They provide a continually-updated view of publicly available data alongside powerful analytic and visualization tools for use by the community. Their goal is to aid epidemiological understanding and improve outbreak response.

It contains interesting graphs about COVID-19 and the different genomes found in the samples. This nCoV data was generously shared to Nexstrain via GISAID.  From the data on GISAID nCoV genomes are not included as part of that repo as many of them are protected by the terms of GISAID sharing. Also due to some sharing agreements not everything is made public.
As i only see a limited number of cases not all the data about all cases is available. I don't know if this is due to the time needed for analysis or perhaps not in all cases genome data will be analysed and/or shared. However in the Nexstrain github page https://github.com/nextstrain/ncov you can find interesting data to use in your own graphs. Also an api for the data and json seem to be available on the Nextstrain website.
The data can be imported in your spreadsheet using =IMPORTDATA(<url>) as mentioned in my first post about COVID-19 data.

No comments: