Skip to content

TDP-43 motifs in the GISAID Database v1.0.0-alpha

Latest
Compare
Choose a tag to compare
@florez-alberto florez-alberto released this 13 Feb 01:00
· 4 commits to master since this release

TDP-43 motifs in the GISAID Database - Version v1.0.0-alpha

DOI

This repository contains Python scripts that automate the process of downloading, merging, and processing data from the GISAID website. The scripts are organized into two directories: GISAID-crawler and TDP-43, each with its own README file detailing the specific operations performed by the scripts within.

Details

Please refer to each script code to install the appropriate packages via pip3. The scripts are run using Python 3.10.0. Be situated on each working directory to execute the script. Refer to each README file for more information.

Disclaimer: in order to access the information in the GISAID database you must have your own access by creating a username and being given a password. In these scripts, I did not include any data contained in the database, in accordance with the GISAID terms and conditions. This data has not been shared to anyone nor cross-examined with any other influenza database. A separate table acknowledging all sources of the original data will be added.

License

This software is released under the MIT License.

Acknowledgments

Freunde von GISAID and all the researchers that deposited their sequences in their database. Nadia Naffakh for the cannonical sequences max and min sizes and discussion. Darren Hart for discussion.