Skip to content

Latest commit

 

History

History
31 lines (17 loc) · 978 Bytes

README.md

File metadata and controls

31 lines (17 loc) · 978 Bytes

Synopsis

This repo has code and data for bioinformatics removal of host sequence from DNA or RNA sequence data derived from infected cells.

Code Example

Motivation

Host subtraction is a first step in the characterization of unrecognized elements among DNA or RNA sequence. Host subtraction is useful for discovery of symbionts, pathogens, and host mutations. Host Subtraction DB is a system for processing DNA or RNA sequence reads to remove host sequence. Host Subtraction DB includes tools for constructing and using a reference database specific to a host genome.

Installation

Components

  • fastq-filter-by-name.pl : Given a list of read names and a FASTQ file of reads, write another FASTQ that either includes or excludes the named reads. Run the command with no options to receive command-line usage instructions.

Reference

Tests

Contributors

Jason Miller, J. Craig Venter Institute

License

GNU General Public License, version 3.