Tutorial for general UNIX computers without Docker

Clone this pipeline's repository and change into its directory.

  $ git clone https://github.com/ENCODE-DCC/chip-seq-pipeline2
  $ cd chip-seq-pipeline2

Download Cromwell.

  $ wget https://github.com/broadinstitute/cromwell/releases/download/34/cromwell-34.jar
  $ chmod +rx cromwell-34.jar
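
Cromwell is distributed as a JAR file, so a Java runtime has to be available on the machine. The following is a minimal sketch of a pre-flight check; `require_cmd` is a hypothetical helper name introduced here for illustration and is not part of the pipeline.

```shell
# Hypothetical helper (not part of the pipeline): report whether a command
# is available on PATH. Returns non-zero when it is missing.
require_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "missing: $1" >&2
    return 1
  fi
}

# Cromwell needs Java; print a hint if it cannot be found.
require_cmd java || echo "Install a Java runtime before running cromwell-34.jar"
```

The same helper can be reused for other tools used in this tutorial, for example `require_cmd wget`.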

Download a SUBSAMPLED paired-end sample of ENCSR936XTK.

  $ wget https://storage.googleapis.com/encode-pipeline-test-samples/encode-chip-seq-pipeline/ENCSR936XTK/ENCSR936XTK_fastq_subsampled.tar
  $ tar xvf ENCSR936XTK_fastq_subsampled.tar

Download the pre-built genome database for hg38.

  $ wget https://storage.googleapis.com/encode-pipeline-genome-data/test_genome_database_hg38_chip.tar
  $ tar xvf test_genome_database_hg38_chip.tar
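
Both downloads are tar archives, and an interrupted download can leave an archive that fails to extract. As an optional sanity check, the sketch below tries to list each archive's contents without extracting anything. `check_tar` is a hypothetical helper name used only for this example.

```shell
# Hypothetical helper (not part of the pipeline): succeed only if the
# archive exists and tar can list its contents.
check_tar() {
  if tar tf "$1" >/dev/null 2>&1; then
    echo "ok: $1"
  else
    echo "unreadable or missing: $1" >&2
    return 1
  fi
}

# Check the two archives downloaded in the steps above.
for f in ENCSR936XTK_fastq_subsampled.tar test_genome_database_hg38_chip.tar; do
  check_tar "$f" || echo "Consider downloading $f again"
done
```

This only confirms that the archives are readable; it does not verify their contents against a checksum.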

Install Conda.

Install Conda dependencies.

  $ bash installers/uninstall_dependencies.sh  # to remove any existing pipeline env
  $ bash installers/install_dependencies.sh
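
Before running the pipeline it can help to confirm that the installer created the environment activated in the next step, `encode-chip-seq-pipeline`. The sketch below assumes the environment is registered by name in the output of `conda env list`; `env_listed` is a hypothetical helper introduced for illustration.

```shell
# Hypothetical helper (not part of the pipeline): read `conda env list`
# output on stdin and succeed only if an environment with the given name
# appears in the first column.
env_listed() {
  awk -v name="$1" '$1 == name { found = 1 } END { exit !found }'
}

if command -v conda >/dev/null 2>&1 && conda env list | env_listed encode-chip-seq-pipeline; then
  echo "environment found"
else
  echo "environment not found; re-run installers/install_dependencies.sh"
fi
```

Matching on the whole first column avoids false positives from environments whose names merely contain the same string.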

Run the pipeline on the test sample.

  $ source activate encode-chip-seq-pipeline # IMPORTANT!
  $ INPUT=examples/local/ENCSR936XTK_subsampled.json
  $ java -Dconfig.file=backends/backend.conf -jar cromwell-34.jar run chip.wdl -i "${INPUT}"

The run takes about an hour. All outputs are written to cromwell-executions/chip/[RANDOM_HASH_STRING]/. See the output directory structure documentation for details.
See the full specification for the input JSON file.
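
Because each run is written to a directory with a different [RANDOM_HASH_STRING], finding the most recent one by hand can be tedious. The sketch below picks the most recently modified subdirectory of cromwell-executions/chip. `latest_run_dir` is a hypothetical helper name, and it assumes that modification time is a good enough proxy for "latest run".

```shell
# Hypothetical helper (not part of the pipeline): print the most recently
# modified run directory under the given root (default:
# cromwell-executions/chip). Prints nothing if the root does not exist.
latest_run_dir() {
  root=${1:-cromwell-executions/chip}
  [ -d "$root" ] || return 0
  # List run directories newest first and keep the first entry.
  ls -td "$root"/*/ 2>/dev/null | head -n 1
}

run_dir=$(latest_run_dir)
if [ -n "$run_dir" ]; then
  echo "Latest run: $run_dir"
else
  echo "No runs found under cromwell-executions/chip"
fi
```

The printed path keeps its trailing slash, so it can be passed directly to commands such as `ls` or `du`.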
