- 
                Notifications
    
You must be signed in to change notification settings  - Fork 201
 
Add GATK ValidateVariants #390
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
          
     Open
      
      
            G-kodes
  wants to merge
  5
  commits into
  snakemake:master
  
    
      
        
          
  
    
      Choose a base branch
      
     
    
      
        
      
      
        
          
          
        
        
          
            
              
              
              
  
           
        
        
          
            
              
              
           
        
       
     
  
        
          
            
          
            
          
        
       
    
      
from
G-kodes:ValidateVariants
  
      
      
   
  
    
  
  
  
 
  
      
    base: master
Could not load branches
            
              
  
    Branch not found: {{ refName }}
  
            
                
      Loading
              
            Could not load tags
            
            
              Nothing to show
            
              
  
            
                
      Loading
              
            Are you sure you want to change the base?
            Some commits from the old base branch may be removed from the timeline,
            and old review comments may become outdated.
          
          
  
     Open
                    Changes from all commits
      Commits
    
    
            Show all changes
          
          
            5 commits
          
        
        Select commit
          Hold shift + click to select a range
      
      9376252
              
                Init ValidateVariants GATK
              
              
                G-kodes 7c2131b
              
                House cleaning
              
              
                G-kodes e523a03
              
                Removed PreLoader in favor of shell().prefix()
              
              
                G-kodes 65be8f5
              
                Merge branch 'snakemake:master' into ValidateVariants
              
              
                G-kodes 0819f4f
              
                Merge branch 'master' into ValidateVariants
              
              
                johanneskoester File filter
Filter by extension
Conversations
          Failed to load comments.   
        
        
          
      Loading
        
  Jump to
        
          Jump to file
        
      
      
          Failed to load files.   
        
        
          
      Loading
        
  Diff view
Diff view
There are no files selected for viewing
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | 
|---|---|---|
| @@ -0,0 +1,7 @@ | ||
| channels: | ||
| - bioconda | ||
| - conda-forge | ||
| - defaults | ||
| dependencies: | ||
| - gatk4 ==4.1.4.1 | ||
| - snakemake-wrapper-utils ==0.1.3 | 
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | 
|---|---|---|
| @@ -0,0 +1,12 @@ | ||
| name: GATK ValidateVariants | ||
| description: Interleave two paired-end FASTA/Q files | ||
| url: https://gatk.broadinstitute.org/hc/en-us/articles/360037057272-ValidateVariants | ||
| authors: | ||
| - Graeme Ford | ||
| input: | ||
| vcf: VCF file to be validated | ||
| output: | ||
| - VCF output file | ||
| params: | ||
| extra: any extra commands as a string | ||
| notes: Multiple threads can be used during compression of the output file with ``pigz``. | 
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | ||||
|---|---|---|---|---|---|---|
| @@ -0,0 +1,13 @@ | ||||||
| rule vcf_spec_validation: | ||||||
| input: | ||||||
| vcf="sample.vcf", | ||||||
| output: | ||||||
| "results/sample_VALID.vcf", | ||||||
| log: | ||||||
| "results/sample_VALID.log", | ||||||
| params: | ||||||
| R="genome.fasta", | ||||||
| 
         There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I cannot find this to be used in the wrapper. On the other hand, extra is missing. Maybe you meant to write 
        Suggested change
       
    
  | 
||||||
| resources: | ||||||
| mem_mb=1024, | ||||||
| wrapper: | ||||||
| "master/bio/gatk/validatevariants" | ||||||
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | 
|---|---|---|
| @@ -0,0 +1,3 @@ | ||
| @HD VN:1.5 | ||
| @SQ SN:ref LN:45 M5:7a66cae8ab14aef8d635bc80649e730b UR:file:/home/johannes/scms/snakemake-wrappers/bio/picard/createsequencedictionary/test/genome.fasta | ||
| @SQ SN:ref2 LN:40 M5:1636753510ec27476fdd109a6684680e UR:file:/home/johannes/scms/snakemake-wrappers/bio/picard/createsequencedictionary/test/genome.fasta | 
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | 
|---|---|---|
| @@ -0,0 +1,4 @@ | ||
| >ref | ||
| AGCATGTTAGATAAGATAGCTGTGCTAGTAGGCAGTCAGCGCCAT | ||
| >ref2 | ||
| aggttttataaaacaattaagtctacagagcaactacgcg | 
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | 
|---|---|---|
| @@ -0,0 +1,2 @@ | ||
| ref 45 5 45 46 | ||
| ref2 40 57 40 41 | 
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | 
|---|---|---|
| @@ -0,0 +1,18 @@ | ||
| ##fileformat=VCFv4.0 | ||
| ##fileDate=20170110 | ||
| ##source=pindel | ||
| ##reference=hg38 | ||
| ##INFO=<ID=END,Number=1,Type=Integer,Description="End position of the variant described in this record"> | ||
| ##INFO=<ID=HOMLEN,Number=1,Type=Integer,Description="Length of base pair identical micro-homology at event breakpoints"> | ||
| ##INFO=<ID=PF,Number=1,Type=Integer,Description="The number of samples carry the variant"> | ||
| ##INFO=<ID=HOMSEQ,Number=.,Type=String,Description="Sequence of base pair identical micro-homology at event breakpoints"> | ||
| ##INFO=<ID=SVLEN,Number=1,Type=Integer,Description="Difference in length between REF and ALT alleles"> | ||
| ##INFO=<ID=SVTYPE,Number=1,Type=String,Description="Type of structural variant"> | ||
| ##INFO=<ID=NTLEN,Number=.,Type=Integer,Description="Number of bases inserted in place of deleted code"> | ||
| ##FORMAT=<ID=PL,Number=3,Type=Integer,Description="Normalized, Phred-scaled likelihoods for genotypes as defined in the VCF specification"> | ||
| ##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype"> | ||
| ##FORMAT=<ID=RD,Number=1,Type=Integer,Description="Reference depth, how many reads support the reference"> | ||
| ##FORMAT=<ID=AD,Number=2,Type=Integer,Description="Allele depth, how many reads support this allele"> | ||
| #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT a | ||
| ref 8 . A G . PASS END=140434574;HOMLEN=5;HOMSEQ=AAAAA;SVLEN=35;SVTYPE=INS GT:AD 0/0:317,6 | ||
| ref2 10 . A AGTTA . PASS END=55238278;HOMLEN=7;HOMSEQ=CTGCCAC;SVLEN=-51;SVTYPE=DEL GT:AD 0/0:40789,1734 | 
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | 
|---|---|---|
| @@ -0,0 +1,25 @@ | ||
| __author__ = "Graeme Ford" | ||
| __copyright__ = "Copyright 2021, Graeme Ford" | ||
| __email__ = "[email protected]" | ||
| __license__ = "MIT" | ||
| 
     | 
||
| from snakemake.shell import shell | ||
| from snakemake_wrapper_utils.java import get_java_opts | ||
| 
     | 
||
| extra = snakemake.params.get("extra", "") | ||
| java_opts = get_java_opts(snakemake) | ||
| 
     | 
||
| log = snakemake.log_fmt_shell(stdout=True, stderr=True) | ||
| 
     | 
||
| extra = snakemake.params.get("extra", "") | ||
| 
     | 
||
| 
     | 
||
| shell( | ||
| "gatk " | ||
| "--java-options '{java_opts}' " | ||
| "ValidateVariants " | ||
| "-V {snakemake.input.vcf} " | ||
| "{extra} " | ||
| "{log} " | ||
| "> {output}" | ||
| ) | 
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think the output should just have a .txt suffix or so, since this tool will not create a vcf file, right?