Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

call_cnv output .cnv.bed is empty #6

Open
yingchen69 opened this issue Mar 31, 2017 · 6 comments
Open

call_cnv output .cnv.bed is empty #6

yingchen69 opened this issue Mar 31, 2017 · 6 comments

Comments

@yingchen69
Copy link

Hi,

I am trying to use CLAMMS to make cnv calls to my targeted panel NGS data. All steps went well except the last step with call_cnv. My bam files are mapped to ucsc hg19, so all chromosomes are started with "chr". call_cnv just gave a segmentation error message with no detail. So I removed "chr" from all chromosome names in all bed files and re-sorted all bed files. This time the call_cnv did not throw out segmentation error message, but the output cnv.bed file is empty. I attached the models.bed and one sample norm.cov.bed file here.

Thanks a lot for the help!

Ying

clamms_models.bed.txt
WGC085336U.norm.cov.bed.txt

@rgcgithub
Copy link
Owner

Your files look good to me, and I also get no output from call_cnv for your sample. I noticed that you have only ~5k targets, much fewer than whole exome capture. Before I dig too deep, are you sure that your sample has any CNVs in your targeted regions? It's entirely possible that there are no high-confidnece CNVs, which would produce no output.

If you aren't sure, I would try running more samples or using a sample with a known CNV as a test case. You can try increasing the --cnv_rate parameter (increases sensitivity) and also the --mean_cnv_length parameter (used to define the expected CNV size in genomic distance, which may be different from our default parameters in your target platform) if the default settings do not pick up what you are expecting.

Evan

@yingchen69
Copy link
Author

yingchen69 commented Apr 4, 2017 via email

@rgcgithub
Copy link
Owner

I'm guessing that CNVs in those 223 genes are rare, so it's entirely possible that your one test sample had no CNVs detected by CLAMMS. Before we assume that's a bug, I would test the other samples in your cohort and see what you pick up. If that yields no results we can look into it further.

Evan

@yingchen69
Copy link
Author

yingchen69 commented Apr 8, 2017 via email

@rgcgithub
Copy link
Owner

I'm surprised that you see no CNVs in your entire cohort. In fact, if you look at your models file, cols 11-17 provide a breakdown of the number of samples in the training set (reference panel) estimated at each copy number (0-6). You'll see that at most targets, column 13 (diploid state) has ~500 samples implying the full reference panel appear diploid. But there are regions where you'll see a consistent non-zero count in column 12 (het deletion) or column 14 (het duplication). For example, if you look at the known, recurrent locus 1q21.1 (1:147083326-147381445), you'll see you have a sequence of 2.0's in the het deletion column. So you should have two carriers of that deletion in your cohort. (You also appear to have one sample in copy number 0 over ~ all targets, likely a failed sample).

So while you shouldn't base the calls off the numbers in the model file, it at least gives you an idea that there should be a handful of CNV carriers in your cohort. If you aren't picking them up, I'd first try increasing the --cnv_rate parameter in the call_cnv module if you're using the default value (3.0e-8). Your normalized coverage variance looks pretty clean, so you could likely go several orders of magnitude - maybe 5.0e-6 to start?

Good luck,
Evan

@rgcgithub
Copy link
Owner

rgcgithub commented Apr 10, 2017

PS - Here's a test you should run to make sure you have the right input parameters set.

Using the data you sent, you can force a synthetic CNV to occur in your chrX regions by trying both "M" and "F" for the --sex parameter. It will give you a duplication of X if it's a female, deletion of X if it's male.

./call_cnv WGC085336U.norm.cov.bed.txt clamms_models.bed.txt --sex M
X 31115743 153650115 X:31115743-153650115 WGC085336U DUP 2 196 999 0.972 NA NA NA NA 91 22104 99 -991
./call_cnv WGC085336U.norm.cov.bed.txt clamms_models.bed.txt --sex F
no output

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants