-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ValueError: shape mismatch #9
Comments
Hi Nicole @nt365, Best regards! |
Thank you so much, I am now getting a new error (attached). What I realized in the last week is that Phylo-Hmrf does not convert interaction points in the hic files to the reference coordinates. Originally I thought that I could submit the hic coordinates specific to the species. Can you confirm that I need to convert the species specific coordinates to the reference genome coordinates prior to running phylo-hmrf on the data? Thank you, |
Hi Nicole @nt365, For the second question, yes, please convert the species specific coordinates of the Hi-C data to the reference genome coordinates prior to running phylo-hmrf. Phylo-hmrf does not perform the coordinate conversion. We did the coordinate conversion in the data preprocessing. For each interaction, we converted the coordinates in the genome of the corresponding species to the reference genome. I am very sorry I did not write clear descriptions in the documentation. I will update the documentation soon. Please let me know if you have any other questions. Many thanks! Best regards! |
Hi Yang, Thank you so much for the clarification. I fixed the floating point numbers in the synteny blocks and converted the hic interaction points to the reference coordinates. I ran Phylo-HMRF with the new utility file on two species as a test and received the following errors in "error_1.txt". I then commented out lines 378-393 in utility.py but then I get "error_2.txt". Any feedback you could offer here would be appreciated. Thank you, |
Dear Nicole @nt365, For the errors in the error_2.txt, I think it is because the definition of the variable region_list was commented, but later it is to be printed by the program. I do apologize for this problem and apologize for the inconvenience caused. I have updated the code on Line 385-390 of utility.py accordingly. Please change region_points_vec on Line 385 according to the data in your study, or please comment Line 385-390 if region_points_vec is not needed. Region_points_vec was used to store the centromere positions of chr3 and chr6 in genome hg38. The large synteny blocks on chr3 and chr6 were divided into smaller parts according to the centromere positions to reduce the computation cost. Please see the updated utility.py. Please let me know if you have any other questions. I do apologize for the delay of my reply again. Many thanks! Best regards! |
Hello,
I have created all of the files to run phylo-hmrf with the correct formatting and file names. I am not using hg38 so I commented out lines 387-393 in utility.py, please let me know if this is not the correct approach. I also tried tried to run the software without commenting out these lines and the same error resulted.
The software starts running and then results in the error in the file attached. If you could lend feedback about what might be causing this error that would be greatly appreciated.
Thank you,
Nicole
I run the following command:
python phylo_hmrf.py -r 1 --resolution 5000 --chromvec 1 --ref_species dmel -p ./dmel_phylo/
phylohmrf_error.txt
The text was updated successfully, but these errors were encountered: