You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! I was using this package to generate OTU assignments and to understand the communities for a set of metagenomic raw reads that I have. While converting the table to a wide format suitable for phyloseq objects, I noticed that sequences within the same OTU were having different levels of taxonomic assignment (Some upto order and other upto genus level for example).
This conversion to a wide format was done through a custom R script and cause some problems especially while building the taxonomy table since there were these varying taxonomic assignments for the same OTU. This issue didn't occur when I used the function in the package to get the wide format OTU table.
My questions are:-
Why is this discrepancy occurring in the first place?
As far as resolving through the functions in the package, does the code only retain the taxonomic classification till the point where there is agreement between all the assignments? Is that how this issue is being resolved through the package.
Thanks in Advanced!
The text was updated successfully, but these errors were encountered:
I think you have guessed right - the taxonomic assignment of each OTU is based on a summary of the taxonomic assignment of all reads that go into that OTU. So if an OTU has 2 sequences where one is from the phylum actinobacteria and the other is from acidobacteria, then the OTU gets assigned to bacteria only. But then if you ran the next sample and there was only 1 sequence in the OTU, assigned to actinobacteria, then the OTU from the second sample would be assigned Bacteria; Actinobacteria.
There is some pesky information loss when you are summarising a pre-existing OTU table, since the taxonomic classification of the reads that went into each OTU are not known (not in the OTU table), so it may make a decision different than if the full information was available.
taxonomic assignment is based on the protein sequence, when the nucleotide sequence might provide better resolution, so in the future we will explore that e.g. by believing close OTU matches to GTDB genomes' sequences.
Hello! I was using this package to generate OTU assignments and to understand the communities for a set of metagenomic raw reads that I have. While converting the table to a wide format suitable for phyloseq objects, I noticed that sequences within the same OTU were having different levels of taxonomic assignment (Some upto order and other upto genus level for example).
This conversion to a wide format was done through a custom R script and cause some problems especially while building the taxonomy table since there were these varying taxonomic assignments for the same OTU. This issue didn't occur when I used the function in the package to get the wide format OTU table.
My questions are:-
Why is this discrepancy occurring in the first place?
As far as resolving through the functions in the package, does the code only retain the taxonomic classification till the point where there is agreement between all the assignments? Is that how this issue is being resolved through the package.
Thanks in Advanced!
The text was updated successfully, but these errors were encountered: