Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dudes revision #4

Open
pirovc opened this issue Feb 23, 2024 · 0 comments
Open

dudes revision #4

pirovc opened this issue Feb 23, 2024 · 0 comments

Comments

@pirovc
Copy link
Owner

pirovc commented Feb 23, 2024

Due to the recent update to v0.10.0, promising results with metaproteomics data (thanks to @rababerladuseladim) and the potential to use of the core methodology from DUDes in different applications, I suggest a couple of changes and features for this repository:

  • code refactoring
    • simplification in many areas, removal of unnecessary classes and better structure
    • use of multitax to handle taxonomy. This will simplify a good part of the code and allow usage of the method with many other existing and custom taxonomies (e.g. GTDB)
    • removal of GI support for genomic data, since it is long outdated
    • properly parse acc2txid files similar to this.
    • re-evaluate the role and necessity of having a dudesdb
  • generalization of the deepest uncommon descendant (DUD) method
    • independence of input data type
    • standardize the inputs and outputs for the core method
    • create standard modules for specific data types, for example the parse_sam.py for metagenomics and parse_diamond_blast.py for metaproteomics, which could be easily extended to other types
    • possibility to input raw text data
  • infrastructure
    • increase and improve testing and CI
    • documentation
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant