Skip to content

Conversation

@mthang
Copy link
Contributor

@mthang mthang commented Aug 27, 2025

FOR CONTRIBUTOR:

  • I have read the CONTRIBUTING.md document and this tool is appropriate for the tools-iuc repo.
  • License permits unrestricted use (educational + commercial)
  • This PR adds a new tool or tool collection
  • This PR updates an existing tool or tool collection
  • This PR does something else (explain below)

This is a collection of tools for Autocycler - https://github.com/rrwick/Autocycler .

@mthang
Copy link
Contributor Author

mthang commented Sep 23, 2025

@bernt-matthias Can you review this autocycler suite and merge it when you can. Many thanks

@mthang
Copy link
Contributor Author

mthang commented Sep 26, 2025

@bernt-matthias Thank you for reviewing the tools . I am working on the improvements at the moment and will resubmit them when I am done.

@bgruening
Copy link
Member

@mthang please ping us when you think a new review is needed :)

Thanks a lot this looks great!

@mthang
Copy link
Contributor Author

mthang commented Oct 15, 2025

@bgruening @bernt-matthias It took me a while to fix the wrapper and the test-data. Can you please review the wrappers again. Many thanks.

@bgruening
Copy link
Member

The shed.yml was for some reason broken, tests are now running. Lets see what happens :)

Thanks a lot @mthang

Copy link
Contributor

@bernt-matthias bernt-matthias left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your efforts. I added some more comments. Some apply for many/all tools.

<test>
<param name="in_gfa" value="consensus_assembly_unfiltered.gfa"/>
<param name="remove" value="743,782"/>
<output name="cleaned_gfa" file="clean.gfa" ftype="gfa1">
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Having file and assert_contents seems redundant.

Copy link
Contributor Author

@mthang mthang Oct 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I removed the assert_contents and keep the file only. Should I remove all assert_contents if I use "file" in my test_unit?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If an assertion captures the essence of the test I would even prefer the assertion over the file.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added the assertion back to the autocycler_clean.xml to ensure the test works. The planemo failed after I removed the file="clean.gfa" ftype="gfa1"

</inputs>
<outputs>
<collection name="subsampled_reads" type="list" label="${tool.name} on ${on_string}: subsampled reads">
<discover_datasets pattern="(?P&lt;designation&gt;.+)\.fastq" directory="out_dir" format="fastq" />
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

infer format from input?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Like so: ae231f6 ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for providing the changes ! I was puzzled what "infer format from input" means in Galaxy. I understand now. As you probably know, the autocycler subsample produces uncompressed fastq files instead of compressed fastq.gz. That's the reason I didnt update the test section. Anyway, thanks a lot!

Copy link
Contributor

@bernt-matthias bernt-matthias left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the efforts. This looks nearly ready to go :)

@bernt-matthias bernt-matthias marked this pull request as draft November 3, 2025 16:18
@bernt-matthias
Copy link
Contributor

Hi @mthang I took some time to rewrite quite a bit to have a more Galaxy look-and-feel, i.e. mainly I removed the nested collectiond and split collections containing fasta and gfa.

What do you think?

<inputs>
<param argument="--reads" type="data" format="fastqsanger,fastqsanger.gz" label="Input long reads (FASTQ format)"/>
<param name="assembler" type="select" label="Tool">
<option value="genome_size">genome_size</option>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was thinking if we should only support the genome_size command?

We have tools for most assemblers, except for metamdbg, plassembler, lja, and myloasm.

This would also reduce the number of requirements.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants