[New Tutorial] Annotate, prepare tests and publish on workflow registries Galaxy workflows #6119

Open: wants to merge 12 commits into base: main

Conversation

clsiguret
Contributor

Hello 😄

With @bebatut and @nagoue, we have written a new tutorial on workflow FAIRification in Galaxy.

We got the idea while working on the Hackathon MicroGalaxy IWC event.

We will be presenting this tutorial at the "Printemps de la donnée" event next week.

Thanks a lot! 😃

Member

Is it possible to also add an SVG so we can adjust the image in the future?

Member

This image comes directly from the journal. We do not have an SVG.

Member

We can still adjust the display size in the tutorial itself

Contributor Author

I don't see any SVG version in the publication 😬

Member

@bgruening bgruening left a comment

This is so nice. Thanks a lot. A few comments inline.


Research data is accumulating at an unprecedented rate, presenting significant challenges for achieving fully reproducible science. As a result, implementing high-quality management of scientific data has become a global priority. One key aspect of this effort is the use of computational workflows, which describe the complex, multi-step methods used for data analysis. In Galaxy, workflows are a powerful feature that allows researchers to link multiple steps of complex analyses seamlessly. To maximize their impact, these workflows should adhere to best practices that make them **FAIR: Findable, Accessible, Interoperable, and Reusable**.

The FAIR principles (Findable, Accessible, Interoperable, and Reusable) provide practical guidelines for enhancing the value of research data:

Member

maybe link to our FAIR trainings?

Member

Added a comment box below


# Prepare the workflow

Here, we will use a workflow running FastQC and MultiQC, but you can use your own workflow.

Member

Can we use Falco instead of FastQC? Or Fastp?

Member

Good point. I will change it in the tutorial.

Member

Can Falco output be used in MultiQC? I cannot find Falco in the list of tools supported by MultiQC.

Member

Falco is a drop-in replacement for FastQC, so you can use the FastQC module of MultiQC.

Member

Will try then

> {: .solution}
{: .question}

Galaxy workflow interface supports some metadata. Are they enough to fulfill Tip 2 and also Tip 1 *"Register the workflow."*. WorkflowHub, a workflow registry we will explain more later, supports the [following metadata](https://about.workflowhub.eu/docs/metadata-list/):
Member

> Galaxy workflow interface supports some metadata. Are they enough to fulfill Tip 2 and also Tip 1 "Register the workflow.".

Is that a question?

- Tip 4 (**Accessibility**): *"Provide example input data and results along with the workflow."*
- Tip 10 (**Reusability**): *"Provide clear and concise workflow documentation."*

The workflow annotation provides a short and concise description of the workflow, so Tip 10 is partially fulfilled. In addition to the workflow annotation, each step could be annotated so that users understand its purpose.

Member

Can we mention here the fancy new workflow annotations from Laila as well? markdown inside the workflow, color encoded separations etc ... they are Galaxy specific but are imho also enhancing the documentation.

Member

Good idea. Where do you have documentation for that?
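
For anyone who wants to check how much of this metadata a workflow already carries, here is a minimal Python sketch. It assumes the exported `.ga` file is JSON with top-level keys `name`, `annotation`, `creator`, `license` and `tags`, plus a `steps` mapping whose entries each have an `annotation` field. Those key names and the `missing_metadata` helper are assumptions for illustration, so compare them against a real export before relying on them.

```python
import json

# Top-level keys assumed to carry workflow-level metadata in a Galaxy .ga export.
WORKFLOW_FIELDS = ("name", "annotation", "creator", "license", "tags")


def missing_metadata(workflow: dict) -> dict:
    """Return empty workflow-level fields and the ids of steps without an annotation."""
    missing_fields = [field for field in WORKFLOW_FIELDS if not workflow.get(field)]
    steps = workflow.get("steps", {})
    unannotated = [
        step_id for step_id, step in steps.items() if not step.get("annotation")
    ]
    return {"fields": missing_fields, "steps": sorted(unannotated, key=int)}


# Hypothetical, heavily trimmed workflow used only for illustration.
example = {
    "name": "Quality control",
    "annotation": "Run Falco on raw reads and summarise with MultiQC",
    "license": "MIT",
    "tags": [],
    "steps": {
        "0": {"label": "Raw reads", "annotation": "FASTQ input"},
        "1": {"label": "Falco", "annotation": ""},
        "2": {"label": "MultiQC", "annotation": "Aggregate the reports"},
    },
}

report = missing_metadata(example)
print(json.dumps(report))
```

With a real workflow, `example` would be replaced by `json.load(...)` on the exported file. An empty report would suggest that both the workflow-level and step-level annotations are filled in.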


### Find input datasets

To test the workflow, we need input datasets. To fulfill Tip 4 (*"Provide example input data and results along with the workflow."*), we generated a toy dataset. We now need to publish it to [**Zenodo**](https://zenodo.org/) to obtain a permanent URL, which also allows others to **easily retrieve and reuse the data** when running or validating the workflow.
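
As a rough illustration of how a permanent URL can be recorded next to a checksum for later testing, here is a small Python sketch. The record id, the file name, the `https://zenodo.org/records/<id>/files/<name>` URL layout and the entry keys (`class`, `location`, `filetype`, `hashes`) are all assumptions modeled on typical workflow test job files. They are not taken from this tutorial.

```python
import hashlib
import json

# Hypothetical Zenodo record id and file name, used only for illustration.
RECORD_ID = "0000000"
FILENAME = "toy_reads.fastqsanger.gz"


def zenodo_file_url(record_id: str, filename: str) -> str:
    """Build a file URL, assuming the https://zenodo.org/records/<id>/files/<name> layout."""
    return f"https://zenodo.org/records/{record_id}/files/{filename}"


def describe_input(record_id: str, filename: str, filetype: str, content: bytes) -> dict:
    """Describe one test input: where to fetch it and the SHA-1 expected after download."""
    return {
        "class": "File",
        "location": zenodo_file_url(record_id, filename),
        "filetype": filetype,
        "hashes": [
            {"hash_function": "SHA-1", "hash_value": hashlib.sha1(content).hexdigest()}
        ],
    }


# Stand-in bytes; in practice, read the file that was uploaded to Zenodo.
toy_content = b"@read1\nACGT\n+\nIIII\n"
entry = describe_input(RECORD_ID, FILENAME, "fastqsanger.gz", toy_content)
print(json.dumps(entry, indent=2))
```

Storing a checksum with the URL lets anyone rerunning the workflow confirm that they retrieved exactly the dataset that was published.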
3 participants