Currently --pipeline takes an alias or a directory, resolving to a directory containing knowledge.yaml, grounded_skills.yaml, and freeform_skills.yaml and the appropriate pipeline is chosen based on the data provided (aka "taxonomy routing"
There is a use case where, during experimentation, the user may want to run a specific pipeline directly
See instructlab/dev-docs#109 (comment) where we discussed a "router" design for this use case
This would allow ilab data generate support the starting point for data generation tasks use case described in aakankshaduggal@61eff48 rather adding a new entrypoint