Dear Cosmos Team,
Thank you for doing the community a service by releasing your models open-source. In the report you dedicate section 5.2.4 to describing how you optimized the inference time speed of Cosmos-Predict-4B to real-time with a variety of techniques including Medusa speculative decoding.
Would you consider releasing this fine-tuned model to the public too? I am confident many researchers like myself would find this extremely useful for downstream use cases that require real-time speed.
Thank you!
Dear Cosmos Team,
Thank you for doing the community a service by releasing your models open-source. In the report you dedicate section 5.2.4 to describing how you optimized the inference time speed of Cosmos-Predict-4B to real-time with a variety of techniques including Medusa speculative decoding.
Would you consider releasing this fine-tuned model to the public too? I am confident many researchers like myself would find this extremely useful for downstream use cases that require real-time speed.
Thank you!