Why would you consider continuing to use pre-training data like COYO and LAION while dropping LAION-COCO? I think it would be more reasonable to add more caption datasets, such as TextCaps, LAION-COCO, and COCO Captions. Besides, what happens if you do not use the pre-training data in the multi-task pre-training stage? Would the representation of the visual encoder be affected?
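For concreteness, here is a minimal sketch of what such a caption-heavy data mixture could look like as a weighted-sampling config. All dataset names and weights below are illustrative assumptions on my part, not the actual configuration used in this repo:

```python
# Hypothetical multi-task pre-training mixture: large noisy web-pair
# corpora kept at reduced weight, cleaner caption datasets upweighted.
# Weights are placeholders, not the authors' real setup.
import random

MIXTURE = {
    "coyo": 0.30,
    "laion": 0.30,
    "laion_coco": 0.20,
    "textcaps": 0.10,
    "coco_captions": 0.10,
}

def sample_dataset(rng: random.Random) -> str:
    """Draw the source dataset for the next training example by mixture weight."""
    names = list(MIXTURE)
    weights = list(MIXTURE.values())
    return rng.choices(names, weights=weights, k=1)[0]

if __name__ == "__main__":
    rng = random.Random(0)
    print([sample_dataset(rng) for _ in range(10)])
```

The question is essentially whether shifting weight from noisy web pairs toward curated caption sets like the above would help, or whether removing the pre-training data entirely from this stage would degrade the visual encoder's representations.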