finetune with fixed context_length and pred_length #75

NaNYeYuan · 2024-06-19T10:14:18Z

I am trying to fine-tune with a fixed context_length and pred_length by loading the training data with SimpleEvalDatasetBuilder.
However, the predictions at evaluation time are extraordinarily large.
What is the right way to fine-tune with a fixed context_length and pred_length?

NaNYeYuan · 2024-06-19T11:25:32Z

By the way, how do I get predictions from a raw sequence list?

chenghaoliu89 · 2024-12-04T09:53:25Z

Hi @NaNYeYuan, sorry for the late response.

The default_train_transform defines the data processing pipeline. If you want a fixed context_length and pred_length, I would suggest modifying the MaskedPrediction class, which by default samples the prediction length and context length randomly.

This seems to be a commonly needed feature for model fine-tuning, so we will implement a FixedMaskedPrediction in the future.
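
To illustrate the difference between random and fixed masking, here is a minimal sketch in plain NumPy. It is not the library's MaskedPrediction code: the function name `fixed_window` and its signature are hypothetical, and it works on raw time steps, whereas the real transform may operate on patches. It only shows the idea of cropping a window of exactly context_length + pred_length steps and always marking the last pred_length steps as the prediction target.

```python
import numpy as np


def fixed_window(series, context_length, pred_length):
    """Hypothetical helper: crop the last (context_length + pred_length) steps
    of a 1-D series and return the crop plus a boolean prediction mask
    (False = context step, True = step to be predicted)."""
    series = np.asarray(series, dtype=float)
    if context_length <= 0 or pred_length <= 0:
        raise ValueError("context_length and pred_length must be positive")
    window = context_length + pred_length
    if series.shape[0] < window:
        raise ValueError("series is shorter than context_length + pred_length")

    cropped = series[-window:]
    # The split point is fixed, not sampled, so every training window
    # has the same context/prediction layout as at evaluation time.
    mask = np.zeros(window, dtype=bool)
    mask[context_length:] = True
    return cropped, mask


series = np.arange(10.0)
cropped, mask = fixed_window(series, context_length=4, pred_length=2)
context, target = cropped[~mask], cropped[mask]
```

A fixed variant of MaskedPrediction would apply the same idea inside the transform pipeline: replace the randomly sampled split with a constant one so that training windows match the evaluation setup.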

chenghaoliu89 added the enhancement ("New feature or request") label on Dec 4, 2024
