Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Self-critical sequence training #11

Open
Stephanie-Shen324 opened this issue May 16, 2021 · 2 comments
Open

Self-critical sequence training #11

Stephanie-Shen324 opened this issue May 16, 2021 · 2 comments

Comments

@Stephanie-Shen324
Copy link

Stephanie-Shen324 commented May 16, 2021

Hi,

First of all, thank you for sharing your excellent work and nicely-written code!

I'm wondering if you have tried implementing self-critical sequence training for this model? I am currently trying to implement it on Google Colab. But I always get a CUDA Out Of Memory issue. If you have tried, did you encounter similar issues?

Best wishes,
Stephanie Shen

@zhjohnchan
Copy link
Contributor

Dear @Stephanie-Shen324,

MAKE SURE you set torch.with_no_grad() when you sample a baseline report when performing SCST. This always causes the CUDA OOM problem.

Best,
Zhihong

@wlufy
Copy link

wlufy commented Mar 28, 2022

Hi,

First of all, thank you for sharing your excellent work and nicely-written code!

I'm wondering if you have tried implementing self-critical sequence training for this model? I am currently trying to implement it on Google Colab. But I always get a CUDA Out Of Memory issue. If you have tried, did you encounter similar issues?

Best wishes, Stephanie Shen

Hi, sorry to disturb you.
I am trying to implement self-critical sequence training for this model. But i have some difficulties, i can't understand the differences between labels and gts. Like the following:
def forward(self, fc_feats, att_feats, labels, masks, att_masks, gts, gt_indices, sc_flag, struc_flag, drop_worst_flag):
Could you share the code with me? Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants