Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Notebooks don't show up pretty in the Github diff viewer. You can see the pretty notebook here: https://github.com/eihli/NEKO/blob/vqa-eval-exploration/dev_notebooks/vqa_playground.ipynb
Explores the difference between training and evaluation runs. Trying to find why evaluation loss doesn't decrease.
Here's something that I found curious. The evaluation generates the correct target answer. But it includes some additional text after the correct text.
See the last screenshots.
Target answer: breakfast
Predicted answer : breakfast Haw
Target answer: breakfast
Predicted answer : breakfaststasy
etc...
The evaluation