Which one is the best one? #6
Sorry for the late response; I have been busy. After my experiments, I found that DSHRED-WA is the best one, but it also takes a long time to converge. I recommend following GPT-2, which looks more promising going forward. I will also release a package for transformer-based dialog models in about a month.
Thanks for the response! I find GPT-2 pretty good, but I wanted to know if an RNN model of the same size could potentially beat it.
Lol, that is also the motivation of this repo. But the transformer-based models seem more powerful than the RNN-based ones. If you have some ideas, we can discuss improving the RNN-based models.
But was there a fair comparison (a model with the same number of parameters as GPT-2, trained on the same data)?
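For reference, a parameter-budget check for the kind of fair comparison asked about here can be sketched in plain Python. The layer sizes below are hypothetical, chosen only to illustrate how far a small GRU stack is from GPT-2 small (~124M parameters):

```python
def gru_param_count(input_size, hidden_size, num_layers):
    """Parameters of a standard (PyTorch-convention) GRU stack:
    per layer, weight_ih (3H x in), weight_hh (3H x H), and two 3H biases."""
    total = 0
    for layer in range(num_layers):
        in_dim = input_size if layer == 0 else hidden_size
        total += 3 * hidden_size * (in_dim + hidden_size)  # weight matrices
        total += 2 * 3 * hidden_size                       # bias vectors
    return total

# Hypothetical 2-layer GRU with a GPT-2-sized hidden dim (768)
print(gru_param_count(768, 768, 2))  # 7087104, i.e. ~7.1M vs GPT-2 small's ~124M
```

Matching the budget would mean scaling the RNN's depth or width until this count is comparable, before training both on the same data.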
Hi, I compared against the GPT-2 model (from transformers), trained from scratch on the same data. GPT-2 achieves a better distinct score (better diversity), but its BLEU and embedding-based scores are similar to those of the other models. Maybe I will use human annotation to measure performance in the future. Sorry for the late response 😅
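The distinct score mentioned above is usually distinct-n: the ratio of unique n-grams to total n-grams across all generated responses. A minimal sketch, assuming the responses are already tokenized:

```python
def distinct_n(responses, n):
    """Distinct-n: unique n-grams / total n-grams over all generated responses."""
    total = 0
    unique = set()
    for tokens in responses:
        ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        total += len(ngrams)
        unique.update(ngrams)
    return len(unique) / total if total else 0.0

# Toy example: repetitive, generic replies score low on diversity
replies = [["i", "am", "fine"], ["i", "am", "fine"], ["i", "am", "ok"]]
print(distinct_n(replies, 1))  # 4 unique unigrams / 9 total
print(distinct_n(replies, 2))  # 3 unique bigrams / 6 total
```

Higher distinct-1/distinct-2 means less repetitive generation, which is where transformer dialog models typically pull ahead of RNN baselines.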
Hi! Thank you for your work on this repo.
So, after all your testing, which architecture is best in terms of generation quality?