Hi all! Thanks for the great piece of work. I find that there are more terms in the code than in the paper, and their names don't exactly match. Could u please confirm my identification of the rewards in your function against the ones in the code? 
Hi all!
Thanks for the great piece of work.
I find that there are more terms in the code than in the paper, and their names don't exactly match.
Could u please confirm my identification of the rewards in your function against the ones in the code?