The paper computes the similarity with a cosine-based function, but the code here uses torch.bmm, which is a plain batched dot product with no normalization. Perhaps the exact choice of similarity function is not important.
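To make the difference concrete, here is a minimal sketch comparing the two computations. The tensor names and shapes are made up for illustration and are not taken from the repository or the paper. It shows that torch.bmm matches cosine similarity only if the inputs are L2-normalized first.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical shapes: batch of 2, with 3 and 4 feature vectors of dimension 8.
a = torch.randn(2, 3, 8)
b = torch.randn(2, 4, 8)

# Raw dot-product similarity, as torch.bmm computes it: shape (2, 3, 4).
dot_sim = torch.bmm(a, b.transpose(1, 2))

# Cosine similarity: L2-normalize along the feature dimension, then take the same product.
a_unit = F.normalize(a, p=2, dim=-1)
b_unit = F.normalize(b, p=2, dim=-1)
cos_sim = torch.bmm(a_unit, b_unit.transpose(1, 2))

# Reference value from the built-in, broadcasting over all pairs of vectors.
cos_ref = F.cosine_similarity(a.unsqueeze(2), b.unsqueeze(1), dim=-1)

print(torch.allclose(cos_sim, cos_ref, atol=1e-5))
```

The dot product also scales with the vector norms, so the two scores can rank pairs differently unless the features are normalized somewhere upstream.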