Hello, this is brilliant work, and I would like to use the binary Gumbel-Softmax in my own project. However, I have run into some problems.
![WeChat screenshot 20211206151218](https://user-images.githubusercontent.com/49725436/144803418-062d98e6-2450-4ff1-96f7-abec947d0600.png)
![later](https://user-images.githubusercontent.com/49725436/144803651-cd2a9dc2-6054-4373-9bc8-a6674b4d57fb.png)
I used the soft mask for the first layer only (I simply apply the generated mask to the features after the first layer), and I observed something strange: the Gumbel noise seems to influence the training process too much. I plotted only the sparsity loss and found that I usually cannot reach the sparsity target I set. Is this behavior expected?
temp=5.0
temp=1.0
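
For reference, the setup described above can be sketched roughly as follows. This is a minimal pure-Python illustration, not the repository's implementation: the function names, the squared-error form of the sparsity loss, the all-zero logits, and the 0.5 target are all assumptions made for the example. It draws a soft binary Gumbel-Softmax mask at the two temperatures mentioned above and reports the average sparsity loss, which is one way to check how much of the fluctuation comes from the noise alone.

```python
import math
import random


def sample_gumbel(rng):
    # Gumbel(0, 1) sample via inverse transform; clamp u away from 0 and 1
    # so that both logarithms stay finite.
    u = min(max(rng.random(), 1e-10), 1.0 - 1e-10)
    return -math.log(-math.log(u))


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def binary_gumbel_softmax(logits, temp, rng):
    # Soft binary mask: sigmoid((logit + g1 - g2) / temp), where g1 and g2
    # are independent Gumbel(0, 1) samples.
    return [
        sigmoid((l + sample_gumbel(rng) - sample_gumbel(rng)) / temp)
        for l in logits
    ]


def sparsity_loss(mask, target):
    # Assumed loss form: squared gap between the mean mask value and the target.
    keep_ratio = sum(mask) / len(mask)
    return (keep_ratio - target) ** 2


rng = random.Random(0)
logits = [0.0] * 1000  # hypothetical mask logits, one per feature
target = 0.5           # hypothetical sparsity target

for temp in (5.0, 1.0):
    losses = [
        sparsity_loss(binary_gumbel_softmax(logits, temp, rng), target)
        for _ in range(200)
    ]
    print(f"temp={temp}: mean sparsity loss over 200 draws = {sum(losses) / len(losses):.6f}")
```

Because the logits are fixed here, any variation in the printed loss comes only from the sampled noise, so this isolates the noise effect from whatever the rest of the training loop is doing.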