You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I want to ask you about the idea of the article, that is, why you should use Gumbel-softmax - if I understand correctly, the input in this article is not sampling.Even with softmax, there should be no differentiable problems without reparameterization.
From the perspective of solving argmax‘s problem, I think softmax with temperature τ should also achieve a similar effect.
My understanding is that according to the statement in Alg1 and 2, a one-hot control variable needs to be output, but the specific explanation is still vague.
Best wishes!
The text was updated successfully, but these errors were encountered:
Hi author, thanks for your work!
I want to ask you about the idea of the article, that is, why you should use Gumbel-softmax - if I understand correctly, the input in this article is not sampling.Even with softmax, there should be no differentiable problems without reparameterization.
From the perspective of solving argmax‘s problem, I think softmax with temperature τ should also achieve a similar effect.
My understanding is that according to the statement in Alg1 and 2, a one-hot control variable needs to be output, but the specific explanation is still vague.
Best wishes!
The text was updated successfully, but these errors were encountered: