I don't understand the use of Hard Attention in your code. #2

Open
bankanikhil29 opened this issue Jul 19, 2019 · 0 comments

bankanikhil29 commented Jul 19, 2019

Hard attention still has very low accuracy on the validation data, even after 10 epochs.
Is it only used for drawing the heat map?
Why is accuracy so much worse in this case than with soft attention?

From the papers I learned that soft attention attends to multiple objects, whereas hard attention attends to a single one. To cross-check this, I gave my model an image containing 2 digits.

Soft attention works well, attending to both digits.

[Figure: soft-attention heat map over the two digits]

With hard attention, however, I expected it to attend to just one of the digits. Yet the heat map looks the same as with soft attention, and the prediction is still affected.

[Figure: hard-attention heat map]
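For reference, here is a minimal sketch of the distinction being asked about, using plain NumPy. The feature map, scores, and shapes are hypothetical illustrations, not taken from the repository's code: soft attention is a differentiable weighted average over all locations (so both digits can contribute), while hard attention selects a single location (so only one digit should contribute).

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature map: 4 spatial locations, 8-dim features each
features = rng.normal(size=(4, 8))
scores = rng.normal(size=4)  # unnormalized attention scores

# Soft attention: softmax weights, weighted average over ALL locations.
# Differentiable, so it trains directly with backprop.
weights = np.exp(scores) / np.exp(scores).sum()
soft_context = weights @ features

# Hard attention: sample (or argmax) a SINGLE location. The sampling
# step is non-differentiable, so it is usually trained with
# REINFORCE-style gradient estimators rather than plain backprop.
idx = rng.choice(4, p=weights)
hard_context = features[idx]
```

If the implementation computes the hard-attention heat map from the same softmax weights (rather than from the sampled one-hot selection), the two heat maps would naturally look identical, which might explain the observation above.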
