You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, the value of masked options is not ignored when computing TD targets for Macro-DQN updates.
We need a function that produces an option mask for given states, which can be used here and elsewhere where option masking is needed. Ideally, this should be able to take batches of states and produce batches of option masks. This function will be called potentially many times per time step, so it should be performant.
The text was updated successfully, but these errors were encountered:
BaRL-SimpleOptions/simpleoptions/function_approximation/agents/macro_dqn_options_agent.py
Lines 121 to 125 in f82e945
Currently, the value of masked options is not ignored when computing TD targets for Macro-DQN updates.
We need a function that produces an option mask for given states, which can be used here and elsewhere where option masking is needed. Ideally, this should be able to take batches of states and produce batches of option masks. This function will be called potentially many times per time step, so it should be performant.
The text was updated successfully, but these errors were encountered: