From 836b6c813c0981d6b06ee864a6fc686828ab4971 Mon Sep 17 00:00:00 2001
From: mwydmuch See also: Note: added in 1.1.2. C++ Python `set_action(actions: list `set_action(action: list Sets the player’s action for the next tics.
+ Sets the player’s action for the following tics until the method is called again with new action.
Each value corresponds to a button previously specified
with Processes the specified number of tics. If Processes the specified number of tics, the last action set with This method combines functionality of This method combines functionality of isMultiplayerGa
setAction
-
+void setAction(std::vector<double> const &actions)
void setAction(std::vector<double> const &action)
-
-
+addAvailableButton
, or setAvailableButtons
methods,
or in the configuration file (in order of appearance).advanceAction
-
updateState
argument is set,
-the state will be updated after the last processed tic and a new reward will be calculated.
+setAction
+method will be repeated for each tic. If updateState
argument is set,
+the state will be updated after the last processed tic
+and a new reward will be calculated based on all processed tics since last the last state update.
To get the new state, use getState
and to get the new reward use getLastReward
.
@@ -556,9 +558,10 @@ makeAction
-
setAction
, advanceAction
and getLastReward
.
-Sets the player’s action for the next tics, processes the specified number of tics,
-updates the state and calculates a new reward, which is returned.setAction
, advanceAction
,
+and getLastReward
called in this sequance.
+Sets the player’s action for all the next tics (the same action will be repeated for each tic),
+processes the specified number of tics, updates the state and calculates a new reward from all processed tics, which is returned.
Flow control methodsSee also:
Note: added in 1.1.2.
@@ -400,7 +400,7 @@Sets the player’s action for the next tics. +
Sets the player’s action for the following tics until the method is called again with new action.
Each value corresponds to a button previously specified
with add_available_button()
, or set_available_buttons()
methods,
or in the configuration file (in order of appearance).
Processes the specified number of tics. If update_state
argument is set,
-the state will be updated after the last processed tic and a new reward will be calculated.
+
Processes the specified number of tics, the last action set with set_action()
+method will be repeated for each tic. If update_state
argument is set,
+the state will be updated after the last processed tic
+and a new reward will be calculated based on all processed tics since last the last state update.
To get the new state, use get_state()
and to get the new reward use get_last_reward()
.
This method combines functionality of set_action()
, advance_action()
and get_last_reward()
.
-Sets the player’s action for the next tics, processes the specified number of tics,
-updates the state and calculates a new reward, which is returned.
This method combines functionality of set_action()
, advance_action()
,
+and get_last_reward()
called in this sequance.
+Sets the player’s action for all the next tics (the same action will be repeated for each tic),
+processes the specified number of tics, updates the state and calculates a new reward from all processed tics, which is returned.