Skip to content

Commit

Permalink
Internal change
Browse files Browse the repository at this point in the history
PiperOrigin-RevId: 527383364
Change-Id: I547671eaad4f979d0503a61b4e846c56dd7b2f01
  • Loading branch information
Brax Team authored and erikfrey committed Apr 26, 2023
1 parent 18392cf commit ac3b374
Show file tree
Hide file tree
Showing 4 changed files with 560 additions and 5 deletions.
5 changes: 2 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -44,9 +44,8 @@ and closing the gap between simulation and the real world.
Explore Brax easily and quickly through a series of colab notebooks:

* [Brax Basics](https://colab.research.google.com/github/google/brax/blob/main/notebooks/basics.ipynb) introduces the Brax API, and shows how to simulate basic physics primitives.
* [Brax Training](https://colab.research.google.com/github/google/brax/blob/main/notebooks/training.ipynb)
introduces the Brax v2 API, and shows how to train a policy with the
generalized backend.
* [Brax Training](https://colab.research.google.com/github/google/brax/blob/main/notebooks/training.ipynb) introduces Brax's training algorithms, and lets you train your own policies directly within the colab. It also demonstrates loading and saving policies.
* [Brax Training with PyTorch on GPU](https://colab.research.google.com/github/google/brax/blob/main/notebooks/training_torch.ipynb) demonstrates how Brax can be used in other ML frameworks for fast training, in this case PyTorch.

## Using Brax Locally

Expand Down
2 changes: 1 addition & 1 deletion brax/training/agents/es/train.py
Original file line number Diff line number Diff line change
Expand Up @@ -200,7 +200,7 @@ def compute_delta(
Returns:
"""
# NOTE - -> len(weights) * perturbation_std" is
# NOTE: The trick "len(weights) -> len(weights) * perturbation_std" is
# equivalent to tuning the l2_coef.
weights = jnp.reshape(weights, ([population_size] + [1] * (noise.ndim - 1)))
delta = jnp.sum(noise * weights, axis=0) / population_size
Expand Down
2 changes: 1 addition & 1 deletion brax/v1/experimental/composer/agent_utils.py
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@
e.g. equivalent to agent1=(..., action_agents=('agent1',), ...)
agent_groups currently defines which rewards/actions belong to which agent.
observation is the same among all agents (TODO -.
observation is the same among all agents (TODO: add optionality).
"""

from collections import OrderedDict as odict
Expand Down
Loading

0 comments on commit ac3b374

Please sign in to comment.