Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

switch to mujoco env #381

Closed
wants to merge 3 commits into from
Closed

switch to mujoco env #381

wants to merge 3 commits into from

Conversation

KuoHaoZeng
Copy link
Collaborator

No description provided.

@KuoHaoZeng KuoHaoZeng requested a review from jordis-ai2 July 1, 2024 17:39
Copy link
Collaborator

@jordis-ai2 jordis-ai2 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I totally agree in removing the duplicate version of the BatchEnvironment example, but I'm not sure about the design changes in BatchedTask (please see my comments below). In nay case, I'm happy to discuss, as I might be over-engineering things that can be easily handled by other means 👍

@@ -407,6 +408,11 @@ def __init__(
self.tasks = [task_classes[0](env=env, sensors=sensors, task_info=task_info, max_steps=max_steps, batch_index=0, **kwargs)]
self.tasks[0].batch_index = 0

self.frames = None
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see why you're doing this, but I guess I had a thought on overloading the render() command, so that it produces all the observations we might want to extract from the environment (including, for example, also expert actions). I'm not sure we should be explicit at this level about which type of data are rendered, but I'm happy to discuss it.


def update_state_views(self):
for idx, state_view in enumerate(self.state_views):
updated_state_view = self.tasks[0].env.call(state_view)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I know I introduced this notation, but we might prefer to use the env reference in the task sampler rather than that in task 0? I was not sure of what seems cleaner, but I'm currently inclined for the task sampler reference as the better option. Again, happy to discuss what seems best :)

@KuoHaoZeng KuoHaoZeng closed this Jul 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants