-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Skill VLA eval #290
base: main
Are you sure you want to change the base?
Skill VLA eval #290
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
look good! let's debug this when we depoly on the hardware
@@ -0,0 +1,169 @@ | |||
# Copyright (c) Meta Platforms, Inc. and its affiliates. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we recreate a new base env? what are the issues if we use the old one?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
there's a lot of stuff in the old one that might not be needed anymore, it was easier to pull out parts that I needed into this one. For an example, the old one passes the RGB images through segmentation model, whereas we now want to use the RGB as input to our policy and not use any segmentation models.
skill_vla/examples/test_skill_vla.py
Outdated
policy.reset() | ||
while not done: | ||
action = policy.act(observations) | ||
observations, _, done, _ = env.step(base_action=action) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good point! I guess what you do here is to create a simple while loop RL env, so that we do not need to go through the complex existing env stuff
Motivation and Context
New evaluation code for VLA policies
How Has This Been Tested
Types of changes
Checklist