-
Notifications
You must be signed in to change notification settings - Fork 264
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bad performance on experiment reproduction #51
Comments
And the wierd thing is that the reward reflection during the running process is almost the same: Iteration 1: User Content: Iteration 2: User Content: Iteration 3: User Content: The values are totally the same. I think there must be something wrong with the training process. |
Atom light value from 0 to 1 |
Hi, I have successfully run the whole project and tested on several gym tasks, like FrankaCabinet and Humanoid. But the experiment result is not so good as I expected. What may be the reason?
My workstation environment is:
Ubuntu 22.04
12GB RTX 4080 GPU
16GB CPU
And the command lines I have used are:
python eureka.py env=FrankaCabinet sample=5 iteration=5 model_name=gpt-4
python eureka.py env=Anymal sample=5 iteration=5 model_name=gpt-4
The final success rate is only approximately 0.1. Does it related to the number of samples? My workstation can only run 5 samples in parallel due to the limit of GPU memory.
The text was updated successfully, but these errors were encountered: