[Question] "Soft" Constraining #2

DefinitlyEvil · 2024-08-23T21:43:55Z

Hello,
Thanks for the great research paper, however I found that in the paper you added constraint term in the loss function, in this way, will the robot still shake a lot during initial training since the parameters aren't adapted to the soft constraint yet?
Thanks, this have been confusing me for a while.
Toby.

realquantumcookie · 2024-08-25T14:51:50Z

Hi Toby,

Thank you for the question. In our experiments when we put a reasonably big enough hyper-parameter $C$, the robot will only shake in the first 5-10 seconds of initial training since we have a very high UTD ratio.

Best,
Yunhao

DefinitlyEvil · 2024-08-26T10:00:12Z

Ah, gotcha! Thanks, that cleared up my confusion so much! Keep up the good work!

DefinitlyEvil · 2024-09-05T20:46:35Z

Hi again, sorry for the disruption again.
As stated by Levine, I tried to control at 10Hz, but now training takes 0.3s it blocks the running severely (M1 Max). What technology did you use to enable async training? Thanks Yunhao. <3

realquantumcookie · 2024-09-06T14:57:20Z

Hi! In our experiments we were using an Ubuntu 18.04 LTS machine with an RTX 2070 installed. Not sure if Mac will run the real world control code at all, also make sure the jax library you run has whatever PRJT extension installed for your machine (in your case if you want to use mac you need this). Also we don't have any tested async training code (I guess our hardware was just good enough)

DefinitlyEvil · 2024-09-07T18:18:11Z

Ah you used fully sync code? That was amazing to have 20 gradient updates at 10Hz rate...
I built a custom hardware IO library with HTTP API, but I guess the improvement range is huge.
I have a RTX4090 PC though maybe I should switch to that.
My hardware is a self-designed servo-driven robot, everything is from scratch so things are tricky and challenging. xD
Anyway, thanks for sharing your training details and I really love your work. <3 Hope to study with you and Levine some day. Haha! Have a great day!

realquantumcookie · 2024-09-09T17:02:22Z

Yes, we actually had 20 UTD @ 20 Hz. Best of luck with your future endeavors! I will keep this issue opened in case someone has similar questions as yours.

DefinitlyEvil closed this as completed Aug 26, 2024

realquantumcookie reopened this Sep 6, 2024

realquantumcookie added question Further information is requested resolved labels Sep 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] "Soft" Constraining #2

[Question] "Soft" Constraining #2

DefinitlyEvil commented Aug 23, 2024

realquantumcookie commented Aug 25, 2024

DefinitlyEvil commented Aug 26, 2024

DefinitlyEvil commented Sep 5, 2024 •

edited

Loading

realquantumcookie commented Sep 6, 2024 •

edited

Loading

DefinitlyEvil commented Sep 7, 2024

realquantumcookie commented Sep 9, 2024

[Question] "Soft" Constraining #2

[Question] "Soft" Constraining #2

Comments

DefinitlyEvil commented Aug 23, 2024

realquantumcookie commented Aug 25, 2024

DefinitlyEvil commented Aug 26, 2024

DefinitlyEvil commented Sep 5, 2024 • edited Loading

realquantumcookie commented Sep 6, 2024 • edited Loading

DefinitlyEvil commented Sep 7, 2024

realquantumcookie commented Sep 9, 2024

DefinitlyEvil commented Sep 5, 2024 •

edited

Loading

realquantumcookie commented Sep 6, 2024 •

edited

Loading