Bus error (core dumped) in deepmd multitask training #180
Unanswered
ChiahsinChu
asked this question in
Q&A
Replies: 1 comment
-
You may need to find out which node the job was running on, then ssh to that node and run |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Summary
When performing multi-task training with deepmd v2.2.11 in Zeus HPC, bus error (core dumped) is thrown during neighbour stat (i.e., before training starts).
Software
DeepMD-kit v2.2.11 + Tensorflow v2.14.0
(installed from the official off-line package)
Details
Run command:
Error:
Input and output files are attached:
tf-v2-train.tar.gz
Beta Was this translation helpful? Give feedback.
All reactions