make TransformerLayer accept a bshd
or sbhd
tensor format
#2894
Job | Run time |
---|---|
0s | |
17m 24s | |
17m 24s |
bshd
or sbhd
tensor format
#2894
Job | Run time |
---|---|
0s | |
17m 24s | |
17m 24s |