make TransformerLayer accept a bshd
or sbhd
tensor format
#2315
Job | Run time |
---|---|
1m 9s | |
1m 9s |
bshd
or sbhd
tensor format
#2315
Job | Run time |
---|---|
1m 9s | |
1m 9s |