float16 support for AMD64 target #23536
Unanswered
georgen117
asked this question in
Other Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am working on adding MLAS_TARGET_AMD64 support to the fp16_common.h header.
This currently has implemented a lot of inline functions of NEON intrinsic instructions.
I have implemented the functions using intrinsic instructions from immintrin.h.
What is the preference for adding the code.
Does the community prefer
Option 1 implement all the AMD64 versions of the function then all the ARM NEON version of the functions
Or is this still prefered?
Option2 interleave the AMD64 and ARM NEON code
I hope it is clear what I am asking.
For the float32 version of similar code found in mlasi.h option 2 was used.
However the code I have been implementing currently uses Option 1 I wanted to know if there was a preference so I could switch it it needed.
Beta Was this translation helpful? Give feedback.
All reactions