Updated Support for AArch64 (markdown)

JayDDee
2023-10-30 00:37:19 -04:00
parent 8f7b769f09
commit 90c68f32a6

@@ -59,6 +59,24 @@ Known problems:
* SWIFFTX: Multiple issues with NEON, using unoptimized code.
* Algos not mentioned have either been deferred or have not been analyzed. They may or may not work on ARM.
Short term plan:
Continue fixing parallel hash functions for x17 before propagating them to the rest of the X family.
Figure out what's going on with verthash.
Extend support to x21s, x22i, x25x.
Add support for the short algos like skein2, keccak, blake2s.
Complete any other work needed to reach parity with SSE2.
Longer term:
Find NEON optimization opportunities that exploit its architecture and instruction set.
Apply lessons learned to x86_64.
Even longer term:
ARM SVE
RISC-V
Some notable observations about the problems:
Verthash is a mystery: it produces only rejects on ARM even with no targeted code, just compiled C. The same C source works on x86_64 but not on AArch64. Tried with -O3 and -O2. In all other cases, falling back to C was successful. Verthash data file creation and verification work.
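
For readers unfamiliar with what "falling back to C" means here, the following is a minimal sketch of the general pattern, not code from this project. It selects a NEON, SSE2, or plain C path at compile time and checks that the targeted path matches the plain C reference. The names `xor_words`, `xor_words_ref`, `xor_paths_agree` and the `FORCE_PLAIN_C` switch are hypothetical, made up for illustration; only the compiler-defined macros `__ARM_NEON` and `__SSE2__` and the intrinsics themselves are real.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#if defined(__ARM_NEON)
#include <arm_neon.h>
#elif defined(__SSE2__)
#include <emmintrin.h>
#endif

/* Plain C reference: XOR n 32-bit words of b into a. */
static void xor_words_ref(uint32_t *a, const uint32_t *b, size_t n)
{
    for (size_t i = 0; i < n; i++)
        a[i] ^= b[i];
}

/* Targeted version: handles 4 words per step with NEON or SSE2 when
 * available. Defining the hypothetical FORCE_PLAIN_C macro disables the
 * vector paths, so the whole job is done by the plain C reference. */
static void xor_words(uint32_t *a, const uint32_t *b, size_t n)
{
    size_t i = 0;
#if defined(FORCE_PLAIN_C)
    /* No vector code: i stays 0 and the tail call below does everything. */
#elif defined(__ARM_NEON)
    for (; i + 4 <= n; i += 4)
        vst1q_u32(a + i, veorq_u32(vld1q_u32(a + i), vld1q_u32(b + i)));
#elif defined(__SSE2__)
    for (; i + 4 <= n; i += 4) {
        __m128i va = _mm_loadu_si128((const __m128i *)(a + i));
        __m128i vb = _mm_loadu_si128((const __m128i *)(b + i));
        _mm_storeu_si128((__m128i *)(a + i), _mm_xor_si128(va, vb));
    }
#endif
    /* Remaining words (or all of them when no vector path was compiled). */
    xor_words_ref(a + i, b + i, n - i);
}

/* Returns 1 when the targeted and reference paths agree on a small,
 * arbitrary test vector whose length is not a multiple of 4. */
static int xor_paths_agree(void)
{
    uint32_t x[11], y[11], k[11];
    for (uint32_t i = 0; i < 11; i++) {
        x[i] = y[i] = 0x9E3779B9u * (i + 1);
        k[i] = 0x01234567u + i;
    }
    xor_words(x, k, 11);
    xor_words_ref(y, k, 11);
    return memcmp(x, y, sizeof x) == 0;
}
```

A check like this only shows that a vector path agrees with the C path on the same machine. It would not catch a case like Verthash, where the plain C path itself gives different results on AArch64 and x86_64; that requires comparing outputs across the two architectures.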