Siddhesh Poyarekar 30a81dae5b aarch64: Optimized memcmp for medium to large sizes
This improved memcmp provides a fast path for compares up to 16 bytes
and then compares 16 bytes at a time, thus optimizing loads from both
sources.  The glibc memcmp microbenchmark retains performance (with an
error of ~1ns) for smaller compare sizes and reduces up to 31% of
execution time for compares up to 4K on the APM Mustang.  On Qualcomm
Falkor this improves to almost 48%, i.e. it is almost 2x improvement
for sizes of 2K and above.

	* sysdeps/aarch64/memcmp.S: Widen comparison to 16 bytes at a
	time.
2018-03-06 19:22:40 +05:30
..
2018-01-27 13:31:38 +01:00
2018-02-17 13:46:50 -05:00
2018-02-15 12:41:05 +00:00
2018-02-15 12:41:05 +00:00
2018-02-27 01:15:39 +01:00
2018-03-05 21:46:55 +00:00
2018-02-14 14:03:40 -02:00
2018-02-22 16:44:58 -03:00
2018-02-15 12:41:05 +00:00