Ling Ma
5c74e47cd6
Add x86_64 memset optimized for AVX2
This patch takes advantage of Haswell (HSW) memory bandwidth, reduces
branch mispredictions by avoiding branch instructions, and forces the
destination to be aligned for AVX and AVX2 stores.
The CPU2006 403.gcc benchmark indicates that this patch improves
performance by 26% to 59%.
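
The two ideas named above (fewer size-dependent branches, aligned stores to the destination) can be illustrated in portable C. This is only a sketch under stated assumptions, not the code in memset-avx2.S: the helper names fill_8_to_16 and sketch_fill_large are hypothetical, a 32-byte array stands in for a broadcast YMM register, and memcpy stands in for the vector store instructions. Overlapping head and tail stores are one common way to avoid branching on the exact length; whether the patch uses exactly this scheme is an assumption.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: fill n bytes, 8 <= n <= 16, without branching on n.
   Two 8-byte stores are issued; they overlap whenever n < 16. */
static void fill_8_to_16(unsigned char *dst, int c, size_t n)
{
    assert(n >= 8 && n <= 16);
    uint64_t pattern = 0x0101010101010101ULL * (unsigned char)c;
    memcpy(dst, &pattern, 8);          /* covers the first 8 bytes */
    memcpy(dst + n - 8, &pattern, 8);  /* covers the last 8 bytes  */
}

/* Hypothetical helper: fill n bytes, n >= 32, using an unaligned head
   store, 32-byte-aligned stores in the loop, and an unaligned tail store. */
static void *sketch_fill_large(void *dst, int c, size_t n)
{
    assert(n >= 32);
    unsigned char block[32];           /* stand-in for a broadcast YMM register */
    for (size_t i = 0; i < sizeof block; i++)
        block[i] = (unsigned char)c;

    unsigned char *d = dst;
    unsigned char *end = d + n;

    memcpy(d, block, 32);              /* unaligned store at the head */

    /* First 32-byte boundary strictly after d; the head store already
       covers everything before it. */
    unsigned char *p = (unsigned char *)(((uintptr_t)d + 32) & ~(uintptr_t)31);

    while ((size_t)(end - p) >= 32) {  /* destination is 32-byte aligned here */
        memcpy(p, block, 32);
        p += 32;
    }

    memcpy(end - 32, block, 32);       /* unaligned store at the tail, may overlap */
    return dst;
}
```

In this sketch the only data-dependent branch left in the large path is the loop condition. The head and tail are handled by fixed stores that are allowed to overlap bytes already written, which is harmless because every store writes the same value.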
* sysdeps/x86_64/multiarch/Makefile: Add memset-avx2.
* sysdeps/x86_64/multiarch/memset-avx2.S: New file.
* sysdeps/x86_64/multiarch/memset.S: Likewise.
* sysdeps/x86_64/multiarch/memset_chk.S: Likewise.
* sysdeps/x86_64/multiarch/rtld-memset.S: Likewise.
2014-06-19 15:14:08 -07:00