Liubov Dmitrieva
4b43400f6a
optimize the following memcpy: sysdeps/i386/i686/multiarch/memcpy-ssse3.S
...
I've improved the following implementation of memcpy:
"sysdeps/i386/i686/multiarch/memcpy-ssse3.S".
The patch includes some minor style fixes, but the important part is
just using prefetch loops for the case:
DATA_CACHE_SIZE_HALF <= len < SHARED_CACHE_SIZE_HALF and
src and dst pointers have unequal 16 byte alignments.
This gives from 6% - 50% performance boost on the atom machine, about
24,73% in geometric mean.
2012-03-30 16:45:27 -04:00
..
2010-01-12 11:22:03 -08:00
2010-01-12 11:22:03 -08:00
2012-02-09 23:18:22 +00:00
2010-01-12 11:22:03 -08:00
2010-01-12 11:22:03 -08:00
2012-02-09 23:18:22 +00:00
2010-01-12 11:22:03 -08:00
2009-08-28 14:54:46 -07:00
2011-11-13 09:50:13 -05:00
2011-12-17 14:39:23 -05:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-03-30 16:45:27 -04:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2010-01-12 11:22:03 -08:00
2010-01-12 11:22:03 -08:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2010-01-12 11:22:03 -08:00
2010-01-12 11:22:03 -08:00
2012-02-09 23:18:22 +00:00
2011-10-12 11:42:04 -04:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2011-10-12 11:42:04 -04:00
2011-10-12 11:42:04 -04:00
2012-02-09 23:18:22 +00:00
2011-10-23 16:30:40 -04:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2011-06-24 14:15:32 -04:00
2011-06-24 14:15:32 -04:00
2011-06-24 14:15:32 -04:00
2011-06-24 14:15:32 -04:00
2011-06-24 14:15:32 -04:00
2011-06-24 14:15:32 -04:00
2012-02-29 22:37:38 +00:00
2011-11-14 18:24:35 -05:00
2011-11-13 09:50:13 -05:00
2011-11-13 09:50:13 -05:00
2011-11-13 09:50:13 -05:00
2012-02-09 23:18:22 +00:00
2010-07-21 08:44:35 -07:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-29 22:37:38 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-29 22:37:38 +00:00
2011-11-14 18:24:35 -05:00
2011-11-13 09:50:13 -05:00
2011-11-13 09:50:13 -05:00
2011-11-13 09:50:13 -05:00
2012-02-09 23:18:22 +00:00
2011-08-04 15:33:38 -04:00
2011-08-04 15:33:38 -04:00
2011-08-04 15:33:38 -04:00
2011-08-04 15:33:38 -04:00
2010-02-15 11:17:50 -08:00
2010-02-15 11:17:50 -08:00
2010-02-15 11:17:50 -08:00
2010-02-15 11:17:50 -08:00
2011-06-24 14:15:32 -04:00
2011-06-24 14:15:32 -04:00
2011-06-24 14:15:32 -04:00
2011-06-24 14:15:32 -04:00
2011-10-28 12:02:08 +02:00
2011-10-23 15:17:23 -04:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2011-07-14 20:47:54 -07:00
2010-08-27 12:10:11 -07:00
2010-08-24 11:35:01 -07:00
2010-04-14 22:27:59 -07:00
2011-12-17 14:39:23 -05:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2011-11-12 00:50:26 -05:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2011-12-17 14:39:23 -05:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2011-11-12 00:50:26 -05:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2011-12-17 14:39:23 -05:00
2012-02-09 23:18:22 +00:00
2012-02-09 23:18:22 +00:00
2011-11-12 00:50:26 -05:00
2011-10-15 11:10:08 -04:00
2011-10-15 11:10:08 -04:00
2012-02-09 23:18:22 +00:00