Ulrich Drepper
370a7d88f7
WP fixes
2011-12-17 14:41:05 -05:00
Ulrich Drepper
1d3e4b618a
Optimized wcschr and wcscpy for x86-64 and x86-32
2011-12-17 14:39:23 -05:00
Ulrich Drepper
aff2453df7
Fix more warnings
2011-12-03 21:49:35 -05:00
Ulrich Drepper
34372fc6d3
Fix test of non-ASCII locales in x86-64 strcasecmp et.al.
2011-11-01 16:46:23 -04:00
Ulrich Drepper
52e4b9eb62
More cleanups of x86-64 strstr
2011-10-28 19:01:48 -04:00
Ulrich Drepper
fd52bc6dc4
Clean up x86-64 strcasestr
...
Actually describe in the C code what is going on.
2011-10-28 18:18:04 -04:00
Ulrich Drepper
a5b81e1fb7
Remove code without too much effects
...
Some of the AVX-specific code is not giving enough speed-up to
justify the extra code.
2011-10-28 16:55:01 -04:00
Ulrich Drepper
e0016b11d6
Add AVX optimized versions for some x86-64 math functions
2011-10-25 21:34:55 -04:00
Ulrich Drepper
618280a192
Optimize x86-64 SSE4.2+ strcmp a bit more
2011-10-25 14:50:31 -04:00
Ulrich Drepper
31ea014d8b
Use VEX encoding in inline math functions on x86-64 when possible
2011-10-25 08:17:57 -04:00
Ulrich Drepper
31d3cc00b0
Cleanup FMA4 patch
...
Move the FMA4 code into its own section. Avoid some of the duplication
of data resulting from the double use of source files.
2011-10-25 00:56:33 -04:00
Ulrich Drepper
202c9deb15
Better DLA_FMS
...
It's better to use __builtin_fma if it works. Use it for gcc 4.6 and
higher. Move the x86-64 dla.h to the correct place.
2011-10-24 22:11:21 -04:00
Ulrich Drepper
a0cf1edd4c
Use inline asm for DLA_FMS because of broken old compilers
2011-10-24 21:17:10 -04:00
Ulrich Drepper
af968f62f2
Optimize accurate 64-bit routines for FMA4 on x86-64
2011-10-24 20:19:17 -04:00
Ulrich Drepper
09229f3e1b
Fix WS
2011-10-23 14:57:28 -04:00
Liubov Dmitrieva
ce7dd29f28
Optimized strnlen and wcscmp for x86-64
2011-10-23 14:56:04 -04:00
Ulrich Drepper
f17424ed53
Fix WS
2011-10-23 13:35:24 -04:00
Liubov Dmitrieva
95584d3b33
Fix signedness in wcscmp comparison
2011-10-23 13:34:15 -04:00
Ulrich Drepper
774a2669af
Clean up FMA use
...
The macro's name should reflect that subtraction is being done. And
use __builtin_fma, it seems to work after all.
2011-10-23 13:31:01 -04:00
Ulrich Drepper
c8b3296bbe
Clean up last dla.h change
2011-10-23 12:50:28 -04:00
Ulrich Drepper
fb24de5932
Fix typo in last change
2011-10-22 20:09:58 -04:00
Ulrich Drepper
0d355eb7c7
Update ULPs for x86-64
2011-10-22 20:06:23 -04:00
Ulrich Drepper
c196fed8f0
Fix compilation problems in x86-64 init-arch
2011-10-21 20:47:20 -04:00
Ulrich Drepper
1a97a8c78f
Don't use NULL in last s_fma{,f} change
2011-10-21 07:39:28 -04:00
Ulrich Drepper
ed72b6545f
Check for FMA4 support and generate appropriate fma functions
2011-10-20 22:43:15 -04:00
Ulrich Drepper
8d4f46c613
Move fma routines to right place
2011-10-20 21:55:41 -04:00
Ulrich Drepper
855d156018
Optimize x86-64 rawmemchr and add test
2011-10-19 22:22:29 -04:00
Ulrich Drepper
d9a4d2ab27
Add optimized str{,n}casecmp for AVX on x86-64
2011-10-19 12:42:38 -04:00
Andreas Schwab
8f3b1ffefa
Fix PLT use for feraiseexcept on x86_64
2011-10-19 13:03:31 +02:00
Ulrich Drepper
d9a8d0abcc
Use new internal libc_fe* interfaces in more functions
2011-10-18 15:11:31 -04:00
Ulrich Drepper
4855e3ddf5
Provide combined internal feholdexcept/fesetround interface
2011-10-18 09:59:04 -04:00
Ulrich Drepper
23ce562780
Pretty print last change to x86-64 mathinline.h
2011-10-18 09:38:47 -04:00
Ulrich Drepper
581d30e386
Add optimized nearbyint{,f} for x86-64
2011-10-18 09:13:23 -04:00
Ulrich Drepper
d38f1dba00
Start optimizing the use of the fenv interfaces in libm itself
2011-10-18 09:00:46 -04:00
Andreas Schwab
83c7615c2d
Fix last change
2011-10-18 14:11:29 +02:00
Andreas Schwab
caa6c9d845
Fix linkage conflict with feraiseexcept
2011-10-18 11:46:51 +02:00
Ulrich Drepper
228a984d54
Relax asm requirements for recently added x86-64 math interfaces
2011-10-17 20:30:52 -04:00
Ulrich Drepper
c8553a6a6f
Makr x86-64 math_private.h more robust
2011-10-17 16:00:39 -04:00
Ulrich Drepper
ed22dcf691
Provide internal optimizations on x86-64 with SSE4.1
...
Provide macros so that the internal users can, if possible, directly use
the new instructions.
Also fix up the mathinline.h header when compiling with SSE4.1 enabled.
2011-10-17 11:23:40 -04:00
Ulrich Drepper
b171c13768
Fix last x86-64 mathinline change
...
Use correct function names.
2011-10-17 10:37:00 -04:00
Ulrich Drepper
ad0f5cad15
Use rounds{s,d} for x86 rint, ceil, floor
2011-10-16 20:58:17 -04:00
Ulrich Drepper
2d1f3a4db6
Fix WS
2011-10-15 11:11:12 -04:00
Liubov Dmitrieva
be13f7bff6
Optimized memcmp and wmemcmp for x86-64 and x86-32
2011-10-15 11:10:08 -04:00
Andreas Schwab
6b1f68c91f
Fix lost feraiseexcept symbol
2011-10-14 11:21:23 +02:00
Andreas Schwab
714fad23c6
Fix PLT use in feupdateenv on x86_64
2011-10-13 15:26:45 +02:00
Andreas Schwab
81dcc7fb74
Check for zero size in memrchr for x86_64
2011-10-13 13:34:41 +02:00
Ulrich Drepper
0ac5ae2335
Optimize libm
...
libm is now somewhat integrated with gcc's -ffinite-math-only option
and lots of the wrapper functions have been optimized.
2011-10-12 11:27:51 -04:00
Ulrich Drepper
7edb55ce06
Optimize use of isnan, isinf, finite
2011-10-08 10:18:26 -04:00
Ulrich Drepper
66fb11b1da
Fix whitespace
2011-10-07 11:50:21 -04:00
Liubov Dmitrieva
093ecf9299
Improve 64 bit memchr, memrchr, rawmemchr with SSE2
2011-10-07 11:49:10 -04:00