patch: Enhancements to SIMD loop fusion
Stefan Seefeld
stefan at codesourcery.com
Thu Aug 17 10:26:03 UTC 2006
The attached patch adds some optimizations as well as more functionality
(support for complex types, as well as fused multiply-add) to the
SIMD loop fusion harness.
As SSE(2) doesn't provide fused multiply-add, the fma() implementation
falls back on mul() and add(). For AltiVec fma() still needs to be implemented.
No regressions were observed with gcc 4.1.
OK to commit ?
Thanks,
Stefan
--
Stefan Seefeld
CodeSourcery
stefan at codesourcery.com
(650) 331-3385 x718
-------------- next part --------------
A non-text attachment was scrubbed...
Name: simd.patch
Type: text/x-patch
Size: 17179 bytes
Desc: not available
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20060817/ce0f2b97/attachment.bin>
More information about the vsipl++
mailing list