[patch] SIMD load_unaligned, scalar load optimization
Jules Bergmann
jules at codesourcery.com
Thu May 17 13:23:01 UTC 2007
This patch contains two unrelated SIMD items:
- Implements a SIMD load_unaligned function for unaligned loads, and
adds unit test.
- Optimizes SIMD loop fusion handling of scalar values to load the value
into a SIMD register once, rather than each time it is accessed.
On a PPC 970FX, this improves floating-point scalar * vector performance
at 2048 points from 241 MFLOP/s to 1942 MFLOP/s.
Patch applied.
-- Jules
--
Jules Bergmann
CodeSourcery
jules at codesourcery.com
(650) 331-3385 x705
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: simd.diff
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20070517/02d373b5/attachment.ksh>
More information about the vsipl++
mailing list