[patch] SIMD load_unaligned, scalar load optimization

Jules Bergmann jules at codesourcery.com
Thu May 17 13:23:01 UTC 2007


This patch contains two unrelated SIMD items:

- Implements a SIMD load_unaligned function for unaligned loads, and 
adds unit test.

- Optimizes SIMD loop fusion handling of scalar values to load the value 
into a SIMD register once, rather than each time it is accessed.

On a PPC 970FX, this improves floating-point scalar * vector performance 
at 2048 points from 241 MFLOP/s  to 1942 MFLOP/s.

Patch applied.

				-- Jules

-- 
Jules Bergmann
CodeSourcery
jules at codesourcery.com
(650) 331-3385 x705
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: simd.diff
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20070517/02d373b5/attachment.ksh>


More information about the vsipl++ mailing list