[vsipl++] SIMD threshold with loop fusion

Assem Salama assem at codesourcery.com
Tue May 15 21:23:13 UTC 2007


Jules Bergmann wrote:
>
> [1] This looks good.  However, do you think faux-SIMD should have the
> same "API" as the real SIMD functions below?
>
> For example, AltiVec vgt returns 0xFFFFFFFF or 0x00000000 for each
> position.  That can be used as a mask.  (What does SSE do?)
SSE is the same thing because there is a website that has a 
cross-reference for altivec and sse instructions.
>
> Since faux SIMD returns 1 or 0, it can't be used as mask.  A generic
> routine that uses vgt may not work with faux-simd if it expects
> vgt/vlt to return a value valid for a mask.
Why not? I use normal bit operations on the return values. If I and '1' 
with another value, I get the value, right?
>
> [2] Isn't 'Alg_threshold' already checked in?  I'm confused.
I did check in simd.hpp. I will look and see why this is still a change...




More information about the vsipl++ mailing list