[vsipl++] [patch] SSAR opts

Mark Mitchell mark at codesourcery.com
Wed Sep 17 21:17:36 UTC 2008


Jules Bergmann wrote:

>     If RM is a row-major matrix, and CM is a col-major matrix,
>     the following cannot be dispatched easily to the SPUs:
> 
>         RM_Z = CM_A * RM_B;
> 
>     Instead, its better to transpose first, the multiply:
> 
>         RM_A = CM_A;    // transpose
>         RM_Z = RM_A * RM_B;

Excellent!

Is this something that we think SV++ should be so brave as do itself in
the future?  I guess not: it would require allocating memory, and we
probably don't want to do that without the user's permission.  But, I
guess we could have a diagnostic mode where we said "warning: perhaps
you should do a transpose and then multiply"?

Thanks,

-- 
Mark Mitchell
CodeSourcery
mark at codesourcery.com
(650) 331-3385 x713



More information about the vsipl++ mailing list