[vsipl++] [patch] SSAR opts
Mark Mitchell
mark at codesourcery.com
Wed Sep 17 21:17:36 UTC 2008
Jules Bergmann wrote:
> If RM is a row-major matrix, and CM is a col-major matrix,
> the following cannot be dispatched easily to the SPUs:
>
> RM_Z = CM_A * RM_B;
>
> Instead, its better to transpose first, the multiply:
>
> RM_A = CM_A; // transpose
> RM_Z = RM_A * RM_B;
Excellent!
Is this something that we think SV++ should be so brave as do itself in
the future? I guess not: it would require allocating memory, and we
probably don't want to do that without the user's permission. But, I
guess we could have a diagnostic mode where we said "warning: perhaps
you should do a transpose and then multiply"?
Thanks,
--
Mark Mitchell
CodeSourcery
mark at codesourcery.com
(650) 331-3385 x713
More information about the vsipl++
mailing list