[vsipl++] [patch] Forcing a copy for run-time external data access.

Jules Bergmann jules at codesourcery.com
Mon May 8 22:19:05 UTC 2006


Stefan Seefeld wrote:
> The attached patch rewrites the 1D workspaces used to prepare
> data to be 'sent' to the FFT backends. It now uses Jules' new
> rt_extdata harness, to take advantage of the backend's handling
> of split/interleaved, as well as non-unit strides, if possible.
> Now the number of copies of the data blocks should be minimal.
> 
> As I'm still in the process of debugging 2D and M cases,
> I send in this partial patch, in the hope that it is useful
> such as for benchmarking, to make sure the performance is
> at least on par with what it used to be before the redesign.
> 
> Hopefully I'm able to send out more patches later tonight...

Stefan,

This looks good.

I need to check how Rt_extdata handles requests for 1D 
stride_unit_dense.  In general it recongnizes that stride_unit_dense is 
a stricter requirement than stride_unit (i.e. anything that is 
stride_unit_dense is also stride_unit, but not visa-versa).  It should 
make an exception for 1D since there are no higher dimensions.

Alternatively, for 1D data you could just request stride_unit, since 
that is the minimal requirement.

				-- Jules



-- 
Jules Bergmann
CodeSourcery
jules at codesourcery.com
(650) 331-3385 x705



More information about the vsipl++ mailing list