[patch] MC Performance optimizations

Jules Bergmann jules at codesourcery.com
Fri Jul 21 23:49:00 UTC 2006


This patch has a performance optimziation for parallel assignment:

  - It reorganizes the parallel assignment algorithms to be
    specializations of a common class (using implementation tags).
    This makes it easier to plug in alternative algorithms
    (in fact, this was back ported from the PAS implementation).

  - It provides a new parallel assignment algorithm for
    use with block-distributed and whole-distributed vectors.
    It turns out that creating derived data types is very
    expensive in MPI/Pro.  The new block-vector assignment
    avoids creating these when possible, raising message
    bandwidth in extreme cases from ~5 MB/s to ~210 MB/s.

  - It has a new benchmark (dist_vmul) and extends existing
    benchmarks (primarily copy) to help better characterize parallel
    assignment performance.

It also has several bug fixes related to the multi-dim to 1-dim
expression redimension layer, and one bug fix in a "synthetic"
SAL wrapper.

				-- Jules


-- 
Jules Bergmann
CodeSourcery
jules at codesourcery.com
(650) 331-3385 x705
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: mc-release.diff
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20060721/578081f3/attachment.ksh>


More information about the vsipl++ mailing list