[patch] MC Performance optimizations
Jules Bergmann
jules at codesourcery.com
Fri Jul 21 23:49:00 UTC 2006
This patch has a performance optimziation for parallel assignment:
- It reorganizes the parallel assignment algorithms to be
specializations of a common class (using implementation tags).
This makes it easier to plug in alternative algorithms
(in fact, this was back ported from the PAS implementation).
- It provides a new parallel assignment algorithm for
use with block-distributed and whole-distributed vectors.
It turns out that creating derived data types is very
expensive in MPI/Pro. The new block-vector assignment
avoids creating these when possible, raising message
bandwidth in extreme cases from ~5 MB/s to ~210 MB/s.
- It has a new benchmark (dist_vmul) and extends existing
benchmarks (primarily copy) to help better characterize parallel
assignment performance.
It also has several bug fixes related to the multi-dim to 1-dim
expression redimension layer, and one bug fix in a "synthetic"
SAL wrapper.
-- Jules
--
Jules Bergmann
CodeSourcery
jules at codesourcery.com
(650) 331-3385 x705
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: mc-release.diff
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20060721/578081f3/attachment.ksh>
More information about the vsipl++
mailing list