[vsipl++] [patch] CFAR benchmark storage order
Don McCoy
don at codesourcery.com
Wed Sep 13 07:54:24 UTC 2006
Please disregard the previous version(s) of this patch. The attached
version has been checked more thoroughly than before. This time I ran
all the sets with varying storage orders for the CFAR data cube, then I
compared results at the points specified by the HPEC Challenge (in terms
of the number of range gates, RG).
This retesting resulted in a change for the "by-vector" algorithm for
about a 5% performance improvement. See the table below, produced from
data taken from the Xeon cluster at GTRI.
Slice
RG 2-0-1 0-2-1
Set 1 64 293 210
Set 2 3500 186 147
Set 3 1909 187 145
Set 4 9900 202 150
Vector Hybrid
RG 0-1-2 1-0-2 0-1-2 1-0-2
Set 1 64 96 97 384 415
Set 2 3500 125 133 693 666
Set 3 1909 123 130 692 650
Set 4 9900 124 132 697 670
Regards,
--
Don McCoy
don (at) CodeSourcery
(888) 776-0262 / (650) 331-3385, x712
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: cb3.changes
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20060913/0b9e354c/attachment.ksh>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: cb3.diff
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20060913/0b9e354c/attachment-0001.ksh>
More information about the vsipl++
mailing list