[vsipl++] [patch] New benchmark - vector division

Jules Bergmann jules at codesourcery.com
Mon May 1 12:10:10 UTC 2006


Don McCoy wrote:
> Here is a new benchmark for testing element-wise vector division.  Also 
> attached are two performance graphs comparing multiplication and 
> division - one shows mega flops per second and the other latency, or the 
> number of microseconds per operation.

Don, This looks good, please check in in. -- Jules
> 
> The graph showing flops per second is somewhat misleading for two 
> reasons: both divide and multiply for real numbers are each counted as a 
> "flop" even though they take a different number of clock cycles to 
> perform.

It does answer the question of whether a division FLOP is really the 
same as a multiply FLOP.  Depending on problem size, it looks like 1 div 
FLOP ~ 8 mul FLOPS.

>  Second, complex-complex division takes more operations (11, 
> two of which are real-real divisions) than complex-complex 
> multiplication (6).  This gives them a comparable flop count, even 
> though the division takes roughly twice as long.

Comparing complex-multiply MFLOPS vs complex-division MFLOPS is somewhat 
of an apples to oranges comparison.  The latency numbers, or 
alternatively measuring points per second, are a good way to look at it.

What machine/configuration are the results from?

				-- Jules


-- 
Jules Bergmann
CodeSourcery
jules at codesourcery.com
(650) 331-3385 x705



More information about the vsipl++ mailing list