[pooma-dev] Good News. Intel's ICC 8.0 Beta looks promising, now.
Richard Guenther
rguenth at tat.physik.uni-tuebingen.de
Mon Jun 16 09:23:38 UTC 2003
Hi Paul / others!
As you have access to KAI CC, I'd be curious to know what numbers you
get for the BlitzLoops / ABCTest benchmarks. Do we really expect the
POOMA-II numbers to match the C / C restrict numbers? I know neither
ICC nor gcc is really there at the moment. For the BlitzLoops with
my patched gcc3.3 I get (-O2 -march=athlon -fomit-frame-pointer
-funroll-loops):
rguenth at phoenix15 BlitzLoops > ./LINUXgcc33/Loop18 --sim-params 1000 3 1 --no-diags --samples 10
      N   C restrict        C   CppTran   PoomaII
   1000       637.74   599.74    441.03    416.92
  10000       446.39   412.08    364.48    345.99
 100000        94.34    83.25     82.50     81.25
1000000        91.77    74.49     74.39     76.55
which is good once we are memory-bandwidth limited (N >= 100000):
PoomaII is then within a few percent of plain C. With icc 8.0
(-O3 -xK -tpp6 -ip -restrict) I get:
rguenth at phoenix15 BlitzLoops > ./LINUXICC/Loop18 --sim-params 1000 3 1 --no-diags --samples 10
      N   C restrict        C   CppTran   PoomaII
   1000       857.11   909.79    248.03    187.58
  10000       449.74   434.15    230.60    181.60
 100000        82.61    83.24     82.04     88.50
1000000        79.18    90.93     80.07     88.65
Again, this looks good for large datasets.
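For readers unfamiliar with the difference between the "C" and
"C restrict" columns, here is a minimal sketch of the two flavours of
kernel. The kernel body and the names axpy_plain / axpy_restrict are
made up for illustration and are not the actual Loop18 source;
__restrict__ is the GCC spelling of the qualifier in C++.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical kernel (NOT the real Loop18 body): x[i] = a[i] + c * b[i].

// Plain "C" flavour: the compiler has to assume x may overlap a or b,
// which can prevent it from reordering or vectorizing loads and stores.
void axpy_plain(double* x, const double* a, const double* b,
                double c, std::size_t n)
{
    assert(x && a && b);
    for (std::size_t i = 0; i < n; ++i)
        x[i] = a[i] + c * b[i];
}

// "C restrict" flavour: __restrict__ (a GCC extension in C++) is a promise
// from the caller that the three arrays do not overlap.
void axpy_restrict(double* __restrict__ x,
                   const double* __restrict__ a,
                   const double* __restrict__ b,
                   double c, std::size_t n)
{
    assert(x && a && b);
    for (std::size_t i = 0; i < n; ++i)
        x[i] = a[i] + c * b[i];
}
```

Both functions compute the same result for non-overlapping arrays; only
the aliasing information given to the optimizer differs.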
Just for the curious: if I mark the data_m member of both the brick
engine and the brickview engine as restrict, I get the same numbers with
Intel icpc, and gcc doesn't improve on this benchmark either.
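A rough sketch of what marking a data_m member restrict might look
like. ToyBrick and assign_sum are hypothetical stand-ins, not POOMA's
actual Brick / BrickView engine classes, and __restrict__ is the GCC
spelling (with icc the -restrict flag enables the plain restrict
keyword).

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for an engine class; NOT POOMA's real Brick engine.
template <class T>
class ToyBrick {
public:
    ToyBrick(T* data, std::size_t n) : data_m(data), size_m(n) {}

    std::size_t size() const { return size_m; }

    T& operator()(std::size_t i)
    {
        assert(i < size_m);
        return data_m[i];
    }

    const T& operator()(std::size_t i) const
    {
        assert(i < size_m);
        return data_m[i];
    }

private:
    // The restrict qualifier on the member is a promise that this storage
    // is only reached through data_m; whether the optimizer exploits it
    // depends on the compiler.
    T* __restrict__ data_m;
    std::size_t size_m;
};

// Element-wise x = a + b through the engine interface.
template <class T>
void assign_sum(ToyBrick<T>& x, const ToyBrick<T>& a, const ToyBrick<T>& b)
{
    assert(x.size() == a.size() && x.size() == b.size());
    for (std::size_t i = 0; i < x.size(); ++i)
        x(i) = a(i) + b(i);
}
```

The qualifier changes only what the compiler is allowed to assume, not
the results, which fits with the timings staying the same.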
Richard.
--
Richard Guenther <richard dot guenther at uni-tuebingen dot de>
WWW: http://www.tat.physik.uni-tuebingen.de/~rguenth/