[pooma-dev] KCC versus icc

Richard Guenther rguenth at tat.physik.uni-tuebingen.de
Wed Feb 26 19:27:12 UTC 2003


Hi!

I remember problems with the inliner, i.e. it refused to inline
some of the expression template machinery. You might want to search
for an option letting you tune the inlining behavior or try profile
directed optimizations. With standard -O3 icc is not always faster
than gcc3.2.2 with -O3.

Richard.

On Tue, 25 Feb 2003, Paul A. Renard wrote:

> Hope I'm asking the correct crowd...
>
> Given the following:
>
>  const int N=128;
>  Array<2,complex<float> > u(N,N);
>  Iota<2>::Iota_t ij(iota(u.domain());
>  Iota<2>::Index_t I(ij.comp(0));
>  Iota<2>::Index_t J(ij.comp(1));
>  Array<1,complex<float> > cx(N), cy(N);
>
>  // Values for u, cx, cy are filled elsewhere.
>
>  // Then the following is called:
>  void compute(){
>   u *= cx(I)*cy(J);    // runs 4X slower with icc than KCC
>  }
>
>  When I time this routine, I find that it runs about 4X slower when compiled with
> Intel's icc (Version 7, -O3 -DNOPAssert -DNOCTASSERT) than with KCC (version 4.0f,
> +K3 -DNOPAssert, -DNOCTAssert).  As expected, the KCC version runs as fast as
> hand-written loops.
>
> Do others observe this same sluggish behavior with icc?  Am I missing some obvious
> compile flag? 
>
> thanks
> Paul
>




More information about the pooma-dev mailing list