[patch] Disable profiling in dot products

Jules Bergmann jules at codesourcery.com
Fri Jun 27 15:25:40 UTC 2008


Some of the profiling event name code for dot product was consuming 
considerable time even when VSIP_IMPL_PROFILER == 0 (see below for an 
example).  This patch disables that when VSIP_IMPL_PROFILER & mask is false.

Ok to apply?

				-- Jules


Pre change

% benchmarks/dot -1 -ops -samples 3
# what             : t_dot1 (1)
# nproc            : 1
# ops_per_point(1) : 2
# riob_per_point(1): 8
# wiob_per_point(1): 0
# metric           : ops_per_sec
# start_loop       : 133341
      16 16.903055 16.885244 16.949444
      32 33.609573 33.514175 33.751736
      64 66.244659 66.077934 66.556038
     128 131.906174 130.535217 133.274078
     256 256.051178 255.275986 258.389465
     512 488.516510 488.469940 490.960785
    1024 859.449341 852.046570 865.133423
    2048 1438.800781 1436.463745 1450.017334
    4096 1849.857788 1845.208984 1853.868042
    8192 2116.325684 2113.715820 2117.107910
   16384 2440.822754 2436.575684 2441.876709
   32768 2707.750977 2697.868652 2708.683594
   65536 2829.450439 2823.039062 2830.946045
  131072 2885.994141 2882.093018 2891.500000
  262144 2871.778809 2868.683350 2883.067627
  524288 1753.057373 1681.539062 1754.856323
1048576 830.176514 826.936584 856.340210
2097152 759.201416 740.848389 761.915283


Post change

% benchmarks/dot -1 -ops -samples 3
# what             : t_dot1 (1)
# nproc            : 1
# ops_per_point(1) : 2
# riob_per_point(1): 8
# wiob_per_point(1): 0
# metric           : ops_per_sec
# start_loop       : 3378559
      16 393.554230 384.410278 424.985718
      32 722.641785 695.435608 730.206116
      64 1371.760376 1341.322266 1387.337769
     128 2211.171875 2210.553955 2213.565918
     256 3231.258545 3229.922119 3234.606445
     512 4069.423584 4054.140625 4075.931396
    1024 4726.631836 4715.732422 4733.557129
    2048 5040.998535 5032.172852 5041.894043
    4096 4921.556641 4913.420410 4929.406738
    8192 2906.246582 2903.860596 2910.550537
   16384 2924.517334 2917.423340 2928.528076
   32768 2962.245605 2960.767822 2964.117432
   65536 2958.519531 2952.603271 2960.056641
  131072 2951.296143 2945.608154 2962.209961
  262144 2858.965332 2855.086182 2867.772949
  524288 1654.068970 1621.036987 1683.348999
1048576 781.400024 758.868774 808.620667
2097152 737.620056 735.490540 743.508240


-- 
Jules Bergmann
CodeSourcery
jules at codesourcery.com
(650) 331-3385 x705
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: prof.diff
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20080627/805cd8b0/attachment.ksh>


More information about the vsipl++ mailing list