[patch] Profiling bits
Jules Bergmann
jules at codesourcery.com
Wed Nov 15 02:42:34 UTC 2006
This patch:
- adds row/col to Fftm event names to indicate which type of Fft is
being done.
- adds info to the name returned by the Transpose_tag evaluator
name() to distinguish between copy and transpose.
- adds a '-sum' option to fmt-profile to sum the times for events
nested under events with 0 operations (and compute the mflop/s).
- adds a '-extra <event>' option to fmt-profile to add synthetic
events with the extra time not accounted for in nested events.
The rationale for requiring the event name to be specified is that
many nested library events will have unaccounted for time (i.e.
Ffts with scaling).
An example profile output with '-sum' and '-extra "Kernel1 total"'
options is attached.
Don, is this OK to commit?
-- Jules
--
Jules Bergmann
CodeSourcery
jules at codesourcery.com
(650) 331-3385 x705
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: prof.diff
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20061114/5980dad2/attachment.ksh>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: profile.txt
URL: <http://sourcerytools.com/pipermail/vsipl++/attachments/20061114/5980dad2/attachment.txt>
More information about the vsipl++
mailing list