cpluspluscourse icon indicating copy to clipboard operation
cpluspluscourse copied to clipboard

Expand section on profilers (perf and VTune)

Open amadio opened this issue 3 years ago • 6 comments

I've focused more on perf than VTune, but this is intended to close #43. I think the online documentation for VTune is good enough that we can just point students there. However, if you think the VTune section should be expanded further, let me know.

amadio avatar Nov 01 '22 09:11 amadio

I wonder whether the presented content is too detailed. But I would let other people comment on this.

You can always skip what you don't need, but the content is useful for people just looking at the slides as a reference. That said, @hageboeck had the same concern.

amadio avatar Nov 01 '22 16:11 amadio

Are any changes needed? From my side this should be ready for merging.

amadio avatar Nov 07 '22 09:11 amadio

I would even like to have more pictures. E.g. I love the microarchitecture analysis with the pipeline visualization. Or how a general hierarchical profile looks like. Or the pane showing contention between threads. Or even better, a live demonstration :)

I could not reply directly to this, so adding as quote above.

Although I would like to, I unfortunately don't have so much more time to invest in improving the slides. I really need to go back to work on Geant4 and XRootD now. In any case, I think the online documentation of VTune is really good already. perf is harder to use just by looking at the docs, therefore my added examples, which are meant to be copy/pasted into the terminal to try out perf even without deep knowledge about it.

amadio avatar Nov 07 '22 13:11 amadio

I would like @sponce and @hageboeck to comment on the complexity of the presented material. For my part, I am fine enough to merge. If I had to present this material, I would probably skip a third of the commands because my knowledge about them is insufficient.

bernhardmgruber avatar Nov 07 '22 15:11 bernhardmgruber

I'm in general not at ease with this one. On one hand it's already far too complex, on the other hand a lot of explanations are missing on concepts used without presenting them. I can see 2 ways out : adding more, but then splitting into a standard part and an expert one. Or simplifying, keeping really only the core, as we did for gdb, in 4 slides total (first 2 with second one split and one example slide.

sponce avatar Nov 08 '22 08:11 sponce

Ok, I think it's better to go with the second route of simplifying things a bit and providing examples only for the more basic usage of perf, and breaking the first slide into two. I will update this pull request in the next few days when I find the time for it.

amadio avatar Nov 08 '22 14:11 amadio