Expand section on profilers (perf and VTune)
I've focused more on perf than VTune, but this is intended to close #43. I think the online documentation for VTune is good enough that we can just point students there. However, if you think the VTune section should be expanded further, let me know.
I wonder whether the presented content is too detailed. But I would let other people comment on this.
You can always skip what you don't need, but the content is useful for people just looking at the slides as a reference. That said, @hageboeck had the same concern.
Are any changes needed? From my side this should be ready for merging.
I would even like to have more pictures. E.g. I love the microarchitecture analysis with the pipeline visualization. Or how a general hierarchical profile looks like. Or the pane showing contention between threads. Or even better, a live demonstration :)
I could not reply directly to this, so adding as quote above.
Although I would like to, I unfortunately don't have so much more time to invest in improving the slides. I really need to go back to work on Geant4 and XRootD now. In any case, I think the online documentation of VTune is really good already. perf is harder to use just by looking at the docs, therefore my added examples, which are meant to be copy/pasted into the terminal to try out perf even without deep knowledge about it.
I would like @sponce and @hageboeck to comment on the complexity of the presented material. For my part, I am fine enough to merge. If I had to present this material, I would probably skip a third of the commands because my knowledge about them is insufficient.
I'm in general not at ease with this one. On one hand it's already far too complex, on the other hand a lot of explanations are missing on concepts used without presenting them. I can see 2 ways out : adding more, but then splitting into a standard part and an expert one. Or simplifying, keeping really only the core, as we did for gdb, in 4 slides total (first 2 with second one split and one example slide.
Ok, I think it's better to go with the second route of simplifying things a bit and providing examples only for the more basic usage of perf, and breaking the first slide into two. I will update this pull request in the next few days when I find the time for it.