Andreas Abel
@recvfrom It might not be exactly what you are looking for, but you might also find my project https://github.com/andreas-abel/XED-to-XML helpful.
I'm using the instruction files from XED to automatically generate assembler code for measuring the latency, throughput, and port usage of individual x86 instructions on different microarchitectures. You can find...
According to the SDM, the "three high-order doublewords of the destination operand remain unchanged." Thus, the first (destination) operand is also read.
Here is a related discussion: https://stackoverflow.com/questions/72468730/does-rsqrtss-break-the-dependency-on-the-destination-register
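To illustrate why the merge behavior implies a read of the destination, here is a minimal sketch in Python. It models a 128-bit register as a list of four floats and assumes an RSQRTSS-like scalar operation as in the linked discussion. The names `scalar_merge` and `rsqrt` are hypothetical helpers for illustration only, and `rsqrt` computes an exact value rather than the hardware approximation.

```python
from typing import Callable, List


def scalar_merge(dest: List[float], src: List[float],
                 op: Callable[[float], float]) -> List[float]:
    """Hypothetical model of a legacy-SSE scalar single-precision operation.

    Lane 0 of the result is op(src[0]); lanes 1-3 are copied from the OLD
    value of dest. Because the result contains lanes from dest, the
    operation has to read dest, not just write it.
    """
    return [op(src[0])] + dest[1:]


def rsqrt(x: float) -> float:
    # Exact reciprocal square root; the real instruction only approximates it.
    return 1.0 / x ** 0.5


src = [4.0, 9.0, 16.0, 25.0]
old_dest_a = [1.0, 2.0, 3.0, 4.0]
old_dest_b = [1.0, 7.0, 8.0, 9.0]

result_a = scalar_merge(old_dest_a, src, rsqrt)
result_b = scalar_merge(old_dest_b, src, rsqrt)

# Same source, different old destination values: the results differ in
# lanes 1-3, so the output depends on the previous contents of dest.
print(result_a)
print(result_b)
```

Since the two results differ only because the old destination values differ, the destination is an input of the operation in this model, which is the dependency discussed above.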
This also applies to several iforms that are not VMOV* iforms, e.g., VPSHAD_XMMdq_XMMdq_XMMdq.
The examples I was looking at were cases where different PATTERN tokens lead to the same iform. There are currently ~6300 different iforms and ~6800 different PATTERNs, so the growth...
The `-arch` and `-alignmentOffset` parameters now support the option `all`.
Are you sure that such a module is suitable for the Linux kernel? Note that the module makes it possible to execute arbitrary code in kernel space.
I'm not sure I understand what the issue here is. There are separate experiments for the `mov r32, ` case on Alder Lake and Zen 2 (see https://uops.info/html-tp/ADL-P/MOV_89_R32_R32-Measurements.html#sameReg, https://uops.info/html-tp/ZEN2/MOV_89_R32_R32-Measurements.html#sameReg). On...
I see now what you mean. The latency experiments for the "same reg" case are only performed if the throughput tests suggest that there is indeed a difference, which is...