asm2vec-pytorch icon indicating copy to clipboard operation
asm2vec-pytorch copied to clipboard

I tried to recapture the experiments in the Asm2Vec paper using this implementation yet my results are really bad - what is the problem?

Open anandramakris opened this issue 3 years ago • 2 comments

I attempted to use this library to figure out the cosine similarity between the O0 and O3 optimized functions in coreutils version 8.30 (taken from https://github.com/yueduan/DeepBinDiff). In order to try to replicate the results, I used the same options for training as in the paper - embedding dimension 200, learning rate 0.025 - and changed the number of random walks to be 10 as in the paper.

I found the average cosine similarity to be 0.128. Given that the results in the paper show that Asm2Vec should correctly match around 80% of O0 and O3 functions, the score is very poor. Do you have an explanation?

anandramakris avatar May 30 '22 00:05 anandramakris

My original comparison script was faulty, but after fixing it the average cosine similarity was still only 0.151.

anandramakris avatar Jun 01 '22 05:06 anandramakris

Hi, have u solved ur problem?

I find its strange that whatever dataset I use to train the model, it shows a quite bad performance, about only 30% accuracy after 100 epochs.

So I think there is something wrong with this implement.

true-zk avatar Mar 24 '23 06:03 true-zk