functionsimsearch
functionsimsearch copied to clipboard
Some C++ example code to demonstrate how to perform code similarity searches using SimHashing.
The attached file seems to cause an error in the ELF parser: ``` root@3784f1a5cc98:/code/functionsimsearch# ./bin/disassemble --input= /bzip2-1.0.4-gcc-5.3-alpine-3.31 [!] Failure to parse PE file! Error: PE File cannot be parsed. ```...
This commit fixes build failure because of `pe-parser` API change. I ran the default testsuite, which passed. Some tests from the _slow_ testsuite fail.
When extracting the symbols of the binary files of the dataset, base64 of the function prototype is used to build a ground truth of same functions. but with different compilers,...
This was discussed here https://github.com/googleprojectzero/functionsimsearch/issues/17 Note: I am using the shell to do base64 decoding, I did this since I saw you reporting some problems with the python base64 method....
I generated a weight file by setting -train_steps to 100 and got pretty good matching result, but then I tried to generate another weight file by seeing -trains_steps to 500...
Compiled everything in the docker way and code runs. Building the python bindings and then trying to run some code that needs them, breaks though. Step to repro: build python...
As there is already a way to feed a CFG from JSON to the fingerprinter it would be great to make the DynInst code parts there optional. This would make...
This was a desired feature initially to make sure the underlying disassembly is good, but makes tweaking / improving / adding new features difficult without breaking the tests. Particular culprits...