LIU, Jinyang

Results 17 comments of LIU, Jinyang

Oh, the adjacency matrix for the dataset is 52643 * 91599, which is OK! But how to deal with the dataset like 5M * 0.9B?

@Jhy1993 It's true, but sampling may miss some chance to aggregate enough nodes, e.g, node A needs to aggregate 100 nodes but finally gets 10 nodes' information due to sampling.

@ZanisAli I am sorry I do not really understand that, I found that training data can seriously affect the results, could you please explain how to get the hdfs_train?

@ZanisAli I would like to use the semantic information for each tempalte instead of only IDs, so I want to know how hdfs_train is got from the raw log data...

@ZanisAli Thanks for your helpful advice! I checked Issue#3, which is really what I want, but in the code the author provided, the mapping eventid2template.json is missing, which should be...

@ZanisAli Really thanks for your help, I understand how to generate the templates by myself, but the question is I would like to get exactly the same templates-id mapping used...

@ZanisAli Great, thanks for your help. BTW, do you happen to know where to download the full OpenStack dataset used in the original DeepLog paper? The link for homepage of...