LIU, Jinyang comments

Results 17 comments of


                                            LIU, Jinyang

How do you train GCN on such a large dataset "Amazon-Book"?

Oh, the adjacency matrix for the dataset is 52643 * 91599, which is OK! But how to deal with the dataset like 5M * 0.9B?

How do you train GCN on such a large dataset "Amazon-Book"?

@Jhy1993 It's true, but sampling may miss some chance to aggregate enough nodes, e.g, node A needs to aggregate 100 nodes but finally gets 10 nodes' information due to sampling.

hdfs_train sequence file doesn't correspond to the sequence file generated for 100k structured file provided in the repository

Same question.

hdfs_train sequence file doesn't correspond to the sequence file generated for 100k structured file provided in the repository

@ZanisAli I am sorry I do not really understand that, I found that training data can seriously affect the results, could you please explain how to get the hdfs_train?

hdfs_train sequence file doesn't correspond to the sequence file generated for 100k structured file provided in the repository

@ZanisAli I would like to use the semantic information for each tempalte instead of only IDs, so I want to know how hdfs_train is got from the raw log data...

hdfs_train sequence file doesn't correspond to the sequence file generated for 100k structured file provided in the repository

@ZanisAli Thanks for your helpful advice! I checked Issue#3, which is really what I want, but in the code the author provided, the mapping eventid2template.json is missing, which should be...

hdfs_train sequence file doesn't correspond to the sequence file generated for 100k structured file provided in the repository

@ZanisAli Many thanks!!!!!!

hdfs_train sequence file doesn't correspond to the sequence file generated for 100k structured file provided in the repository

@ZanisAli But the file templates.txt for HDFS is also missing.

hdfs_train sequence file doesn't correspond to the sequence file generated for 100k structured file provided in the repository

@ZanisAli Really thanks for your help, I understand how to generate the templates by myself, but the question is I would like to get exactly the same templates-id mapping used...

hdfs_train sequence file doesn't correspond to the sequence file generated for 100k structured file provided in the repository

@ZanisAli Great, thanks for your help. BTW, do you happen to know where to download the full OpenStack dataset used in the original DeepLog paper? The link for homepage of...