node2vec

Can node2vec be used with Chinese node names? My data format is similar to the following

Open · HJW3536 opened this issue 5 years ago • 1 comment

[image] The Chinese characters represent nodes, and the numbers represent weights. Has anyone done a similar experiment? Any advice would be appreciated.

HJW3536 avatar Dec 01 '20 02:12 HJW3536

My advice is to encode the node names as integer ids with a label encoder, like this:

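import numpy as np
from sklearn.preprocessing import LabelEncoder

# fit on the node columns only, so numeric weights are not mixed in with the names
le = LabelEncoder()
le.fit(df[['source', 'destination']].stack().unique())

# save label encoder (np.save pickles it as an object array)
np.save('label_encoder.npy', le)

# replace node names with their integer ids
df['source'] = le.transform(df['source'])
df['destination'] = le.transform(df['destination'])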

Then, to load the encoder later and convert between node names and ids:

le = np.load('label_encoder.npy', allow_pickle=True).tolist()

# get node_id
le.transform([node_name])[0]

# get node name
le.inverse_transform([node_id])[0]
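
For anyone who wants to try the idea without pandas or scikit-learn, here is a minimal dependency-free sketch of the same name-to-id mapping. It swaps the LabelEncoder for a plain dict and uses JSON instead of a pickled `.npy` file. The edge list, the node names, and the variable names are made up for illustration, and it assumes the node2vec implementation you use expects integer node ids.

```python
import json

# Hypothetical weighted edge list: (source, destination, weight)
edges = [
    ("北京", "上海", 3.0),
    ("上海", "广州", 1.5),
    ("广州", "北京", 2.0),
]

# Give every distinct node name an integer id, in sorted (code point) order
names = sorted({name for src, dst, _ in edges for name in (src, dst)})
name_to_id = {name: i for i, name in enumerate(names)}
id_to_name = {i: name for name, i in name_to_id.items()}

# Replace names with ids; the weights are left untouched
encoded_edges = [(name_to_id[s], name_to_id[d], w) for s, d, w in edges]

# Serialize the mapping; ensure_ascii=False keeps the Chinese text readable
payload = json.dumps(name_to_id, ensure_ascii=False)
restored = json.loads(payload)
```

`payload` can be written to a UTF-8 file and reloaded later, after which `restored[node_name]` gives the id and `id_to_name[node_id]` gives the name back once the embeddings are trained.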

shoegazerstella avatar Mar 08 '21 10:03 shoegazerstella