word2vec
word2vec copied to clipboard
This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used...
Is there any library or API available for generating embeddings of each line of a Java code file while preserving AST (Abstract Syntax Tree) structure information? I'm already familiar with...
Thanks to this project for the opportunity to learn. I encountered two confusions while building the dictionary and would like answers. Thanks. 1、When using the function of "word2vec_model.wv.most_similar", I set...
The requested URL /svn/trunk/ was not found on this server.
The if statement `if (word == -1) continue;` will never evaluate to `true` because when filling the `sen` array, words with id `-1` are already being discarded by [this](https://github.com/dav/word2vec/blob/0f29b188b17145d0c1d0953ba0bc80a2208dd8a0/src/word2vec.c#L413) check...
`SortVocab` is removing the sentence end marker "\" from the index 0 in the vocab. I *think* the intent of the original word2vec code is that newlines are replaced with...
I thought this was the Mac-savvy version of the code... but it's failing right out of the box for me on the first allocation (around line 350 of word2vec.c), because...
I have my own training set of 50k words(written in Devanagari script/Indic language), how should I approach? Do I need to change the file path in 'demo-word.sh' or help me...
Memory leaks detected. I'm running program word2vec with command line like: ./word2vec -train ./questions-words.txt -output out.txt ``` ================================================================= ==1469==ERROR: LeakSanitizer: detected memory leaks Direct leak of 96 byte(s) in 1...
In the skipgram part, when computing propagate hidden -> output , use this code : ` for (c = 0; c < layer1_size; c++) f += syn0[c + l1] *...
If a `-read-vocab` file is specified, then all their counts are 0, it seems `SortVocab()` will remove all words in such case. https://github.com/dav/word2vec/blob/5f2e9661025c4f6b496c6a6888be0d090b9c44b3/src/word2vec.c#L323 https://github.com/dav/word2vec/blob/5f2e9661025c4f6b496c6a6888be0d090b9c44b3/src/word2vec.c#L155