word2vec

This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used...

Results 22 word2vec issues

Is there any library or API available for generating embeddings of each line of a Java code file while preserving AST (Abstract Syntax Tree) structure information? I'm already familiar with...

Thanks to this project for the opportunity to learn. I ran into two points of confusion while building the dictionary and would appreciate answers. Thanks. 1. When using the `word2vec_model.wv.most_similar` function, I set...

The requested URL /svn/trunk/ was not found on this server.

The if statement `if (word == -1) continue;` will never evaluate to `true` because when filling the `sen` array, words with id `-1` are already being discarded by [this](https://github.com/dav/word2vec/blob/0f29b188b17145d0c1d0953ba0bc80a2208dd8a0/src/word2vec.c#L413) check...
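The dead-check argument above can be illustrated with a small sketch. This is not the repository's code: `fill_sentence`, `count_unknown`, and `UNKNOWN_ID` are hypothetical names, and the only assumption carried over from the issue is that unknown words have id `-1` and are skipped while the sentence buffer is filled.

```c
#include <assert.h>
#include <stddef.h>

#define UNKNOWN_ID (-1LL) /* hypothetical name for the "not in vocab" id */

/* Copy ids into sen, skipping unknown words; returns the sentence length. */
size_t fill_sentence(const long long *ids, size_t n, long long *sen, size_t max_len) {
  size_t len = 0;
  assert(ids != NULL && sen != NULL);
  for (size_t i = 0; i < n && len < max_len; i++) {
    if (ids[i] == UNKNOWN_ID) continue; /* unknown ids are dropped here, once */
    sen[len++] = ids[i];
  }
  return len;
}

/* Count unknown ids remaining in a sentence buffer. */
size_t count_unknown(const long long *sen, size_t len) {
  size_t hits = 0;
  for (size_t i = 0; i < len; i++) {
    if (sen[i] == UNKNOWN_ID) hits++; /* never true for fill_sentence output */
  }
  return hits;
}
```

Because every id is filtered before it is stored, a second `-1` test on values read back from the buffer can never fire, which is the redundancy the issue describes.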

`SortVocab` is removing the sentence end marker "</s>" from index 0 in the vocab. I *think* the intent of the original word2vec code is that newlines are replaced with...

I thought this was the Mac-savvy version of the code... but it's failing right out of the box for me on the first allocation (around line 350 of word2vec.c), because...

I have my own training set of 50k words (written in Devanagari script, an Indic language). How should I approach this? Do I need to change the file path in 'demo-word.sh' or help me...

Memory leaks detected. I'm running the word2vec program with a command line like: `./word2vec -train ./questions-words.txt -output out.txt`

```
=================================================================
==1469==ERROR: LeakSanitizer: detected memory leaks
Direct leak of 96 byte(s) in 1...
```

In the skip-gram part, the hidden -> output propagation uses this code: ` for (c = 0; c < layer1_size; c++) f += syn0[c + l1] *...
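For readers unfamiliar with the indexing in that loop, here is a minimal sketch of a dot product between two rows of flat weight arrays. The snippet above is truncated, so the second operand is unknown; `out`, `l2`, and the function name `dot_rows` are hypothetical stand-ins, and only the `array[c + offset]` pattern is taken from the issue.

```c
#include <assert.h>
#include <stddef.h>

/* Dot product of one row of `in` with one row of `out`.
   Both matrices are stored as flat arrays; l1 and l2 are element
   offsets of the chosen rows (row index * layer_size). */
float dot_rows(const float *in, size_t l1, const float *out, size_t l2, size_t layer_size) {
  float f = 0.0f;
  assert(in != NULL && out != NULL);
  for (size_t c = 0; c < layer_size; c++) {
    f += in[c + l1] * out[c + l2]; /* same c + offset indexing as the quoted loop */
  }
  return f;
}
```

With `layer_size = 2`, calling `dot_rows(in, 2, out, 0, 2)` multiplies the second row of `in` by the first row of `out`.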

If a `-read-vocab` file is specified, then all the word counts are 0; it seems `SortVocab()` will remove all words in such a case. https://github.com/dav/word2vec/blob/5f2e9661025c4f6b496c6a6888be0d090b9c44b3/src/word2vec.c#L323 https://github.com/dav/word2vec/blob/5f2e9661025c4f6b496c6a6888be0d090b9c44b3/src/word2vec.c#L155
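A minimal sketch of how a minimum-count filter would empty a vocabulary whose counts are all 0. It assumes the pruning rule is "drop entries whose count is below a threshold, but keep slot 0"; `struct vocab_entry` and `prune_vocab` are hypothetical simplifications, not the repository's `SortVocab()`, and the sorting step is omitted.

```c
#include <assert.h>
#include <stddef.h>

struct vocab_entry {   /* hypothetical, simplified vocabulary entry */
  const char *word;
  long long count;
};

/* Compact the array in place, dropping entries with count < min_count.
   Slot 0 is always kept (assumed reserved for the sentence-end marker).
   Returns the new vocabulary size. */
size_t prune_vocab(struct vocab_entry *vocab, size_t n, long long min_count) {
  size_t kept = 0;
  assert(vocab != NULL || n == 0);
  for (size_t i = 0; i < n; i++) {
    if (i != 0 && vocab[i].count < min_count) continue; /* below threshold */
    vocab[kept++] = vocab[i];
  }
  return kept;
}
```

Under these assumptions, any positive `min_count` removes every zero-count word and leaves only slot 0, which matches the symptom described in the issue.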