
Model output prediction limit of top 100

Open dmhenke opened this issue 9 months ago • 3 comments

At present, I do not see how to output more than the top 100 predicted hits from a model. For example:

```
# Train the model and list the document vocabulary
model <- paragraph2vec(x = df_d2v, type = "PV-DM")
vocab <- summary(model, type = "vocabulary", which = "docs")

# Build a named list of tokens for the query sentence
sentences <- "my bag of words"
sentences <- setNames(sentences, sentences)
sentences <- strsplit(sentences, split = " ")

# Find the documents nearest to the sentence
model_predictions <- predict(model, newdata = sentences, type = "nearest", which = "sent2doc", top_n = 100)
```

`dim(model_predictions)` shows at most 100 rows.

There appears to be no way to output predictions for a model that has more than 100 entries (`doc_id` values) in its "docs" vocabulary. Is there a workaround for generating predictions against every available `doc_id`?

Thank you,
David

dmhenke, Apr 07 '25 16:04