graphx icon indicating copy to clipboard operation
graphx copied to clipboard

Collapsed Gibbs Sampling for LDA

Open jegonzal opened this issue 12 years ago • 5 comments

This is a work in progress implementation of the collapsed Gibbs sampler for the LDA model using the GraphX abstraction primitives. While this is based on the (non-ergodic) bulk synchronous Gibbs sampler, we do exploit local parameter sharing and if document vertex partitioning is used we recover the Newman et al. style sampler.

Remaining tasks:

  • [ ] Unite tests
  • [ ] Raw document processing
  • [ ] Likelihood calculation (requires log gamma transcendental functions)

jegonzal avatar Dec 19 '13 23:12 jegonzal

All automated tests passed. Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/GraphXPullRequestBuilder/5934/

AmplabJenkins avatar Dec 20 '13 00:12 AmplabJenkins

All automated tests passed. Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/GraphXPullRequestBuilder/9536/

AmplabJenkins avatar Jan 02 '14 08:01 AmplabJenkins

All automated tests passed. Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/GraphXPullRequestBuilder/9695/

AmplabJenkins avatar Jan 02 '14 23:01 AmplabJenkins

All automated tests passed. Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/GraphXPullRequestBuilder/10213/

AmplabJenkins avatar Jan 04 '14 23:01 AmplabJenkins

All automated tests passed. Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/GraphXPullRequestBuilder/10215/

AmplabJenkins avatar Jan 05 '14 00:01 AmplabJenkins