-
Notifications
You must be signed in to change notification settings - Fork 103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Collapsed Gibbs Sampling for LDA #113
base: master
Are you sure you want to change the base?
Conversation
All automated tests passed. |
for (i <- 0 until nIter) { | ||
// Resample all the tokens | ||
graph = graph.mapTriplets { | ||
triplet => { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you make this a separate named function?
All automated tests passed. |
All automated tests passed. |
…on the NIPS corpus.
…mprovements in topK words.
All automated tests passed. |
All automated tests passed. |
This is a work in progress implementation of the collapsed Gibbs sampler for the LDA model using the GraphX abstraction primitives. While this is based on the (non-ergodic) bulk synchronous Gibbs sampler, we do exploit local parameter sharing and if document vertex partitioning is used we recover the Newman et al. style sampler.
Remaining tasks: