L-EnsNMF: Boosted Local Topic Discovery via
Ensemble of Nonnegative Matrix Factorization
Sangho Suh¹, Jaegul Choo¹,
Joonseok Lee², Chandan K. Reddy³
¹Korea University, ²Google Research, ³Virginia Tech
Sampled topics from papers containing the keywords 'dimension' or 'reduction'
Proposed Idea
Local topic discovery to extract
more specific, informative topics
For local topic discovery,
1) Iterate -> Ensemble
2) Boost & Suppress -> Local weighting scheme
=> Localized Ensemble of Nonnegative Matrix Factorization (L-EnsNMF)
1) NMF Topic Modeling
-> Find a set of topics
2) Residual Update
-> Identify unexplained parts (e.g., Egyptian cat)
3) Anchor Sampling & Local Weighting
-> Reveal unexplained parts and suppress explained parts
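The three steps above can be sketched as a short loop. This is a minimal illustration under stated assumptions, not the authors' implementation (their MATLAB code is linked at the end): the NMF solver is a plain multiplicative-update routine, the anchor is sampled in proportion to residual mass, and the local weights are cosine similarities to the anchor row and column. The function names `nmf`, `cosine_to`, and `local_ensemble_nmf` and all parameters are hypothetical.

```python
import numpy as np


def nmf(X, k, n_iter=200, seed=0, eps=1e-10):
    """Rank-k NMF, X ~ W @ H, via multiplicative updates (a stand-in solver)."""
    rng = np.random.default_rng(seed)
    m, n = X.shape
    W = rng.random((m, k)) + eps
    H = rng.random((k, n)) + eps
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H


def cosine_to(vec, M, eps=1e-10):
    """Cosine similarity between vec and every row of M."""
    return (M @ vec) / (np.linalg.norm(M, axis=1) * np.linalg.norm(vec) + eps)


def local_ensemble_nmf(X, k=2, n_stages=3, seed=0):
    """Simplified sketch: X is a nonnegative keyword-by-document matrix."""
    rng = np.random.default_rng(seed)
    R = X.astype(float).copy()      # residual: parts not yet explained
    Rw = R.copy()                   # locally weighted residual fed to NMF
    stages = []
    norms = [np.linalg.norm(R)]     # residual norm before any stage
    for s in range(n_stages):
        # 1) NMF topic modeling: find a set of k topics
        W, H = nmf(Rw, k, seed=seed + s)
        stages.append((W, H))

        # 2) Residual update: subtract what is explained, clip at zero
        R = np.maximum(R - W @ H, 0.0)
        norms.append(np.linalg.norm(R))
        if R.sum() <= 0:
            break                   # nothing left to explain

        # 3a) Anchor sampling (assumed scheme): keyword row i in proportion
        #     to its residual mass, then document column j within that row
        row_p = R.sum(axis=1)
        i = rng.choice(R.shape[0], p=row_p / row_p.sum())
        j = rng.choice(R.shape[1], p=R[i] / R[i].sum())

        # 3b) Local weighting: boost rows/columns similar to the anchor,
        #     suppress the rest
        d_row = cosine_to(R[i], R)          # one weight per keyword
        d_col = cosine_to(R[:, j], R.T)     # one weight per document
        Rw = d_row[:, None] * R * d_col[None, :]
    return stages, norms
```

Calling `local_ensemble_nmf(X, k=2, n_stages=3)` on a nonnegative keyword-by-document matrix returns one `(W, H)` pair per stage, so the ensemble holds up to `k * n_stages` topics. Because each stage subtracts a nonnegative product and clips at zero, the residual norm cannot increase from one stage to the next.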
We generated 100 topics (10 keywords each), but only L-EnsNMF extracted local, specific keywords, e.g., ‘hurrican’, ‘sandi’, and ‘ireland’.
Dataset: Twitter (New York City in June 2013)
The Ireland football team visited New York City in June 2013
to boost a community hit by Hurricane Sandy in 2012.
Thank You Very Much
Questions?
E-mail: sh31659@gmail.com
Code: https://github.com/sanghosuh/lens_nmf-matlab