Wikipedia
Latent Dirichlet allocation

A generative model that allows sets of observations to be explained by unobserved groups that explain why some parts of the data are similar.
Models the intuition that the topic of a document will probabilistically influence the author's choice of words when writing the document. Documents are interpreted as a mixture of topics (a probability distribution over topics), and topics as a probability distribution over words.
Encodes the intuition that each document covers only a small number of topics and that each topic frequently uses only a small number of words.
LDA is an extension of [LSI/pLSI](latent_semantic_analysis)
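
The generative story described above (documents as mixtures of topics, topics as distributions over words) can be sketched in a few lines of Python. This is only an illustrative sketch using the standard library, not an inference algorithm and not how any particular library implements LDA. The function names, the toy vocabulary, and the symmetric Dirichlet parameters `alpha` and `beta` are assumptions made for the example; small values of both are chosen to mimic the "few topics per document, few words per topic" intuition.

```python
import random


def sample_dirichlet(concentrations, rng):
    """Draw one Dirichlet sample by normalizing independent Gamma draws."""
    draws = [rng.gammavariate(c, 1.0) for c in concentrations]
    total = sum(draws)
    return [d / total for d in draws]


def generate_corpus(num_docs, doc_len, vocab, num_topics,
                    alpha=0.1, beta=0.1, seed=0):
    """Sample a toy corpus following the LDA generative process.

    alpha and beta are assumed symmetric Dirichlet concentrations
    (hypothetical defaults): small values give sparse mixtures.
    """
    rng = random.Random(seed)
    # Each topic is a probability distribution over the vocabulary.
    topics = [sample_dirichlet([beta] * len(vocab), rng)
              for _ in range(num_topics)]
    docs = []
    for _ in range(num_docs):
        # Each document is a probability distribution over topics.
        theta = sample_dirichlet([alpha] * num_topics, rng)
        words = []
        for _ in range(doc_len):
            # Pick a topic for this word position, then a word from that topic.
            z = rng.choices(range(num_topics), weights=theta)[0]
            words.append(rng.choices(vocab, weights=topics[z])[0])
        docs.append((theta, words))
    return topics, docs


# Toy usage with a made-up vocabulary.
toy_vocab = ["gene", "dna", "ball", "team", "vote", "law"]
toy_topics, toy_docs = generate_corpus(3, 8, toy_vocab, num_topics=2)
for theta, words in toy_docs:
    print([round(p, 2) for p in theta], " ".join(words))
```

Fitting LDA means going the other way: given only the words, recover plausible topic distributions and per-document mixtures.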

7 Documents

- Gaussian LDA for Topic Models with Word Embeddings (2015)

2017-11-21 - Using Gensim for LDA (notebook)

2017-06-02 - Introduction to Latent Dirichlet Allocation

2017-06-02 - pyLDAvis

Python library for interactive topic model visualization. Designed to help users interpret the topics.

See also another notebook dedicated to using it with gensim (includes nltk_stopwords, ...).

2017-06-02 - Latent Dirichlet Allocation: stability

2014-06-26 - Provable Algorithms for Machine Learning Problems by Rong Ge.

from the abstract:

Modern machine learning algorithms can extract useful information from text, images and videos. All these applications involve solving NP-hard problems in average case using heuristics. What properties of the input allow it to be solved efficiently? Theoretically analyzing the heuristics is very challenging. Few results were known.

This thesis takes a different approach: we identify natural properties of the input, then design new algorithms that provably works assuming the input has these properties. We are able to give new, provable and sometimes practical algorithms for learning tasks related to text corpus, images and social networks.

...In theory, the assumptions in this thesis help us understand why intractable problems in machine learning can often be solved; in practice, the results suggest inherently new approaches for machine learning.

2014-04-23 - Real-Time Topic Modeling of Microblogs

2014-04-22

Properties

- sl:creationDate : 2013-08-22
- sl:creationTime : 2013-08-22T11:22:59Z
- sl:describedBy : https://en.wikipedia.org/wiki/Latent_Dirichlet_allocation
- rdf:type : sl:Tag
- skos:altLabel : LDA@en
- skos:prefLabel : Latent Dirichlet allocation@en