Wikipedia Latent Dirichlet allocation
A generative model that allows sets of observations to be explained by unobserved groups that explain why some parts of the data are similar. Models the intuition that the topic of a document will probabilistically influence the author’s choice of words when writing the document. Documents are interpreted as a mixture of topics (a probability distribution over topics), and topics as a probability distribution over words. Encodes the intuition that documents cover a small number of topics and that topics often use a small number of words LDA is an extension of [LSI/pLSI](latent_semantic_analysis)
10 Documents (Long List