Seminar 4. Probabilistic Topic Model

Март 1, 2021

Главная
Английский язык
Seminar 4. Probabilistic Topic Model

Содержание

2. Topic modeling Models of a collection of composites Composites are documents Parts are words (or phrases,
3. Assumptions semantic information can be derived from a word-document co-occurrence matrix; topic is a probability distribution
4. Generative model
5. Probabilistic model
6. Dirichlet distribution
7. Geometric interpretation
9. Скачать презентацию

Слайд 2

Topic modeling
Models of a collection of composites
Composites are documents
Parts are words (or

Topic modeling Models of a collection of composites Composites are documents Parts

phrases, n-grams)
Two outputs:
chance of selecting a particular part when sampling a particular topic
chance of selecting a particular topic when sampling a particular document or composite

Слайд 3

Assumptions
semantic information can be derived from a word-document co-occurrence matrix;
topic is a

Assumptions semantic information can be derived from a word-document co-occurrence matrix; topic

probability distribution over words
to make a new document, one chooses a distribution over topics
for each word in that document, one chooses a topic at random according to this distribution, and draws a word from that topic.
Resulting document is a mixture of topics

Слайд 4

Generative model

Generative model

Слайд 5

Probabilistic model

Probabilistic model

Слайд 6

Dirichlet distribution

Dirichlet distribution

Слайд 7

Geometric interpretation

Geometric interpretation