Data Analysis in Politics and Journalism Winter /Spring 2019. Introduction to topic modelling. Seminar 3
- Главная
- Информатика
- Data Analysis in Politics and Journalism Winter /Spring 2019. Introduction to topic modelling. Seminar 3

Содержание
- 2. What is topic modelling Automatically identifying major themes in a text, usually by identifying informative words.
- 3. B.S. ? Detector Chrome extension Searches all links on a given webpage for references to unreliable
- 4. Dataset 244 websites 12999 news Tagged as “bullshit” by B.S. Detector: "bias" "conspiracy” "fake" "bs" (unlabeled)
- 6. Скачать презентацию
Слайд 2What is topic modelling
Automatically identifying major themes in a text, usually by
What is topic modelling
Automatically identifying major themes in a text, usually by

identifying informative words.
Two main uses:
identifying major topics in unlabeled texts;
identify which words are important for text that is labeled for topic
Two main uses:
identifying major topics in unlabeled texts;
identify which words are important for text that is labeled for topic
Слайд 3B.S. ? Detector
Chrome extension
Searches all links on a given webpage for references to unreliable
B.S. ? Detector
Chrome extension
Searches all links on a given webpage for references to unreliable

sources,
Classification:
Fake News: Sources that fabricate stories out of whole cloth with the intent of pranking the public.
Satire: Sources that provide humorous commentary on current events in the form of fake news.
Extreme Bias: Sources that traffic in political propaganda and gross distortions of fact.
Conspiracy Theory: Sources that are well-known promoters of kooky conspiracy theories.
Rumor Mill: Sources that traffic in rumors, innuendo, and unverified claims.
State News: Sources in repressive states operating under government sanction.
Junk Science: Sources that promote pseudoscience, metaphysics, and other scientifically dubious claims.
Hate Group: Sources that actively promote racism, and other forms of discrimination.
Clickbait: Sources that are aimed at generating online advertising revenue and rely on sensationalist headlines or eye-catching pictures.
Proceed With Caution: Sources that may be reliable but whose contents require further verification.
Classification:
Fake News: Sources that fabricate stories out of whole cloth with the intent of pranking the public.
Satire: Sources that provide humorous commentary on current events in the form of fake news.
Extreme Bias: Sources that traffic in political propaganda and gross distortions of fact.
Conspiracy Theory: Sources that are well-known promoters of kooky conspiracy theories.
Rumor Mill: Sources that traffic in rumors, innuendo, and unverified claims.
State News: Sources in repressive states operating under government sanction.
Junk Science: Sources that promote pseudoscience, metaphysics, and other scientifically dubious claims.
Hate Group: Sources that actively promote racism, and other forms of discrimination.
Clickbait: Sources that are aimed at generating online advertising revenue and rely on sensationalist headlines or eye-catching pictures.
Proceed With Caution: Sources that may be reliable but whose contents require further verification.
Слайд 4Dataset
244 websites
12999 news
Tagged as “bullshit” by B.S. Detector:
"bias"
"conspiracy”
"fake"
"bs" (unlabeled)
"satire"
"hate"
Dataset
244 websites
12999 news
Tagged as “bullshit” by B.S. Detector:
"bias"
"conspiracy”
"fake"
"bs" (unlabeled)
"satire"
"hate"

"junksci”
"state"
Следующая -
Музей Истоки
Что такое WWW. Информация и информационные процессы
Продлить членство в SPE
Самоидентификация в социальных медиа
Унифицированный язык моделирования
Информатика. Тема 1. Введение в информатику. Информация и её свойства
Табличные модели, диаграммы
Клуб RedSquare
Электронная регистрация прав на объекты недвижимости
Информационные технологии в экологии. Часть 2
422196
Информ_лек2_информатика компьютинг (2)
История развития вычислительной техники
Организация как система
Политика и модели безопасности в компьютерных системах
Перспективы развития компьютерной сети МГУЛ Интернет-центр МГУЛ 2002
Основы программирования. Язык программирования С++. Массивы
Расхождение верстки
Графический редактор Paint как инструмент для создания и обработки графики
Информация. Восприятие информации
Пошаговый инструкция по подготовке конкурсных работ. Номинация память о подвиге вечно храним
Современная библиотека
Основы передачи данных. Принципы построения сетей
Дизайн методом Value-Link (ER-метод). Реляционная алгебра
Компьютерное зрение. Математика в задачах обработки изображений
Алгоритмы с ветвящейся структурой. Контрольная работа
Группа компаний ОАО ММК. Сбор и анализ информации по социальному направлению
О введении компьютерного формата ЕНТ в 2021 году
Van Game's