Measuring Political Sentiment on Twitter: Factor Optimal Design for Multinomial Inverse Regression

This article presents a short case study in text analysis: the scoring of Twitter posts for positive, negative, or neutral sentiment directed toward particular U.S. politicians. The study requires selection of a subsample of representative posts for sentiment scoring, a common and costly aspect of s...

Full description

Saved in:
Bibliographic Details
Published inTechnometrics Vol. 55; no. 4; pp. 415 - 425
Main Author Taddy, Matt
Format Journal Article
LanguageEnglish
Published Alexandria Taylor & Francis Group 01.11.2013
American Society for Quality and the American Statistical Association
American Society for Quality
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This article presents a short case study in text analysis: the scoring of Twitter posts for positive, negative, or neutral sentiment directed toward particular U.S. politicians. The study requires selection of a subsample of representative posts for sentiment scoring, a common and costly aspect of sentiment mining. As a general contribution, our application is preceded by a proposed algorithm for maximizing sampling efficiency. In particular, we outline and illustrate greedy selection of documents to build designs that are D-optimal in a topic-factor decomposition of the original text. The strategy is applied to our motivating dataset of political posts, and we outline a new technique for predicting both generic and subject-specific document sentiment through the use of variable interactions in multinomial inverse regression. Results are presented for analysis of 2.1 million Twitter posts collected around February 2012. Computer codes and data are provided as supplementary material online.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ISSN:0040-1706
1537-2723
DOI:10.1080/00401706.2013.778791