From intuition to intelligence: a text mining–based approach for movies' green-lighting process

PurposeThe purpose of this paper is to develop a predictive model for box office performance based on the textual information in movie scripts in the green-lighting process of movie production.Design/methodology/approachThe authors use Latent Dirichlet Allocation to determine the hidden textual stru...

Full description

Saved in:
Bibliographic Details
Published inInternet research Vol. 32; no. 3; pp. 1003 - 1022
Main Authors Kim, Jongdae, Lee, Youseok, Song, Inseong
Format Journal Article
LanguageEnglish
Published Bradford Emerald Publishing Limited 09.05.2022
Emerald Group Publishing Limited
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:PurposeThe purpose of this paper is to develop a predictive model for box office performance based on the textual information in movie scripts in the green-lighting process of movie production.Design/methodology/approachThe authors use Latent Dirichlet Allocation to determine the hidden textual structure in movie scripts by extracting topic probabilities as predictors for classification. The extracted topic probabilities are used as inputs for the predictive model for the box office performance. For the predictive model, the authors utilize a variety of classification algorithms such as logistic classification, decision trees, random forests, k-nearest neighbor algorithms, support vector machines and artificial neural networks, and compare their relative performances in predicting movies' market performance.FindingsThis approach for extracting textual information from movie scripts produces a valuable typology for movies. Moreover, our modeling approach has significant power to predict movie scripts' profitability. It provides a superior prediction performance compared to previous benchmarks, such as that of Eliashberg et al. (2007).Research limitations/implicationsThis work contributes to literature on predicting the box office performance in the green-lighting process and literature regarding suggesting models for the idea screening stage in the new product development process. Besides, this is one of the few studies that use movie script data to predict movies' financial performance by proposing an approach to integrate text mining models and machine learning algorithms with movie experts' intuition.Practical implicationsFirst, the authors’ approach can significantly reduce the financial risk associated with movie production decisions before the pre-production stage. Second, this paper proposes an approach that is applicable at a very early stage of new product development, such as the idea screening stage. The authors also introduce an online-based movie scenario database system that can help movie studios make more systematic and profitable decisions in the green-lighting process. Third, this approach can help movie studios estimate movie scripts' financial value.Originality/valueThis study is one of the few studies to forecast market performance in the green-lighting process.
ISSN:1066-2243
2054-5657
DOI:10.1108/INTR-11-2020-0651