LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks

Recurrent neural networks, and in particular long short-term memory (LSTM) networks, are a remarkably effective tool for sequence modeling that learn a dense black-box hidden representation of their sequential input. Researchers interested in better understanding these models have studied the change...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on visualization and computer graphics Vol. 24; no. 1; pp. 667 - 676
Main Authors	Strobelt, Hendrik, Gehrmann, Sebastian, Pfister, Hanspeter, Rush, Alexander M.
Format	Journal Article
Language	English
Published	United States IEEE 01.01.2018 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Annotations Computational modeling Data models LSTM Machine Learning Nesting Neural networks Pattern matching Progressions Recurrent neural networks Representations Statistical analysis Visualization
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Recurrent neural networks, and in particular long short-term memory (LSTM) networks, are a remarkably effective tool for sequence modeling that learn a dense black-box hidden representation of their sequential input. Researchers interested in better understanding these models have studied the changes in hidden state representations over time and noticed some interpretable patterns but also significant noise. In this work, we present LSTMVis, a visual analysis tool for recurrent neural networks with a focus on understanding these hidden state dynamics. The tool allows users to select a hypothesis input range to focus on local state changes, to match these states changes to similar patterns in a large data set, and to align these results with structural annotations from their domain. We show several use cases of the tool for analyzing specific hidden state properties on dataset containing nesting, phrase structure, and chord progressions, and demonstrate how the tool can be used to isolate patterns for further statistical analysis. We characterize the domain, the different stakeholders, and their goals and tasks. Long-term usage data after putting the tool online revealed great interest in the machine learning community.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1077-2626 1941-0506
DOI:	10.1109/TVCG.2017.2744158