Interpretable Architectures and Algorithms for Natural Language Processing

This thesis has two parts: Firstly, we introduce the human level-interpretable models using Tsetlin Machine (TM) for NLP tasks. Secondly, we present an interpretable model using DNNs. The first part combines several architectures of various NLP tasks using TM along with its robustness. We use this m...

Full description

Saved in:
Bibliographic Details
Published inDoctoral Dissertations at the University of Agder
Main Author Yadav, Rohan Kumar
Format Dissertation
LanguageEnglish
Published University of Agder 2022
Online AccessGet more information

Cover

Loading…
Abstract This thesis has two parts: Firstly, we introduce the human level-interpretable models using Tsetlin Machine (TM) for NLP tasks. Secondly, we present an interpretable model using DNNs. The first part combines several architectures of various NLP tasks using TM along with its robustness. We use this model to propose logic-based text classification. We start with basic Word Sense Disambiguation (WSD), where we employ TM to design novel interpretation techniques using the frequency of words in the clause. We then tackle a new problem in NLP, i.e., aspect-based text classification using a novel feature engineering for TM. Since TM operates on Boolean features, it relies on Bag-of-Words (BOW), making it difficult to use pre-trained word embedding like Glove, word2vec, and fasttext. Hence, we designed a Glove embedded TM to significantly enhance the model’s performance. In addition to this, NLP models are sensitive to distribution bias because of spurious correlations. Hence we employ TM to design a robust text classification against spurious correlations. The second part of the thesis consists interpretable model using DNN where we design a simple solution for complex position dependent NLP task. Since TM’s interpretability comes with the cost of performance, we propose an DNN-based architecture using a masking scheme on LSTM/GRU based models that ease the interpretation for humans using the attention mechanism. At last, we take the advantages of both models and design an ensemble model by integrating TM’s interpretable information into DNN for better visualization of attention weights. Our proposed model can be efficiently integrated to have a fully explainable model for NLP that assists trustable AI. Overall, our model shows excellent results and interpretation in several open-sourced NLP datasets. Thus, we believe that by combining the novel interpretation of TM, the masking technique in the neural network, and the integrated ensemble model, we can build a simple yet effective platform for explainable NLP applications wherever necessary.
AbstractList This thesis has two parts: Firstly, we introduce the human level-interpretable models using Tsetlin Machine (TM) for NLP tasks. Secondly, we present an interpretable model using DNNs. The first part combines several architectures of various NLP tasks using TM along with its robustness. We use this model to propose logic-based text classification. We start with basic Word Sense Disambiguation (WSD), where we employ TM to design novel interpretation techniques using the frequency of words in the clause. We then tackle a new problem in NLP, i.e., aspect-based text classification using a novel feature engineering for TM. Since TM operates on Boolean features, it relies on Bag-of-Words (BOW), making it difficult to use pre-trained word embedding like Glove, word2vec, and fasttext. Hence, we designed a Glove embedded TM to significantly enhance the model’s performance. In addition to this, NLP models are sensitive to distribution bias because of spurious correlations. Hence we employ TM to design a robust text classification against spurious correlations. The second part of the thesis consists interpretable model using DNN where we design a simple solution for complex position dependent NLP task. Since TM’s interpretability comes with the cost of performance, we propose an DNN-based architecture using a masking scheme on LSTM/GRU based models that ease the interpretation for humans using the attention mechanism. At last, we take the advantages of both models and design an ensemble model by integrating TM’s interpretable information into DNN for better visualization of attention weights. Our proposed model can be efficiently integrated to have a fully explainable model for NLP that assists trustable AI. Overall, our model shows excellent results and interpretation in several open-sourced NLP datasets. Thus, we believe that by combining the novel interpretation of TM, the masking technique in the neural network, and the integrated ensemble model, we can build a simple yet effective platform for explainable NLP applications wherever necessary.
Author Yadav, Rohan Kumar
Author_xml – sequence: 1
  fullname: Yadav, Rohan Kumar
BookMark eNqNyj0OwjAMQOEMMPB3B3MApITSgbFCIEAIMbBXJrhppOAgO70_DByA6Q3fm5oRZ6aJOZ-4kLyFCj4SQSO-j4V8GYQUkJ_QpJAllv6l0GWBK34JE1yQw4CB4CbZk2rkMDfjDpPS4teZWR72991x5SVqidxyFmydW9e2rWy1qd3W_vN8AGpJNro
ContentType Dissertation
Copyright info:eu-repo/semantics/openAccess
Copyright_xml – notice: info:eu-repo/semantics/openAccess
DBID 3HK
DatabaseName NORA - Norwegian Open Research Archives
DatabaseTitleList
DeliveryMethod no_fulltext_linktorsrc
ExternalDocumentID 11250_3034519
GroupedDBID 3HK
ID FETCH-cristin_nora_11250_30345190
IngestDate Wed Nov 30 04:21:13 EST 2022
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-cristin_nora_11250_30345190
Notes Doctoral Dissertations at the University of Agder; no. 388
Yadav, R. K. (2022). Interpretable Architectures and Algorithms for Natural Language Processing [PhD. thesis]. University of Agder.
OpenAccessLink http://hdl.handle.net/11250/3034519
ParticipantIDs cristin_nora_11250_3034519
PublicationCentury 2000
PublicationDate 2022
PublicationDateYYYYMMDD 2022-01-01
PublicationDate_xml – year: 2022
  text: 2022
PublicationDecade 2020
PublicationTitle Doctoral Dissertations at the University of Agder
PublicationYear 2022
Publisher University of Agder
Publisher_xml – name: University of Agder
Score 3.9787595
Snippet This thesis has two parts: Firstly, we introduce the human level-interpretable models using Tsetlin Machine (TM) for NLP tasks. Secondly, we present an...
SourceID cristin
SourceType Open Access Repository
Title Interpretable Architectures and Algorithms for Natural Language Processing
URI http://hdl.handle.net/11250/3034519
hasFullText
inHoldings
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV09C8IwED38QBAXRcVPiOAarCEN7ShqEVEnBbfSpqcOWgf9_3gJFYugcyA5yJF37-7xAjBOiEZEWkg-RefEqf6fck-6yJWv0RAEFHYUs92p1UGuj-6xAO8P777sBagYcJ0JvbLGBKUIRWHEyhVt0z_N4UBQh9oiN79uQAHTJqw_wr34imyWa80_GJ3CZtfznTj45fZgVCKyXWS9LtgmaxWyTKxPINKCUbDcz1c8OzxM6YZCG1-Yxee0oUSUHTvAHEQXZYyE74nUvvK0p2JjbpdESgvf60Lv9z69f4t9qAojwLdNgAGUT5SkOCRcfMYvXjlowg
link.rule.ids 230,312,786,891,4071
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adissertation&rft.genre=dissertation&rft.title=Interpretable+Architectures+and+Algorithms+for+Natural+Language+Processing&rft.DBID=3HK&rft.au=Yadav%2C+Rohan+Kumar&rft.date=2022&rft.pub=University+of+Agder&rft.externalDBID=n%2Fa&rft.externalDocID=11250_3034519