Interpretable Architectures and Algorithms for Natural Language Processing

This thesis has two parts: Firstly, we introduce the human level-interpretable models using Tsetlin Machine (TM) for NLP tasks. Secondly, we present an interpretable model using DNNs. The first part combines several architectures of various NLP tasks using TM along with its robustness. We use this m...

Full description

Saved in:

Bibliographic Details
Published in	Doctoral Dissertations at the University of Agder
Main Author	Yadav, Rohan Kumar
Format	Dissertation
Language	English
Published	University of Agder 2022
Online Access	Get more information

Cover

Loading…

Abstract	This thesis has two parts: Firstly, we introduce the human level-interpretable models using Tsetlin Machine (TM) for NLP tasks. Secondly, we present an interpretable model using DNNs. The first part combines several architectures of various NLP tasks using TM along with its robustness. We use this model to propose logic-based text classification. We start with basic Word Sense Disambiguation (WSD), where we employ TM to design novel interpretation techniques using the frequency of words in the clause. We then tackle a new problem in NLP, i.e., aspect-based text classification using a novel feature engineering for TM. Since TM operates on Boolean features, it relies on Bag-of-Words (BOW), making it difficult to use pre-trained word embedding like Glove, word2vec, and fasttext. Hence, we designed a Glove embedded TM to significantly enhance the model’s performance. In addition to this, NLP models are sensitive to distribution bias because of spurious correlations. Hence we employ TM to design a robust text classification against spurious correlations. The second part of the thesis consists interpretable model using DNN where we design a simple solution for complex position dependent NLP task. Since TM’s interpretability comes with the cost of performance, we propose an DNN-based architecture using a masking scheme on LSTM/GRU based models that ease the interpretation for humans using the attention mechanism. At last, we take the advantages of both models and design an ensemble model by integrating TM’s interpretable information into DNN for better visualization of attention weights. Our proposed model can be efficiently integrated to have a fully explainable model for NLP that assists trustable AI. Overall, our model shows excellent results and interpretation in several open-sourced NLP datasets. Thus, we believe that by combining the novel interpretation of TM, the masking technique in the neural network, and the integrated ensemble model, we can build a simple yet effective platform for explainable NLP applications wherever necessary.
AbstractList	This thesis has two parts: Firstly, we introduce the human level-interpretable models using Tsetlin Machine (TM) for NLP tasks. Secondly, we present an interpretable model using DNNs. The first part combines several architectures of various NLP tasks using TM along with its robustness. We use this model to propose logic-based text classification. We start with basic Word Sense Disambiguation (WSD), where we employ TM to design novel interpretation techniques using the frequency of words in the clause. We then tackle a new problem in NLP, i.e., aspect-based text classification using a novel feature engineering for TM. Since TM operates on Boolean features, it relies on Bag-of-Words (BOW), making it difficult to use pre-trained word embedding like Glove, word2vec, and fasttext. Hence, we designed a Glove embedded TM to significantly enhance the model’s performance. In addition to this, NLP models are sensitive to distribution bias because of spurious correlations. Hence we employ TM to design a robust text classification against spurious correlations. The second part of the thesis consists interpretable model using DNN where we design a simple solution for complex position dependent NLP task. Since TM’s interpretability comes with the cost of performance, we propose an DNN-based architecture using a masking scheme on LSTM/GRU based models that ease the interpretation for humans using the attention mechanism. At last, we take the advantages of both models and design an ensemble model by integrating TM’s interpretable information into DNN for better visualization of attention weights. Our proposed model can be efficiently integrated to have a fully explainable model for NLP that assists trustable AI. Overall, our model shows excellent results and interpretation in several open-sourced NLP datasets. Thus, we believe that by combining the novel interpretation of TM, the masking technique in the neural network, and the integrated ensemble model, we can build a simple yet effective platform for explainable NLP applications wherever necessary.
Author	Yadav, Rohan Kumar
Author_xml	– sequence: 1 fullname: Yadav, Rohan Kumar
BookMark	eNqNyj0OwjAMQOEMMPB3B3MApITSgbFCIEAIMbBXJrhppOAgO70_DByA6Q3fm5oRZ6aJOZ-4kLyFCj4SQSO-j4V8GYQUkJ_QpJAllv6l0GWBK34JE1yQw4CB4CbZk2rkMDfjDpPS4teZWR72991x5SVqidxyFmydW9e2rWy1qd3W_vN8AGpJNro
ContentType	Dissertation
Copyright	info:eu-repo/semantics/openAccess
Copyright_xml	– notice: info:eu-repo/semantics/openAccess
DBID	3HK
DatabaseName	NORA - Norwegian Open Research Archives
DatabaseTitleList
DeliveryMethod	no_fulltext_linktorsrc
ExternalDocumentID	11250_3034519
GroupedDBID	3HK
ID	FETCH-cristin_nora_11250_30345190
IngestDate	Wed Nov 30 04:21:13 EST 2022
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-cristin_nora_11250_30345190
Notes	Doctoral Dissertations at the University of Agder; no. 388 Yadav, R. K. (2022). Interpretable Architectures and Algorithms for Natural Language Processing [PhD. thesis]. University of Agder.
OpenAccessLink	http://hdl.handle.net/11250/3034519
ParticipantIDs	cristin_nora_11250_3034519
PublicationCentury	2000
PublicationDate	2022
PublicationDateYYYYMMDD	2022-01-01
PublicationDate_xml	– year: 2022 text: 2022
PublicationDecade	2020
PublicationTitle	Doctoral Dissertations at the University of Agder
PublicationYear	2022
Publisher	University of Agder
Publisher_xml	– name: University of Agder
Score	3.9787595
Snippet	This thesis has two parts: Firstly, we introduce the human level-interpretable models using Tsetlin Machine (TM) for NLP tasks. Secondly, we present an...
SourceID	cristin
SourceType	Open Access Repository
Title	Interpretable Architectures and Algorithms for Natural Language Processing
URI	http://hdl.handle.net/11250/3034519
hasFullText
inHoldings
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV09C8IwED38QBAXRcVPiOAarCEN7ShqEVEnBbfSpqcOWgf9_3gJFYugcyA5yJF37-7xAjBOiEZEWkg-RefEqf6fck-6yJWv0RAEFHYUs92p1UGuj-6xAO8P777sBagYcJ0JvbLGBKUIRWHEyhVt0z_N4UBQh9oiN79uQAHTJqw_wr34imyWa80_GJ3CZtfznTj45fZgVCKyXWS9LtgmaxWyTKxPINKCUbDcz1c8OzxM6YZCG1-Yxee0oUSUHTvAHEQXZYyE74nUvvK0p2JjbpdESgvf60Lv9z69f4t9qAojwLdNgAGUT5SkOCRcfMYvXjlowg
link.rule.ids	230,312,786,891,4071
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adissertation&rft.genre=dissertation&rft.title=Interpretable+Architectures+and+Algorithms+for+Natural+Language+Processing&rft.DBID=3HK&rft.au=Yadav%2C+Rohan+Kumar&rft.date=2022&rft.pub=University+of+Agder&rft.externalDBID=n%2Fa&rft.externalDocID=11250_3034519