MRCBert: A Machine Reading ComprehensionApproach for Unsupervised Summarization
When making an online purchase, it becomes important for the customer to read the product reviews carefully and make a decision based on that. However, reviews can be lengthy, may contain repeated, or sometimes irrelevant information that does not help in decision making. In this paper, we introduce...
Saved in:
Published in | arXiv.org |
---|---|
Main Authors | , , |
Format | Paper |
Language | English |
Published |
Ithaca
Cornell University Library, arXiv.org
01.05.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | When making an online purchase, it becomes important for the customer to read the product reviews carefully and make a decision based on that. However, reviews can be lengthy, may contain repeated, or sometimes irrelevant information that does not help in decision making. In this paper, we introduce MRCBert, a novel unsupervised method to generate summaries from product reviews. We leverage Machine Reading Comprehension, i.e. MRC, approach to extract relevant opinions and generate both rating-wise and aspect-wise summaries from reviews. Through MRCBert we show that we can obtain reasonable performance using existing models and transfer learning, which can be useful for learning under limited or low resource scenarios. We demonstrated our results on reviews of a product from the Electronics category in the Amazon Reviews dataset. Our approach is unsupervised as it does not require any domain-specific dataset, such as the product review dataset, for training or fine-tuning. Instead, we have used SQuAD v1.1 dataset only to fine-tune BERT for the MRC task. Since MRCBert does not require a task-specific dataset, it can be easily adapted and used in other domains. |
---|---|
AbstractList | When making an online purchase, it becomes important for the customer to read the product reviews carefully and make a decision based on that. However, reviews can be lengthy, may contain repeated, or sometimes irrelevant information that does not help in decision making. In this paper, we introduce MRCBert, a novel unsupervised method to generate summaries from product reviews. We leverage Machine Reading Comprehension, i.e. MRC, approach to extract relevant opinions and generate both rating-wise and aspect-wise summaries from reviews. Through MRCBert we show that we can obtain reasonable performance using existing models and transfer learning, which can be useful for learning under limited or low resource scenarios. We demonstrated our results on reviews of a product from the Electronics category in the Amazon Reviews dataset. Our approach is unsupervised as it does not require any domain-specific dataset, such as the product review dataset, for training or fine-tuning. Instead, we have used SQuAD v1.1 dataset only to fine-tune BERT for the MRC task. Since MRCBert does not require a task-specific dataset, it can be easily adapted and used in other domains. |
Author | Lim, Sze Chi Jain, Saurabh Tang, Guokai |
Author_xml | – sequence: 1 givenname: Saurabh surname: Jain fullname: Jain, Saurabh – sequence: 2 givenname: Guokai surname: Tang fullname: Tang, Guokai – sequence: 3 givenname: Sze surname: Lim middlename: Chi fullname: Lim, Sze Chi |
BookMark | eNqNy0ELgjAYxvERBVn5HQadhbU1lW4mRRcJrM4y8jUnua1NO_Tp26EP0Ok5_H_PAk2VVjBBAWVsE6VbSucodK4jhNA4oZyzAJ2LMt-DHXY4w4W4t1IBLkHUUj1wrntjoQXlpFaZMVZ7gBtt8U250YB9Swc1vox9L6z8iMGzFZo14ukg_O0SrY-Ha36K_Ps1ghuqTo9W-VRRTjcpieOEs__UF1dfQG4 |
ContentType | Paper |
Copyright | 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
Copyright_xml | – notice: 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
DBID | 8FE 8FG ABJCF ABUWG AFKRA AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ L6V M7S PIMPY PQEST PQQKQ PQUKI PRINS PTHSS |
DatabaseName | ProQuest SciTech Collection ProQuest Technology Collection Materials Science & Engineering Collection ProQuest Central (Alumni Edition) ProQuest Central ProQuest Central Essentials ProQuest Central Technology Collection ProQuest One Community College ProQuest Central Korea SciTech Premium Collection (Proquest) (PQ_SDU_P3) ProQuest Engineering Collection Engineering Database Publicly Available Content Database ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection |
DatabaseTitle | Publicly Available Content Database Engineering Database Technology Collection ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central ProQuest Engineering Collection ProQuest One Academic UKI Edition ProQuest Central Korea Materials Science & Engineering Collection ProQuest One Academic Engineering Collection |
DatabaseTitleList | Publicly Available Content Database |
Database_xml | – sequence: 1 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Physics |
EISSN | 2331-8422 |
Genre | Working Paper/Pre-Print |
GroupedDBID | 8FE 8FG ABJCF ABUWG AFKRA ALMA_UNASSIGNED_HOLDINGS AZQEC BENPR BGLVJ CCPQU DWQXO FRJ HCIFZ L6V M7S M~E PIMPY PQEST PQQKQ PQUKI PRINS PTHSS |
ID | FETCH-proquest_journals_25218066753 |
IEDL.DBID | BENPR |
IngestDate | Thu Oct 10 17:28:55 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-proquest_journals_25218066753 |
OpenAccessLink | https://www.proquest.com/docview/2521806675?pq-origsite=%requestingapplication% |
PQID | 2521806675 |
PQPubID | 2050157 |
ParticipantIDs | proquest_journals_2521806675 |
PublicationCentury | 2000 |
PublicationDate | 20210501 |
PublicationDateYYYYMMDD | 2021-05-01 |
PublicationDate_xml | – month: 05 year: 2021 text: 20210501 day: 01 |
PublicationDecade | 2020 |
PublicationPlace | Ithaca |
PublicationPlace_xml | – name: Ithaca |
PublicationTitle | arXiv.org |
PublicationYear | 2021 |
Publisher | Cornell University Library, arXiv.org |
Publisher_xml | – name: Cornell University Library, arXiv.org |
SSID | ssj0002672553 |
Score | 3.323971 |
SecondaryResourceType | preprint |
Snippet | When making an online purchase, it becomes important for the customer to read the product reviews carefully and make a decision based on that. However, reviews... |
SourceID | proquest |
SourceType | Aggregation Database |
SubjectTerms | Datasets Decision making Learning Product reviews Summaries |
Title | MRCBert: A Machine Reading ComprehensionApproach for Unsupervised Summarization |
URI | https://www.proquest.com/docview/2521806675 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfZ3La8MgGMA_1oTBbnuyR1eE7SqL5qHZZbQlWRmkK90KvZWkGnZK0yS97m-fit0Ogx5FEBX9Xv4-P4BHQUQei5LhOCAFDgThuPBiiknBPBEWSiNwnSicTaPJInhbhksbcGstVrmXiUZQi81ax8ifqNIzXBOZ4Uu9xbpqlH5dtSU0euBS5SlQB9xRMp3Nf6MsNGLKZvb_CVqjPdJTcGd5LZszOJLVORwb6HLdXsB7Nh-PZNM9oyHKDNQokYXakb6njfzSePmmGtqPv5GyMNGiane1vuGtFOjD5J7ZXMpLeEiTz_EE72exsielXf2ty78CR7n88hpQHPGIi5KWkvGgJH4eclnELKfSF4R55Q30D410e7j7Dk6oJjMMttcHp2t28l6p1q4YQI-nrwO7i6qVfSc_soqEjg |
link.rule.ids | 783,787,12779,21402,33387,33758,43614,43819 |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfZ3PT8MgFMdfdIvRmz_jj6kkeiUW-gPqxczFWnWdRrdkt6UMmp262nb_v0CYHkx2JiFA4H0fj8_jAdxKIvNYFgzHARE4kIRj4cUUE8E8GQqtCNwkCmejKJ0Er9Nw6gJujcMq1zbRGmq5nJsY-R3VOsMNkRk-VN_YVI0yr6uuhMY2dANfC43JFE-ef2MsNGLaY_b_mVmrHck-dD_yStUHsKXKQ9ixyOW8OYL37HPwqOr2HvVRZpFGhRzSjswprdXCwOXLsu--_Ubav0STsllV5nw3SqIvm3nmMimP4SZ5Gg9SvB7FzO2TZvY3K_8EOvrCr04BxRGPuCxooRgPCuLnIVciZjlVviTMK86gt6mn883N17CbjrPhbPgyeruAPWoYDQvw9aDT1it1qUW2FVd2JX8AmG2EAg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=MRCBert%3A+A+Machine+Reading+ComprehensionApproach+for+Unsupervised+Summarization&rft.jtitle=arXiv.org&rft.au=Jain%2C+Saurabh&rft.au=Tang%2C+Guokai&rft.au=Lim%2C+Sze+Chi&rft.date=2021-05-01&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422 |