Data-Driven Methods for SMS-Based FAQ Retrieval
SMS text messaging is one of the most popular data applications on mobile phones these days. Other than personal communication, text messaging can also be used for various purposes like bill payment, banking, inquiry, etc. However these messages are extremely noisy and contain misspellings, abbrevia...
Saved in:
Published in | Multilingual Information Access in South Asian Languages pp. 104 - 118 |
---|---|
Main Authors | , , |
Format | Book Chapter |
Language | English |
Published |
Berlin, Heidelberg
Springer Berlin Heidelberg
2013
|
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | SMS text messaging is one of the most popular data applications on mobile phones these days. Other than personal communication, text messaging can also be used for various purposes like bill payment, banking, inquiry, etc. However these messages are extremely noisy and contain misspellings, abbreviations, transliterations, etc. Keeping this in mind, FIRE 2011 introduced a new retrieval task called SMS-based FAQ retrieval in English, Hindi and Malayalam. Within-language and cross-language tasks were designed for this retrieval problem. As solutions we propose various data-driven retrieval techniques that includes noise reduction in the SMS queries and the FAQ corpora. Overall, we find that our methods work well for the retrieval experiments in the different languages. For English, the use of Google Spelling Suggestions and term expansion strategies improve retrieval scores. For Hindi and Malayalam retrieval experiments, we find that translation of queries and corpus to English increases retrieval accuracy. |
---|---|
AbstractList | SMS text messaging is one of the most popular data applications on mobile phones these days. Other than personal communication, text messaging can also be used for various purposes like bill payment, banking, inquiry, etc. However these messages are extremely noisy and contain misspellings, abbreviations, transliterations, etc. Keeping this in mind, FIRE 2011 introduced a new retrieval task called SMS-based FAQ retrieval in English, Hindi and Malayalam. Within-language and cross-language tasks were designed for this retrieval problem. As solutions we propose various data-driven retrieval techniques that includes noise reduction in the SMS queries and the FAQ corpora. Overall, we find that our methods work well for the retrieval experiments in the different languages. For English, the use of Google Spelling Suggestions and term expansion strategies improve retrieval scores. For Hindi and Malayalam retrieval experiments, we find that translation of queries and corpus to English increases retrieval accuracy. |
Author | Bhattacharya, Sanmitra Tran, Hung Srinivasan, Padmini |
Author_xml | – sequence: 1 givenname: Sanmitra surname: Bhattacharya fullname: Bhattacharya, Sanmitra email: sanmitra-bhattacharya@uiowa.edu organization: Department of Computer Science, The University of Iowa, Iowa City, USA – sequence: 2 givenname: Hung surname: Tran fullname: Tran, Hung email: hung-trv@uiowa.edu organization: Department of Computer Science, The University of Iowa, Iowa City, USA – sequence: 3 givenname: Padmini surname: Srinivasan fullname: Srinivasan, Padmini email: padmini-srinivasan@uiowa.edu organization: Department of Computer Science, The University of Iowa, Iowa City, USA |
BookMark | eNpVkMtOwzAQRQ0UibT0D1jkB0xnPE5sL0sfgNQKQbu38hhDoUpQXPX7SQsbVlc6V7rSuUMxaNqGhbhDuEcAM3HGSpK5VlIDWCOVR7wQ4x5TD89MXYoEc0RJpN3Vvy7PBiIBAiWd0XQjhjF-AoAyTiViMi8OhZx3uyM36ZoPH20d09B26Wa9kQ9F5DpdTl_TNz50Oz4W-1txHYp95PFfjsR2udjOnuTq5fF5Nl3JiEgoK60ot2yDYjKgGLMaoIJADGUNVGcuC-Cs5Qpz5uCgKutgcqUJ0QZNI6F-Z-N3t2veufNl235Fj-BPj_hez5PvBf3Z3p8eoR8CW04e |
ContentType | Book Chapter |
Copyright | Springer-Verlag Berlin Heidelberg 2013 |
Copyright_xml | – notice: Springer-Verlag Berlin Heidelberg 2013 |
DOI | 10.1007/978-3-642-40087-2_11 |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 9783642400872 3642400876 |
EISSN | 1611-3349 |
Editor | Subramaniam, L. Venkata Bhattacharyya, Pushpak Contractor, Danish Rosso, Paolo Majumder, Prasenjit Mitra, Mandar |
Editor_xml | – sequence: 1 givenname: Prasenjit surname: Majumder fullname: Majumder, Prasenjit email: prasenjit.majumder@gmail.com – sequence: 2 givenname: Mandar surname: Mitra fullname: Mitra, Mandar email: mandar.mitra@gmail.com – sequence: 3 givenname: Pushpak surname: Bhattacharyya fullname: Bhattacharyya, Pushpak email: pushbakbh@gmail.com – sequence: 4 givenname: L. Venkata surname: Subramaniam fullname: Subramaniam, L. Venkata email: lvsubram@in.ibm.com – sequence: 5 givenname: Danish surname: Contractor fullname: Contractor, Danish email: dcontrac@in.ibm.com – sequence: 6 givenname: Paolo surname: Rosso fullname: Rosso, Paolo email: prosso@dsic.upv.es |
EndPage | 118 |
GroupedDBID | -DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ABMNI ACGFS ADCXD AEFIE ALMA_UNASSIGNED_HOLDINGS EJD F5P FEDTE HVGLF LAS LDH P2P RIG RNI RSU SVGTG VI1 ~02 |
ID | FETCH-LOGICAL-s1131-c42368e8f2e3702e15d00c0f3e0bd03d595f0988ec16eef90cbdf76243118f43 |
ISBN | 9783642400865 3642400868 |
ISSN | 0302-9743 |
IngestDate | Wed Nov 06 06:18:50 EST 2024 |
IsPeerReviewed | true |
IsScholarly | true |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-s1131-c42368e8f2e3702e15d00c0f3e0bd03d595f0988ec16eef90cbdf76243118f43 |
PageCount | 15 |
ParticipantIDs | springer_books_10_1007_978_3_642_40087_2_11 |
PublicationCentury | 2000 |
PublicationDate | 2013 |
PublicationDateYYYYMMDD | 2013-01-01 |
PublicationDate_xml | – year: 2013 text: 2013 |
PublicationDecade | 2010 |
PublicationPlace | Berlin, Heidelberg |
PublicationPlace_xml | – name: Berlin, Heidelberg |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSubtitle | Second International Workshop, FIRE 2010, Gandhinagar, India, February 19-21, 2010 and Third International Workshop, FIRE 2011, Bombay, India, December 2-4, 2011, Revised Selected Papers |
PublicationTitle | Multilingual Information Access in South Asian Languages |
PublicationYear | 2013 |
Publisher | Springer Berlin Heidelberg |
Publisher_xml | – name: Springer Berlin Heidelberg |
RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Nierstrasz, Oscar Steffen, Bernhard Kittler, Josef Vardi, Moshe Y. Weikum, Gerhard Sudan, Madhu Naor, Moni Mitchell, John C. Terzopoulos, Demetri Pandu Rangan, C. Kanade, Takeo Hutchison, David Tygar, Doug |
RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David organization: Lancaster University, Lancaster, UK – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo organization: Carnegie Mellon University, Pittsburgh, USA – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef organization: University of Surrey, Guildford, UK – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. organization: Cornell University, Ithaca, USA – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann organization: ETH Zurich, Zurich, Switzerland – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. organization: Stanford University, Stanford, USA – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni organization: Weizmann Institute of Science, Rehovot, Israel – sequence: 8 givenname: Oscar surname: Nierstrasz fullname: Nierstrasz, Oscar organization: University of Bern, Bern, Switzerland – sequence: 9 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. organization: Indian Institute of Technology, Madras, India – sequence: 10 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: University of Dortmund, Dortmund, Germany – sequence: 11 givenname: Madhu surname: Sudan fullname: Sudan, Madhu organization: Massachusetts Institute of Technology, USA – sequence: 12 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri organization: University of California, Los Angeles, USA – sequence: 13 givenname: Doug surname: Tygar fullname: Tygar, Doug organization: University of California, Berkeley, USA – sequence: 14 givenname: Moshe Y. surname: Vardi fullname: Vardi, Moshe Y. organization: Rice University, Houston, USA – sequence: 15 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany |
SSID | ssj0002792 ssj0000988219 |
Score | 1.7299454 |
Snippet | SMS text messaging is one of the most popular data applications on mobile phones these days. Other than personal communication, text messaging can also be used... |
SourceID | springer |
SourceType | Publisher |
StartPage | 104 |
SubjectTerms | Relevance Judgment Retrieval Experiment Retrieval Problem Retrieval Strategy Text Messaging |
Title | Data-Driven Methods for SMS-Based FAQ Retrieval |
URI | http://link.springer.com/10.1007/978-3-642-40087-2_11 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Bb9MwFLa6ckE7DBgIGKAcuEUGO07T9MBh0FVV1VZMK2i3yE5s0QOp1KaTtl_Pe7aTpqxCKpeoSqsmeZ_93suzv-8R8hGislRpkVIZGUPjXDCq4sRQpVAMyqT5gCE5eTZPxj_iyW3vttNp71raVupT_nCQV_I_qMI5wBVZskcg2_wpnIDPgC8cAWE4_pX87pdZXYch3AqIZPKtlctoWIjYAALd1xImLvbHAwRwEk99YXJXFv8lq0oi6-peutpw-XtZrRs3jVHMBqatD25Yh8H1nju5cd98l1Z5tz3qhrKSdLhGFxrObHNqq_cQ3sxu6FcImEU4urwGTLGN150fwmgqvfky9asZ81VlN4mFdcOJ2v-0CxTYLGKvQFEXKMN_6HdZLkmM21lT1zmipnSBu4YXHucBtfPQCeouCqdz6r2u72DsAzh3Dv1RbGhvB4GLUbxan0YZMsNPwE11yZPLq8n0Z1OiYwN4_UC1Ox_YUWvRLUq5u0KqUH3XqRNz2j1Fi6Z56JKPFt5tPrN4Rk6R4xIg-QQM_Jx0dPmCnNUGD7zBz8nnFp6BxzMAPIMGzwDwDBo8X5LF6GrxbUx9iw264VxwmkM2naQ6NZEWfRZp3isYy5kRmqmCiaI36Bk0g855orWBiasKA_ET0k6emli8It1yVerXJNA8kjqJuBEqj_so5SVzYwYGEmTDdazekLB-4AznzCarBbPBPJnIwDyZNU-G5nl71K8vyNPdwHtHutV6q99DrlipDx7TP9Z_Xi8 |
link.rule.ids | 779,780,784,793,27925 |
linkProvider | Library Specific Holdings |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Multilingual+Information+Access+in+South+Asian+Languages&rft.au=Bhattacharya%2C+Sanmitra&rft.au=Tran%2C+Hung&rft.au=Srinivasan%2C+Padmini&rft.atitle=Data-Driven+Methods+for+SMS-Based+FAQ+Retrieval&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2013-01-01&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783642400865&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=104&rft.epage=118&rft_id=info:doi/10.1007%2F978-3-642-40087-2_11 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon |