Data-Driven Methods for SMS-Based FAQ Retrieval

SMS text messaging is one of the most popular data applications on mobile phones these days. Other than personal communication, text messaging can also be used for various purposes like bill payment, banking, inquiry, etc. However these messages are extremely noisy and contain misspellings, abbrevia...

Full description

Saved in:
Bibliographic Details
Published inMultilingual Information Access in South Asian Languages pp. 104 - 118
Main Authors Bhattacharya, Sanmitra, Tran, Hung, Srinivasan, Padmini
Format Book Chapter
LanguageEnglish
Published Berlin, Heidelberg Springer Berlin Heidelberg 2013
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
Abstract SMS text messaging is one of the most popular data applications on mobile phones these days. Other than personal communication, text messaging can also be used for various purposes like bill payment, banking, inquiry, etc. However these messages are extremely noisy and contain misspellings, abbreviations, transliterations, etc. Keeping this in mind, FIRE 2011 introduced a new retrieval task called SMS-based FAQ retrieval in English, Hindi and Malayalam. Within-language and cross-language tasks were designed for this retrieval problem. As solutions we propose various data-driven retrieval techniques that includes noise reduction in the SMS queries and the FAQ corpora. Overall, we find that our methods work well for the retrieval experiments in the different languages. For English, the use of Google Spelling Suggestions and term expansion strategies improve retrieval scores. For Hindi and Malayalam retrieval experiments, we find that translation of queries and corpus to English increases retrieval accuracy.
AbstractList SMS text messaging is one of the most popular data applications on mobile phones these days. Other than personal communication, text messaging can also be used for various purposes like bill payment, banking, inquiry, etc. However these messages are extremely noisy and contain misspellings, abbreviations, transliterations, etc. Keeping this in mind, FIRE 2011 introduced a new retrieval task called SMS-based FAQ retrieval in English, Hindi and Malayalam. Within-language and cross-language tasks were designed for this retrieval problem. As solutions we propose various data-driven retrieval techniques that includes noise reduction in the SMS queries and the FAQ corpora. Overall, we find that our methods work well for the retrieval experiments in the different languages. For English, the use of Google Spelling Suggestions and term expansion strategies improve retrieval scores. For Hindi and Malayalam retrieval experiments, we find that translation of queries and corpus to English increases retrieval accuracy.
Author Bhattacharya, Sanmitra
Tran, Hung
Srinivasan, Padmini
Author_xml – sequence: 1
  givenname: Sanmitra
  surname: Bhattacharya
  fullname: Bhattacharya, Sanmitra
  email: sanmitra-bhattacharya@uiowa.edu
  organization: Department of Computer Science, The University of Iowa, Iowa City, USA
– sequence: 2
  givenname: Hung
  surname: Tran
  fullname: Tran, Hung
  email: hung-trv@uiowa.edu
  organization: Department of Computer Science, The University of Iowa, Iowa City, USA
– sequence: 3
  givenname: Padmini
  surname: Srinivasan
  fullname: Srinivasan, Padmini
  email: padmini-srinivasan@uiowa.edu
  organization: Department of Computer Science, The University of Iowa, Iowa City, USA
BookMark eNpVkMtOwzAQRQ0UibT0D1jkB0xnPE5sL0sfgNQKQbu38hhDoUpQXPX7SQsbVlc6V7rSuUMxaNqGhbhDuEcAM3HGSpK5VlIDWCOVR7wQ4x5TD89MXYoEc0RJpN3Vvy7PBiIBAiWd0XQjhjF-AoAyTiViMi8OhZx3uyM36ZoPH20d09B26Wa9kQ9F5DpdTl_TNz50Oz4W-1txHYp95PFfjsR2udjOnuTq5fF5Nl3JiEgoK60ot2yDYjKgGLMaoIJADGUNVGcuC-Cs5Qpz5uCgKutgcqUJ0QZNI6F-Z-N3t2veufNl235Fj-BPj_hez5PvBf3Z3p8eoR8CW04e
ContentType Book Chapter
Copyright Springer-Verlag Berlin Heidelberg 2013
Copyright_xml – notice: Springer-Verlag Berlin Heidelberg 2013
DOI 10.1007/978-3-642-40087-2_11
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9783642400872
3642400876
EISSN 1611-3349
Editor Subramaniam, L. Venkata
Bhattacharyya, Pushpak
Contractor, Danish
Rosso, Paolo
Majumder, Prasenjit
Mitra, Mandar
Editor_xml – sequence: 1
  givenname: Prasenjit
  surname: Majumder
  fullname: Majumder, Prasenjit
  email: prasenjit.majumder@gmail.com
– sequence: 2
  givenname: Mandar
  surname: Mitra
  fullname: Mitra, Mandar
  email: mandar.mitra@gmail.com
– sequence: 3
  givenname: Pushpak
  surname: Bhattacharyya
  fullname: Bhattacharyya, Pushpak
  email: pushbakbh@gmail.com
– sequence: 4
  givenname: L. Venkata
  surname: Subramaniam
  fullname: Subramaniam, L. Venkata
  email: lvsubram@in.ibm.com
– sequence: 5
  givenname: Danish
  surname: Contractor
  fullname: Contractor, Danish
  email: dcontrac@in.ibm.com
– sequence: 6
  givenname: Paolo
  surname: Rosso
  fullname: Rosso, Paolo
  email: prosso@dsic.upv.es
EndPage 118
GroupedDBID -DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ABMNI
ACGFS
ADCXD
AEFIE
ALMA_UNASSIGNED_HOLDINGS
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RIG
RNI
RSU
SVGTG
VI1
~02
ID FETCH-LOGICAL-s1131-c42368e8f2e3702e15d00c0f3e0bd03d595f0988ec16eef90cbdf76243118f43
ISBN 9783642400865
3642400868
ISSN 0302-9743
IngestDate Wed Nov 06 06:18:50 EST 2024
IsPeerReviewed true
IsScholarly true
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-s1131-c42368e8f2e3702e15d00c0f3e0bd03d595f0988ec16eef90cbdf76243118f43
PageCount 15
ParticipantIDs springer_books_10_1007_978_3_642_40087_2_11
PublicationCentury 2000
PublicationDate 2013
PublicationDateYYYYMMDD 2013-01-01
PublicationDate_xml – year: 2013
  text: 2013
PublicationDecade 2010
PublicationPlace Berlin, Heidelberg
PublicationPlace_xml – name: Berlin, Heidelberg
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSubtitle Second International Workshop, FIRE 2010, Gandhinagar, India, February 19-21, 2010 and Third International Workshop, FIRE 2011, Bombay, India, December 2-4, 2011, Revised Selected Papers
PublicationTitle Multilingual Information Access in South Asian Languages
PublicationYear 2013
Publisher Springer Berlin Heidelberg
Publisher_xml – name: Springer Berlin Heidelberg
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Nierstrasz, Oscar
Steffen, Bernhard
Kittler, Josef
Vardi, Moshe Y.
Weikum, Gerhard
Sudan, Madhu
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Pandu Rangan, C.
Kanade, Takeo
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, UK
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, UK
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
  organization: University of Bern, Bern, Switzerland
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology, Madras, India
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: University of Dortmund, Dortmund, Germany
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
  organization: Massachusetts Institute of Technology, USA
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 13
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
  organization: Rice University, Houston, USA
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany
SSID ssj0002792
ssj0000988219
Score 1.7299454
Snippet SMS text messaging is one of the most popular data applications on mobile phones these days. Other than personal communication, text messaging can also be used...
SourceID springer
SourceType Publisher
StartPage 104
SubjectTerms Relevance Judgment
Retrieval Experiment
Retrieval Problem
Retrieval Strategy
Text Messaging
Title Data-Driven Methods for SMS-Based FAQ Retrieval
URI http://link.springer.com/10.1007/978-3-642-40087-2_11
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Bb9MwFLa6ckE7DBgIGKAcuEUGO07T9MBh0FVV1VZMK2i3yE5s0QOp1KaTtl_Pe7aTpqxCKpeoSqsmeZ_93suzv-8R8hGislRpkVIZGUPjXDCq4sRQpVAMyqT5gCE5eTZPxj_iyW3vttNp71raVupT_nCQV_I_qMI5wBVZskcg2_wpnIDPgC8cAWE4_pX87pdZXYch3AqIZPKtlctoWIjYAALd1xImLvbHAwRwEk99YXJXFv8lq0oi6-peutpw-XtZrRs3jVHMBqatD25Yh8H1nju5cd98l1Z5tz3qhrKSdLhGFxrObHNqq_cQ3sxu6FcImEU4urwGTLGN150fwmgqvfky9asZ81VlN4mFdcOJ2v-0CxTYLGKvQFEXKMN_6HdZLkmM21lT1zmipnSBu4YXHucBtfPQCeouCqdz6r2u72DsAzh3Dv1RbGhvB4GLUbxan0YZMsNPwE11yZPLq8n0Z1OiYwN4_UC1Ox_YUWvRLUq5u0KqUH3XqRNz2j1Fi6Z56JKPFt5tPrN4Rk6R4xIg-QQM_Jx0dPmCnNUGD7zBz8nnFp6BxzMAPIMGzwDwDBo8X5LF6GrxbUx9iw264VxwmkM2naQ6NZEWfRZp3isYy5kRmqmCiaI36Bk0g855orWBiasKA_ET0k6emli8It1yVerXJNA8kjqJuBEqj_so5SVzYwYGEmTDdazekLB-4AznzCarBbPBPJnIwDyZNU-G5nl71K8vyNPdwHtHutV6q99DrlipDx7TP9Z_Xi8
link.rule.ids 779,780,784,793,27925
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Multilingual+Information+Access+in+South+Asian+Languages&rft.au=Bhattacharya%2C+Sanmitra&rft.au=Tran%2C+Hung&rft.au=Srinivasan%2C+Padmini&rft.atitle=Data-Driven+Methods+for+SMS-Based+FAQ+Retrieval&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2013-01-01&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783642400865&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=104&rft.epage=118&rft_id=info:doi/10.1007%2F978-3-642-40087-2_11
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon