Issues in Translating from Natural Language to SQL in a Domain-Independent Natural Language Interface to Databases

This paper deals with a domain-independent natural language interface to databases (NLIDB) for the Spanish language. This NLIDB had been previously tested for the Northwind and Pubs domains and had attained good performance (86% success rate). However, domain independence complicates the task of ach...

Full description

Saved in:
Bibliographic Details
Published inMICAI 2006: Advances in Artificial Intelligence pp. 922 - 931
Main Authors Juan J. González, B., Rangel, Rodolfo A. Pazos, Cruz C., I. Cristina, Héctor J. Fraire, H., de Santos Aguilar, L., Joaquín Pérez, O.
Format Book Chapter
LanguageEnglish
Published Berlin, Heidelberg Springer Berlin Heidelberg
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
Abstract This paper deals with a domain-independent natural language interface to databases (NLIDB) for the Spanish language. This NLIDB had been previously tested for the Northwind and Pubs domains and had attained good performance (86% success rate). However, domain independence complicates the task of achieving high translation success, and to this end the ATIS (Air Travel Information System) database, which has been used by several natural language interfaces, was selected to conduct a new evaluation. The purpose of this evaluation was to asses the efficiency of the interface after the reconfiguration for another domain and to detect the problems that affect translation success. For the tests a corpus of queries was gathered and the results obtained showed that the interface can easily be reconfigured and that attained a 50% success rate. When the found problems concerning query translation were analyzed, wording deficiencies of some user queries and several errors in the synonym dictionary were discovered. After correcting these problems a second test was conducted, in which the interface attained a 61.4% success rate. These experiments showed that user training is necessary as well as a dialogue system that permits to clarify a query when it is deficiently formulated.
AbstractList This paper deals with a domain-independent natural language interface to databases (NLIDB) for the Spanish language. This NLIDB had been previously tested for the Northwind and Pubs domains and had attained good performance (86% success rate). However, domain independence complicates the task of achieving high translation success, and to this end the ATIS (Air Travel Information System) database, which has been used by several natural language interfaces, was selected to conduct a new evaluation. The purpose of this evaluation was to asses the efficiency of the interface after the reconfiguration for another domain and to detect the problems that affect translation success. For the tests a corpus of queries was gathered and the results obtained showed that the interface can easily be reconfigured and that attained a 50% success rate. When the found problems concerning query translation were analyzed, wording deficiencies of some user queries and several errors in the synonym dictionary were discovered. After correcting these problems a second test was conducted, in which the interface attained a 61.4% success rate. These experiments showed that user training is necessary as well as a dialogue system that permits to clarify a query when it is deficiently formulated.
Author de Santos Aguilar, L.
Rangel, Rodolfo A. Pazos
Héctor J. Fraire, H.
Cruz C., I. Cristina
Juan J. González, B.
Joaquín Pérez, O.
Author_xml – sequence: 1
  givenname: B.
  surname: Juan J. González
  fullname: Juan J. González, B.
  email: jjgonzalezbarbosa@hotmail.com
  organization: Instituto Tecnológico de Cd., Madero, Mexico
– sequence: 2
  givenname: Rodolfo A. Pazos
  surname: Rangel
  fullname: Rangel, Rodolfo A. Pazos
  email: pazos@cenidet.edu.mx
  organization: Centro Nacional de Investigación y Desarrollo Tecnológico (CENIDET),  
– sequence: 3
  givenname: I. Cristina
  surname: Cruz C.
  fullname: Cruz C., I. Cristina
  email: ircriscc@hotmail.com
  organization: Instituto Tecnológico de Cd., Madero, Mexico
– sequence: 4
  givenname: H.
  surname: Héctor J. Fraire
  fullname: Héctor J. Fraire, H.
  email: hfraire@prodigy.net.mx
  organization: Instituto Tecnológico de Cd., Madero, Mexico
– sequence: 5
  givenname: L.
  surname: de Santos Aguilar
  fullname: de Santos Aguilar, L.
  email: Santosaguilar13@itcm.edu.mx
  organization: Instituto Tecnológico de Cd., Madero, Mexico
– sequence: 6
  givenname: O.
  surname: Joaquín Pérez
  fullname: Joaquín Pérez, O.
  email: jperez@cenidet.edu.mx
  organization: Centro Nacional de Investigación y Desarrollo Tecnológico (CENIDET),  
BookMark eNqVjz9PwzAUxA0UiRQ68QW8MgSe7fzzTKmIVCEhuluv4ESB9Lnyc74_LWKoxMRyN9zvdLq5mFEgL8StgnsFUD8oZXWpjXJNcyYWtm5MWUBhoWyqc5GpSqncmMJenGa6KmciAwM6t3VhrsSc-RMAdG11JmLLPHmWA8lNROIR00C97GLYyRdMU8RRrpH6CXsvU5Bvr-sji3IZdjhQ3tKH3_uDUPrLt5R87PD9p7nEhFtkzzfissOR_eLXr8Xd6mnz-JzzPh62fXTbEL7YKXDH0-7ktPkP-w0xr1jl
ContentType Book Chapter
Copyright Springer-Verlag Berlin Heidelberg 2006
Copyright_xml – notice: Springer-Verlag Berlin Heidelberg 2006
DOI 10.1007/11925231_88
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISBN 9783540490586
3540490582
EISSN 1611-3349
Editor Reyes-Garcia, Carlos Alberto
Gelbukh, Alexander
Editor_xml – sequence: 1
  givenname: Alexander
  surname: Gelbukh
  fullname: Gelbukh, Alexander
  email: gelbukh@gelbukh.com
– sequence: 2
  givenname: Carlos Alberto
  surname: Reyes-Garcia
  fullname: Reyes-Garcia, Carlos Alberto
  email: kargaxxi@inaoep.mx
EndPage 931
GroupedDBID -DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ABMNI
ACGFS
ADCXD
AEFIE
ALMA_UNASSIGNED_HOLDINGS
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RIG
RNI
RSU
SVGTG
VI1
~02
ID FETCH-springer_books_10_1007_11925231_883
ISBN 9783540490265
3540490264
ISSN 0302-9743
IngestDate Tue Oct 01 19:58:35 EDT 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel OpenURL
MergedId FETCHMERGED-springer_books_10_1007_11925231_883
ParticipantIDs springer_books_10_1007_11925231_88
PublicationPlace Berlin, Heidelberg
PublicationPlace_xml – name: Berlin, Heidelberg
PublicationSeriesSubtitle Lecture Notes in Artificial Intelligence
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSubtitle 5th Mexican International Conference on Artificial Intelligence, Apizaco, Mexico, November 13-17, 2006. Proceedings
PublicationTitle MICAI 2006: Advances in Artificial Intelligence
Publisher Springer Berlin Heidelberg
Publisher_xml – name: Springer Berlin Heidelberg
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Nierstrasz, Oscar
Tygar, Dough
Steffen, Bernhard
Kittler, Josef
Vardi, Moshe Y.
Weikum, Gerhard
Sudan, Madhu
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Pandu Rangan, C.
Kanade, Takeo
Hutchison, David
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, UK
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, UK
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, CA, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
  organization: University of Bern, Switzerland
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology, Madras, India
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: University of Dortmund, Germany
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
  organization: Massachusetts Institute of Technology, MA, USA
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 13
  givenname: Dough
  surname: Tygar
  fullname: Tygar, Dough
  organization: University of California, Berkeley, USA
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
  organization: Rice University, Houston, USA
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max-Planck Institute of Computer Science, Saarbruecken, Germany
SSID ssj0002792
ssj0000318928
Score 2.5953512
Snippet This paper deals with a domain-independent natural language interface to databases (NLIDB) for the Spanish language. This NLIDB had been previously tested for...
SourceID springer
SourceType Publisher
StartPage 922
SubjectTerms Dialogue System
Fare Class
Natural Language Interface
Noun Location
Structure Query Language
Title Issues in Translating from Natural Language to SQL in a Domain-Independent Natural Language Interface to Databases
URI http://link.springer.com/10.1007/11925231_88
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS8NAEF5qPenBNyoqi3iRkmBMmnQPHmp9tFIK4gNvZZNuQNAE2vTSf-k_cmZnk0Yjvi4hhE0y2a-dnee3jB2BhgM7NXKsyA-E5cXgsLZOlGOF0ovjUSsQUulqi4HfffBunppPtdpbqWppmoV2NPuyr-Q_qMI1wBW7ZP-AbPFQuADngC8cAWE4fjJ-P4ZZaYehXqfdQ0ZAXwf3KJevq1vbY10ARDQac8bNolhmii1LduM6TWY6T-68UBj53J5nfbDWlSqvR-lLnDbaNhibs7SwwDvj6azR0aqlZ-PmYKAqkkLFdyn_jgkBfBNYxyYE3rXLv1Ha-g9F1ksmluVhXSd2vAwkEYL0TTwVbeS72z6OlWD1v8rnxOoVW_hm1fE61BnLSN95ITOJ6zWJj9ioyVnfpE8GaUZC5Dtc5AqvEv1sfEMOZoJbngB3s1lSry6sBeBNkXpVpP59JHV0iUTVqHRBfdPGOhC0ZlUWHqo1ccBeBs_eGbZaC2whEKBmF9uXN_3HIuyHOlSczgntkb-REl0kjGk_0sISb2VJeNNfiq2fpTdVcvjaNLpfZcvYLsOxjwWmbo3VVLLOVvKp5GYq19lSiflyg40Jev6c8BL0HKHnBkqeQ8mzlAP0OFbyKvTV8QX0eGcB_SY7vrq873St_DOG-KeaDHNG7dK3ulusnqSJ2mZcIFObOzrxw8DzYqWEF0Yj6TSVG4ATpLwddvjz83Z_M2iP1bPxVO2D9ZmFBwbRd-tlgxU
link.rule.ids 785,786,790,799,27958
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=MICAI+2006%3A+Advances+in+Artificial+Intelligence&rft.au=Juan+J.+Gonz%C3%A1lez%2C+B.&rft.au=Rangel%2C+Rodolfo+A.+Pazos&rft.au=Cruz+C.%2C+I.+Cristina&rft.au=H%C3%A9ctor+J.+Fraire%2C+H.&rft.atitle=Issues+in+Translating+from+Natural+Language+to+SQL+in+a+Domain-Independent+Natural+Language+Interface+to+Databases&rft.series=Lecture+Notes+in+Computer+Science&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783540490265&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=922&rft.epage=931&rft_id=info:doi/10.1007%2F11925231_88
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon