Issues in Translating from Natural Language to SQL in a Domain-Independent Natural Language Interface to Databases
This paper deals with a domain-independent natural language interface to databases (NLIDB) for the Spanish language. This NLIDB had been previously tested for the Northwind and Pubs domains and had attained good performance (86% success rate). However, domain independence complicates the task of ach...
Saved in:
Published in | MICAI 2006: Advances in Artificial Intelligence pp. 922 - 931 |
---|---|
Main Authors | , , , , , |
Format | Book Chapter |
Language | English |
Published |
Berlin, Heidelberg
Springer Berlin Heidelberg
|
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | This paper deals with a domain-independent natural language interface to databases (NLIDB) for the Spanish language. This NLIDB had been previously tested for the Northwind and Pubs domains and had attained good performance (86% success rate). However, domain independence complicates the task of achieving high translation success, and to this end the ATIS (Air Travel Information System) database, which has been used by several natural language interfaces, was selected to conduct a new evaluation. The purpose of this evaluation was to asses the efficiency of the interface after the reconfiguration for another domain and to detect the problems that affect translation success. For the tests a corpus of queries was gathered and the results obtained showed that the interface can easily be reconfigured and that attained a 50% success rate. When the found problems concerning query translation were analyzed, wording deficiencies of some user queries and several errors in the synonym dictionary were discovered. After correcting these problems a second test was conducted, in which the interface attained a 61.4% success rate. These experiments showed that user training is necessary as well as a dialogue system that permits to clarify a query when it is deficiently formulated. |
---|---|
AbstractList | This paper deals with a domain-independent natural language interface to databases (NLIDB) for the Spanish language. This NLIDB had been previously tested for the Northwind and Pubs domains and had attained good performance (86% success rate). However, domain independence complicates the task of achieving high translation success, and to this end the ATIS (Air Travel Information System) database, which has been used by several natural language interfaces, was selected to conduct a new evaluation. The purpose of this evaluation was to asses the efficiency of the interface after the reconfiguration for another domain and to detect the problems that affect translation success. For the tests a corpus of queries was gathered and the results obtained showed that the interface can easily be reconfigured and that attained a 50% success rate. When the found problems concerning query translation were analyzed, wording deficiencies of some user queries and several errors in the synonym dictionary were discovered. After correcting these problems a second test was conducted, in which the interface attained a 61.4% success rate. These experiments showed that user training is necessary as well as a dialogue system that permits to clarify a query when it is deficiently formulated. |
Author | de Santos Aguilar, L. Rangel, Rodolfo A. Pazos Héctor J. Fraire, H. Cruz C., I. Cristina Juan J. González, B. Joaquín Pérez, O. |
Author_xml | – sequence: 1 givenname: B. surname: Juan J. González fullname: Juan J. González, B. email: jjgonzalezbarbosa@hotmail.com organization: Instituto Tecnológico de Cd., Madero, Mexico – sequence: 2 givenname: Rodolfo A. Pazos surname: Rangel fullname: Rangel, Rodolfo A. Pazos email: pazos@cenidet.edu.mx organization: Centro Nacional de Investigación y Desarrollo Tecnológico (CENIDET), – sequence: 3 givenname: I. Cristina surname: Cruz C. fullname: Cruz C., I. Cristina email: ircriscc@hotmail.com organization: Instituto Tecnológico de Cd., Madero, Mexico – sequence: 4 givenname: H. surname: Héctor J. Fraire fullname: Héctor J. Fraire, H. email: hfraire@prodigy.net.mx organization: Instituto Tecnológico de Cd., Madero, Mexico – sequence: 5 givenname: L. surname: de Santos Aguilar fullname: de Santos Aguilar, L. email: Santosaguilar13@itcm.edu.mx organization: Instituto Tecnológico de Cd., Madero, Mexico – sequence: 6 givenname: O. surname: Joaquín Pérez fullname: Joaquín Pérez, O. email: jperez@cenidet.edu.mx organization: Centro Nacional de Investigación y Desarrollo Tecnológico (CENIDET), |
BookMark | eNqVjz9PwzAUxA0UiRQ68QW8MgSe7fzzTKmIVCEhuluv4ESB9Lnyc74_LWKoxMRyN9zvdLq5mFEgL8StgnsFUD8oZXWpjXJNcyYWtm5MWUBhoWyqc5GpSqncmMJenGa6KmciAwM6t3VhrsSc-RMAdG11JmLLPHmWA8lNROIR00C97GLYyRdMU8RRrpH6CXsvU5Bvr-sji3IZdjhQ3tKH3_uDUPrLt5R87PD9p7nEhFtkzzfissOR_eLXr8Xd6mnz-JzzPh62fXTbEL7YKXDH0-7ktPkP-w0xr1jl |
ContentType | Book Chapter |
Copyright | Springer-Verlag Berlin Heidelberg 2006 |
Copyright_xml | – notice: Springer-Verlag Berlin Heidelberg 2006 |
DOI | 10.1007/11925231_88 |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Computer Science |
EISBN | 9783540490586 3540490582 |
EISSN | 1611-3349 |
Editor | Reyes-Garcia, Carlos Alberto Gelbukh, Alexander |
Editor_xml | – sequence: 1 givenname: Alexander surname: Gelbukh fullname: Gelbukh, Alexander email: gelbukh@gelbukh.com – sequence: 2 givenname: Carlos Alberto surname: Reyes-Garcia fullname: Reyes-Garcia, Carlos Alberto email: kargaxxi@inaoep.mx |
EndPage | 931 |
GroupedDBID | -DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ABMNI ACGFS ADCXD AEFIE ALMA_UNASSIGNED_HOLDINGS EJD F5P FEDTE HVGLF LAS LDH P2P RIG RNI RSU SVGTG VI1 ~02 |
ID | FETCH-springer_books_10_1007_11925231_883 |
ISBN | 9783540490265 3540490264 |
ISSN | 0302-9743 |
IngestDate | Tue Oct 01 19:58:35 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-springer_books_10_1007_11925231_883 |
ParticipantIDs | springer_books_10_1007_11925231_88 |
PublicationPlace | Berlin, Heidelberg |
PublicationPlace_xml | – name: Berlin, Heidelberg |
PublicationSeriesSubtitle | Lecture Notes in Artificial Intelligence |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSubtitle | 5th Mexican International Conference on Artificial Intelligence, Apizaco, Mexico, November 13-17, 2006. Proceedings |
PublicationTitle | MICAI 2006: Advances in Artificial Intelligence |
Publisher | Springer Berlin Heidelberg |
Publisher_xml | – name: Springer Berlin Heidelberg |
RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Nierstrasz, Oscar Tygar, Dough Steffen, Bernhard Kittler, Josef Vardi, Moshe Y. Weikum, Gerhard Sudan, Madhu Naor, Moni Mitchell, John C. Terzopoulos, Demetri Pandu Rangan, C. Kanade, Takeo Hutchison, David |
RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David organization: Lancaster University, UK – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo organization: Carnegie Mellon University, Pittsburgh, USA – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef organization: University of Surrey, Guildford, UK – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. organization: Cornell University, Ithaca, USA – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann organization: ETH Zurich, Switzerland – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. organization: Stanford University, CA, USA – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni organization: Weizmann Institute of Science, Rehovot, Israel – sequence: 8 givenname: Oscar surname: Nierstrasz fullname: Nierstrasz, Oscar organization: University of Bern, Switzerland – sequence: 9 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. organization: Indian Institute of Technology, Madras, India – sequence: 10 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: University of Dortmund, Germany – sequence: 11 givenname: Madhu surname: Sudan fullname: Sudan, Madhu organization: Massachusetts Institute of Technology, MA, USA – sequence: 12 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri organization: University of California, Los Angeles, USA – sequence: 13 givenname: Dough surname: Tygar fullname: Tygar, Dough organization: University of California, Berkeley, USA – sequence: 14 givenname: Moshe Y. surname: Vardi fullname: Vardi, Moshe Y. organization: Rice University, Houston, USA – sequence: 15 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard organization: Max-Planck Institute of Computer Science, Saarbruecken, Germany |
SSID | ssj0002792 ssj0000318928 |
Score | 2.5953512 |
Snippet | This paper deals with a domain-independent natural language interface to databases (NLIDB) for the Spanish language. This NLIDB had been previously tested for... |
SourceID | springer |
SourceType | Publisher |
StartPage | 922 |
SubjectTerms | Dialogue System Fare Class Natural Language Interface Noun Location Structure Query Language |
Title | Issues in Translating from Natural Language to SQL in a Domain-Independent Natural Language Interface to Databases |
URI | http://link.springer.com/10.1007/11925231_88 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS8NAEF5qPenBNyoqi3iRkmBMmnQPHmp9tFIK4gNvZZNuQNAE2vTSf-k_cmZnk0Yjvi4hhE0y2a-dnee3jB2BhgM7NXKsyA-E5cXgsLZOlGOF0ovjUSsQUulqi4HfffBunppPtdpbqWppmoV2NPuyr-Q_qMI1wBW7ZP-AbPFQuADngC8cAWE4fjJ-P4ZZaYehXqfdQ0ZAXwf3KJevq1vbY10ARDQac8bNolhmii1LduM6TWY6T-68UBj53J5nfbDWlSqvR-lLnDbaNhibs7SwwDvj6azR0aqlZ-PmYKAqkkLFdyn_jgkBfBNYxyYE3rXLv1Ha-g9F1ksmluVhXSd2vAwkEYL0TTwVbeS72z6OlWD1v8rnxOoVW_hm1fE61BnLSN95ITOJ6zWJj9ioyVnfpE8GaUZC5Dtc5AqvEv1sfEMOZoJbngB3s1lSry6sBeBNkXpVpP59JHV0iUTVqHRBfdPGOhC0ZlUWHqo1ccBeBs_eGbZaC2whEKBmF9uXN_3HIuyHOlSczgntkb-REl0kjGk_0sISb2VJeNNfiq2fpTdVcvjaNLpfZcvYLsOxjwWmbo3VVLLOVvKp5GYq19lSiflyg40Jev6c8BL0HKHnBkqeQ8mzlAP0OFbyKvTV8QX0eGcB_SY7vrq873St_DOG-KeaDHNG7dK3ulusnqSJ2mZcIFObOzrxw8DzYqWEF0Yj6TSVG4ATpLwddvjz83Z_M2iP1bPxVO2D9ZmFBwbRd-tlgxU |
link.rule.ids | 785,786,790,799,27958 |
linkProvider | Library Specific Holdings |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=MICAI+2006%3A+Advances+in+Artificial+Intelligence&rft.au=Juan+J.+Gonz%C3%A1lez%2C+B.&rft.au=Rangel%2C+Rodolfo+A.+Pazos&rft.au=Cruz+C.%2C+I.+Cristina&rft.au=H%C3%A9ctor+J.+Fraire%2C+H.&rft.atitle=Issues+in+Translating+from+Natural+Language+to+SQL+in+a+Domain-Independent+Natural+Language+Interface+to+Databases&rft.series=Lecture+Notes+in+Computer+Science&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783540490265&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=922&rft.epage=931&rft_id=info:doi/10.1007%2F11925231_88 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon |