Natural Language Decomposition and Interpretation of Complex Utterances

Published in arXiv.org
Main Authors Jhamtani, Harsh; Fang, Hao; Xia, Patrick; Levy, Eran; Andreas, Jacob; Van Durme, Ben
Format Paper
Language English
Published Ithaca: Cornell University Library, arXiv.org, 15.05.2023
Abstract Natural language interfaces often require supervised data to translate user requests into programs, database queries, or other structured intent representations. During data collection, it can be difficult to anticipate and formalize the full range of user needs -- for example, in a system designed to handle simple requests (like \(\textit{find my meetings tomorrow}\) or \(\textit{move my meeting with my manager to noon}\)), users may also express more elaborate requests (like \(\textit{swap all my calls on Monday and Tuesday}\)). We introduce an approach for equipping a simple language-to-code model to handle complex utterances via a process of hierarchical natural language decomposition. Our approach uses a pre-trained language model to decompose a complex utterance into a sequence of smaller natural language steps, then interprets each step using the language-to-code model. To test our approach, we collect and release DeCU -- a new NL-to-program benchmark to evaluate Decomposition of Complex Utterances. Experiments show that the proposed approach enables the interpretation of complex utterances with almost no complex training data, while outperforming standard few-shot prompting approaches.
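The decompose-then-interpret idea in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names, the `Step` record, and the program notation (`find_events`, `move_events`) are hypothetical and do not come from the paper or the DeCU benchmark. The two toy functions are lookup-table stand-ins for the pre-trained language model and the language-to-code model, and the sketch flattens the hierarchical process into a single pass over the steps.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical type aliases; neither name comes from the paper.
Decomposer = Callable[[str], List[str]]  # complex utterance -> simpler NL steps
Interpreter = Callable[[str], str]       # one simple NL step -> program fragment


@dataclass
class Step:
    text: str  # the natural language step
    code: str  # the program fragment produced for that step


def decompose_and_interpret(
    utterance: str, decompose: Decomposer, interpret: Interpreter
) -> List[Step]:
    """Split the utterance into steps, then interpret each step on its own."""
    return [Step(text=s, code=interpret(s)) for s in decompose(utterance)]


def toy_decompose(utterance: str) -> List[str]:
    """Stand-in for the pre-trained language model (a fixed lookup table)."""
    table = {
        "swap all my calls on Monday and Tuesday": [
            "find my calls on Monday",
            "find my calls on Tuesday",
            "move the Monday calls to Tuesday",
            "move the Tuesday calls to Monday",
        ]
    }
    # Simple requests are passed through as a single step.
    return table.get(utterance, [utterance])


def toy_interpret(step: str) -> str:
    """Stand-in for the language-to-code model (made-up program notation)."""
    if step.startswith("find "):
        return f'find_events(query="{step[5:]}")'
    if step.startswith("move "):
        return f'move_events(request="{step[5:]}")'
    return f'unsupported(request="{step}")'


if __name__ == "__main__":
    request = "swap all my calls on Monday and Tuesday"
    for step in decompose_and_interpret(request, toy_decompose, toy_interpret):
        print(f"{step.text} -> {step.code}")
```

The interpreter only ever sees short, simple steps. This mirrors the abstract's claim that a model trained on simple requests can be reused for complex ones without much complex training data. In a real system both callables would wrap model calls rather than lookup tables.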
Copyright 2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Discipline Physics
EISSN 2331-8422
Genre Working Paper/Pre-Print
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Subjects Data collection; Decomposition; Language; Natural language; Natural language (computers)
URI https://www.proquest.com/docview/2814209593