Exploring and Adapting Chinese GPT to Pinyin Input Method

While GPT has become the de-facto method for text generation tasks, its application to pinyin input method remains unexplored. In this work, we make the first exploration to leverage Chinese GPT for pinyin input method. We find that a frozen GPT achieves state-of-the-art performance on perfect pinyi...

Full description

Saved in:
Bibliographic Details
Main Authors Tan, Minghuan, Dai, Yong, Tang, Duyu, Feng, Zhangyin, Huang, Guoping, Jiang, Jing, Li, Jiwei, Shi, Shuming
Format Journal Article
LanguageEnglish
Published 01.03.2022
Subjects
Online AccessGet full text

Cover

Loading…
Abstract While GPT has become the de-facto method for text generation tasks, its application to pinyin input method remains unexplored. In this work, we make the first exploration to leverage Chinese GPT for pinyin input method. We find that a frozen GPT achieves state-of-the-art performance on perfect pinyin. However, the performance drops dramatically when the input includes abbreviated pinyin. A reason is that an abbreviated pinyin can be mapped to many perfect pinyin, which links to even larger number of Chinese characters. We mitigate this issue with two strategies, including enriching the context with pinyin and optimizing the training process to help distinguish homophones. To further facilitate the evaluation of pinyin input method, we create a dataset consisting of 270K instances from 15 domains. Results show that our approach improves performance on abbreviated pinyin across all domains. Model analysis demonstrates that both strategies contribute to the performance boost.
AbstractList While GPT has become the de-facto method for text generation tasks, its application to pinyin input method remains unexplored. In this work, we make the first exploration to leverage Chinese GPT for pinyin input method. We find that a frozen GPT achieves state-of-the-art performance on perfect pinyin. However, the performance drops dramatically when the input includes abbreviated pinyin. A reason is that an abbreviated pinyin can be mapped to many perfect pinyin, which links to even larger number of Chinese characters. We mitigate this issue with two strategies, including enriching the context with pinyin and optimizing the training process to help distinguish homophones. To further facilitate the evaluation of pinyin input method, we create a dataset consisting of 270K instances from 15 domains. Results show that our approach improves performance on abbreviated pinyin across all domains. Model analysis demonstrates that both strategies contribute to the performance boost.
Author Jiang, Jing
Tang, Duyu
Li, Jiwei
Shi, Shuming
Huang, Guoping
Feng, Zhangyin
Tan, Minghuan
Dai, Yong
Author_xml – sequence: 1
  givenname: Minghuan
  surname: Tan
  fullname: Tan, Minghuan
– sequence: 2
  givenname: Yong
  surname: Dai
  fullname: Dai, Yong
– sequence: 3
  givenname: Duyu
  surname: Tang
  fullname: Tang, Duyu
– sequence: 4
  givenname: Zhangyin
  surname: Feng
  fullname: Feng, Zhangyin
– sequence: 5
  givenname: Guoping
  surname: Huang
  fullname: Huang, Guoping
– sequence: 6
  givenname: Jing
  surname: Jiang
  fullname: Jiang, Jing
– sequence: 7
  givenname: Jiwei
  surname: Li
  fullname: Li, Jiwei
– sequence: 8
  givenname: Shuming
  surname: Shi
  fullname: Shi, Shuming
BackLink https://doi.org/10.48550/arXiv.2203.00249$$DView paper in arXiv
BookMark eNotj7tuwjAYRj3QAWgfoBN-gaR_fY1HFFGKBCpD9siX32AJnCikFbx9Be109C1H35mRSe4yEvL6DqWopIQ3O1zTT8kY8BKACTMlZnXtT92Q8oHaHOgy2H68j_qYMl6QrvcNHTu6T_mWMt3k_nukOxyPXXgmT9GeLvjyzzlpPlZN_Vlsv9aberktrNKmQG-5sgYUi8EEbZBJpxQ3DqAKUYFGjF46X3ktYsUwaOmEl9ohFxp95HOy-NM-vrf9kM52uLX3hvbRwH8BHeVDBw
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by/4.0
DBID AKY
GOX
DOI 10.48550/arxiv.2203.00249
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2203_00249
GroupedDBID AKY
GOX
ID FETCH-LOGICAL-a679-eca36a9062fd9d79e25b6639b008df607eefc5bc8c74f82ed75b4c57be347ecf3
IEDL.DBID GOX
IngestDate Mon Jan 08 05:38:16 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a679-eca36a9062fd9d79e25b6639b008df607eefc5bc8c74f82ed75b4c57be347ecf3
OpenAccessLink https://arxiv.org/abs/2203.00249
ParticipantIDs arxiv_primary_2203_00249
PublicationCentury 2000
PublicationDate 2022-03-01
PublicationDateYYYYMMDD 2022-03-01
PublicationDate_xml – month: 03
  year: 2022
  text: 2022-03-01
  day: 01
PublicationDecade 2020
PublicationYear 2022
Score 1.8338068
SecondaryResourceType preprint
Snippet While GPT has become the de-facto method for text generation tasks, its application to pinyin input method remains unexplored. In this work, we make the first...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Artificial Intelligence
Computer Science - Computation and Language
Title Exploring and Adapting Chinese GPT to Pinyin Input Method
URI https://arxiv.org/abs/2203.00249
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV27TgMxEFwlqWgQCFB4ygWtReLz68oIkQSkQIogpTvZvrV0zRElFwR_j-07Hg2t7cbjYne04xmA2wzNOHTGGXXOWspLJaj1hlEXTSddqQLPjRPdxbOcv_KntVj3gHz_hTHbj-q99Qe2uzvGkgFpoAh96DMWJVuzl3U7nExWXN3533Ohx0xLf4rE9AgOu-6OTNrnOIYe1ieQ_-jcSKDtZFKaTdQakxhdjTsks-WKNG9kWdWfVU0e682-IYsU7HwKq-nD6n5Ou8QCaqTKKTqTSROdf32ZlypHJmyo6NF2UJdejhSid8I67RT3mmFAxnInlMWMK3Q-O4NBIP04BGLHUqPIpFNacp0zrWPwE_cjgbm1Wp_DMN2z2LSmFEWEoEgQXPy_dQkHLMr3k4bqCgbNdo_Xoag29iYh-wWK2XaK
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Exploring+and+Adapting+Chinese+GPT+to+Pinyin+Input+Method&rft.au=Tan%2C+Minghuan&rft.au=Dai%2C+Yong&rft.au=Tang%2C+Duyu&rft.au=Feng%2C+Zhangyin&rft.date=2022-03-01&rft_id=info:doi/10.48550%2Farxiv.2203.00249&rft.externalDocID=2203_00249