Exploring and Adapting Chinese GPT to Pinyin Input Method
While GPT has become the de-facto method for text generation tasks, its application to pinyin input method remains unexplored. In this work, we make the first exploration to leverage Chinese GPT for pinyin input method. We find that a frozen GPT achieves state-of-the-art performance on perfect pinyi...
Saved in:
Main Authors | , , , , , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
01.03.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | While GPT has become the de-facto method for text generation tasks, its
application to pinyin input method remains unexplored. In this work, we make
the first exploration to leverage Chinese GPT for pinyin input method. We find
that a frozen GPT achieves state-of-the-art performance on perfect pinyin.
However, the performance drops dramatically when the input includes abbreviated
pinyin. A reason is that an abbreviated pinyin can be mapped to many perfect
pinyin, which links to even larger number of Chinese characters. We mitigate
this issue with two strategies, including enriching the context with pinyin and
optimizing the training process to help distinguish homophones. To further
facilitate the evaluation of pinyin input method, we create a dataset
consisting of 270K instances from 15 domains. Results show that our approach
improves performance on abbreviated pinyin across all domains. Model analysis
demonstrates that both strategies contribute to the performance boost. |
---|---|
AbstractList | While GPT has become the de-facto method for text generation tasks, its
application to pinyin input method remains unexplored. In this work, we make
the first exploration to leverage Chinese GPT for pinyin input method. We find
that a frozen GPT achieves state-of-the-art performance on perfect pinyin.
However, the performance drops dramatically when the input includes abbreviated
pinyin. A reason is that an abbreviated pinyin can be mapped to many perfect
pinyin, which links to even larger number of Chinese characters. We mitigate
this issue with two strategies, including enriching the context with pinyin and
optimizing the training process to help distinguish homophones. To further
facilitate the evaluation of pinyin input method, we create a dataset
consisting of 270K instances from 15 domains. Results show that our approach
improves performance on abbreviated pinyin across all domains. Model analysis
demonstrates that both strategies contribute to the performance boost. |
Author | Jiang, Jing Tang, Duyu Li, Jiwei Shi, Shuming Huang, Guoping Feng, Zhangyin Tan, Minghuan Dai, Yong |
Author_xml | – sequence: 1 givenname: Minghuan surname: Tan fullname: Tan, Minghuan – sequence: 2 givenname: Yong surname: Dai fullname: Dai, Yong – sequence: 3 givenname: Duyu surname: Tang fullname: Tang, Duyu – sequence: 4 givenname: Zhangyin surname: Feng fullname: Feng, Zhangyin – sequence: 5 givenname: Guoping surname: Huang fullname: Huang, Guoping – sequence: 6 givenname: Jing surname: Jiang fullname: Jiang, Jing – sequence: 7 givenname: Jiwei surname: Li fullname: Li, Jiwei – sequence: 8 givenname: Shuming surname: Shi fullname: Shi, Shuming |
BackLink | https://doi.org/10.48550/arXiv.2203.00249$$DView paper in arXiv |
BookMark | eNotj7tuwjAYRj3QAWgfoBN-gaR_fY1HFFGKBCpD9siX32AJnCikFbx9Be109C1H35mRSe4yEvL6DqWopIQ3O1zTT8kY8BKACTMlZnXtT92Q8oHaHOgy2H68j_qYMl6QrvcNHTu6T_mWMt3k_nukOxyPXXgmT9GeLvjyzzlpPlZN_Vlsv9aberktrNKmQG-5sgYUi8EEbZBJpxQ3DqAKUYFGjF46X3ktYsUwaOmEl9ohFxp95HOy-NM-vrf9kM52uLX3hvbRwH8BHeVDBw |
ContentType | Journal Article |
Copyright | http://creativecommons.org/licenses/by/4.0 |
Copyright_xml | – notice: http://creativecommons.org/licenses/by/4.0 |
DBID | AKY GOX |
DOI | 10.48550/arxiv.2203.00249 |
DatabaseName | arXiv Computer Science arXiv.org |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
ExternalDocumentID | 2203_00249 |
GroupedDBID | AKY GOX |
ID | FETCH-LOGICAL-a679-eca36a9062fd9d79e25b6639b008df607eefc5bc8c74f82ed75b4c57be347ecf3 |
IEDL.DBID | GOX |
IngestDate | Mon Jan 08 05:38:16 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-a679-eca36a9062fd9d79e25b6639b008df607eefc5bc8c74f82ed75b4c57be347ecf3 |
OpenAccessLink | https://arxiv.org/abs/2203.00249 |
ParticipantIDs | arxiv_primary_2203_00249 |
PublicationCentury | 2000 |
PublicationDate | 2022-03-01 |
PublicationDateYYYYMMDD | 2022-03-01 |
PublicationDate_xml | – month: 03 year: 2022 text: 2022-03-01 day: 01 |
PublicationDecade | 2020 |
PublicationYear | 2022 |
Score | 1.8338068 |
SecondaryResourceType | preprint |
Snippet | While GPT has become the de-facto method for text generation tasks, its
application to pinyin input method remains unexplored. In this work, we make
the first... |
SourceID | arxiv |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Artificial Intelligence Computer Science - Computation and Language |
Title | Exploring and Adapting Chinese GPT to Pinyin Input Method |
URI | https://arxiv.org/abs/2203.00249 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV27TgMxEFwlqWgQCFB4ygWtReLz68oIkQSkQIogpTvZvrV0zRElFwR_j-07Hg2t7cbjYne04xmA2wzNOHTGGXXOWspLJaj1hlEXTSddqQLPjRPdxbOcv_KntVj3gHz_hTHbj-q99Qe2uzvGkgFpoAh96DMWJVuzl3U7nExWXN3533Ohx0xLf4rE9AgOu-6OTNrnOIYe1ieQ_-jcSKDtZFKaTdQakxhdjTsks-WKNG9kWdWfVU0e682-IYsU7HwKq-nD6n5Ou8QCaqTKKTqTSROdf32ZlypHJmyo6NF2UJdejhSid8I67RT3mmFAxnInlMWMK3Q-O4NBIP04BGLHUqPIpFNacp0zrWPwE_cjgbm1Wp_DMN2z2LSmFEWEoEgQXPy_dQkHLMr3k4bqCgbNdo_Xoag29iYh-wWK2XaK |
link.rule.ids | 228,230,783,888 |
linkProvider | Cornell University |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Exploring+and+Adapting+Chinese+GPT+to+Pinyin+Input+Method&rft.au=Tan%2C+Minghuan&rft.au=Dai%2C+Yong&rft.au=Tang%2C+Duyu&rft.au=Feng%2C+Zhangyin&rft.date=2022-03-01&rft_id=info:doi/10.48550%2Farxiv.2203.00249&rft.externalDocID=2203_00249 |