DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities
Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries suc...
Saved in:
Published in | 2023 International Conference on High Performance Big Data and Intelligent Systems (HDIS) pp. 186 - 193 |
---|---|
Main Authors | , , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
06.12.2023
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/HDIS60872.2023.10499472 |
Cover
Loading…
Abstract | Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries such as diagnostic queries and drug recommendations. In this paper, we propose DoctorGPT, a domain-specific large language model tailored for medical question-answering tasks. DoctorGPT leverages the open-source Baichuan2 as its foundational model, undergoes extensive pre-training on medical encyclopedic data to incorporate medical knowledge, and subsequently undergoes fine-tuning on a dataset consisting of two million medical instruction-dialogue pairs to enhance its question-answering capabilities. When compared to general-purpose large models, DoctorGPT demonstrates significant advantages in Chinese medical question-answerinz (O&A) tasks. |
---|---|
AbstractList | Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries such as diagnostic queries and drug recommendations. In this paper, we propose DoctorGPT, a domain-specific large language model tailored for medical question-answering tasks. DoctorGPT leverages the open-source Baichuan2 as its foundational model, undergoes extensive pre-training on medical encyclopedic data to incorporate medical knowledge, and subsequently undergoes fine-tuning on a dataset consisting of two million medical instruction-dialogue pairs to enhance its question-answering capabilities. When compared to general-purpose large models, DoctorGPT demonstrates significant advantages in Chinese medical question-answerinz (O&A) tasks. |
Author | Wu, Min Li, Wenqiang Hao, Meilan Liu, Jingyi Li, Yanjie Yu, Lina |
Author_xml | – sequence: 1 givenname: Wenqiang surname: Li fullname: Li, Wenqiang email: liwenqiang@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 2 givenname: Lina surname: Yu fullname: Yu, Lina email: yulina@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 3 givenname: Min surname: Wu fullname: Wu, Min email: wumin@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 4 givenname: Jingyi surname: Liu fullname: Liu, Jingyi email: liujingyi@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 5 givenname: Meilan surname: Hao fullname: Hao, Meilan email: mlhao@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 6 givenname: Yanjie surname: Li fullname: Li, Yanjie email: liyanjie@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 |
BookMark | eNo1j81Kw0AUhUfQhda-gWBeIHF-MplcdyHVthBRsbott5M76UCclCSl-PYNqJtzPs7ig3PDLkMXiLF7wRMhODysFuuPjOdGJpJLlQieAqRGXrA5GMiV5gpSkfFr9rXo7Nj1y7fNY1REFfYNTRmaI07w0tXURic_7qNy7wMN00S1t9hG70caRt-FuAjDiXofmqjEA-5860dPwy27ctgONP_rGft8ftqUq7h6Xa7Looq9EDDG0gFJYXIJ6c5KyFNC6VCjJq0V2Iy71GkCK4zJQNU8R6c5WqkFoeAk1Yzd_Xo9EW0Pvf_G_mf7f1edAUEkTws |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/HDIS60872.2023.10499472 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9798350394160 |
EndPage | 193 |
ExternalDocumentID | 10499472 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i119t-2f9e2178294bc2984ea2fa5a5e5539c60f4f5e9c177693d08af50ac251ea10e23 |
IEDL.DBID | RIE |
IngestDate | Wed May 01 11:49:10 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i119t-2f9e2178294bc2984ea2fa5a5e5539c60f4f5e9c177693d08af50ac251ea10e23 |
PageCount | 8 |
ParticipantIDs | ieee_primary_10499472 |
PublicationCentury | 2000 |
PublicationDate | 2023-Dec.-6 |
PublicationDateYYYYMMDD | 2023-12-06 |
PublicationDate_xml | – month: 12 year: 2023 text: 2023-Dec.-6 day: 06 |
PublicationDecade | 2020 |
PublicationTitle | 2023 International Conference on High Performance Big Data and Intelligent Systems (HDIS) |
PublicationTitleAbbrev | HDIS |
PublicationYear | 2023 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.8560193 |
Snippet | Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 186 |
SubjectTerms | artificial intelligence Benchmark testing Big Data Biomedical equipment Data models Drugs Encyclopedias large language model medical question-answering system Medical services |
Title | DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities |
URI | https://ieeexplore.ieee.org/document/10499472 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwEA66J59UnPibPPiamjZN0_g2pnOKjoGb7G2k6QVE6YbrEPzrvaSboiD4Ukoobck1_e4u33dHyHmeucQDHUP8AZaq0rEccZIJh0uviDNnQu3Oh0HWH6d3EzlZidWDFgYAAvkMIn8a9vLLmV36VBmucPTPU4V_3E38zhqx1oqzFXN90b-6fcx4rry-KhHR-uoffVMCbPS2yWD9wIYt8hIt6yKyH79qMf77jXZI-1uhR4df2LNLNqDaI0-IGBhC3wxHl7RD7z3HG49NPpL6pmev1Kddqe-ZDQscajZpaEh6on1Yp1q8h9KEtIsYGmizGEi3ybh3Per22apvAnuOY12zxGnASCNPdFrYROcpmMQZaSRIKbTNuEudBG1j5Rshljw3TnJj0dMBE3NIxD5pVbMKDgjFW2mtXCksF6lw1khreKm8l6CcUPEhaftJmc6b0hjT9Xwc_TF-TLa8bQIfJDshrfptCaeI6nVxFqz5CVR_oc0 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA6iBz2pOPG3OXhNTZukabyN6ex0GwM32W1kaQKidOI6BP96X9JNURC8lBJCW_JIv_devu89hC6y1CUe6AjgjyVcFo5kgJOEOdh60zh1OtTu7PXTfMTvxmK8FKsHLYy1NpDPbORvw1l-MTMLnyqDHQ7-OZfwx90A4OeilmstWVsxVZf5dechpZn0CquERav5PzqnBOBob6P-6pU1X-Q5WlTTyHz8qsb472_aQY1vjR4efKHPLlqz5R56BMyAIPp2MLzCTdz1LG-41hlJ7NuevWCfeMW-a7adw1B9TIND2hMsRJrl_D0UJ8QtQNFAnIVQuoFG7ZthKyfLzgnkKY5VRRKnLMQaWaL41CQq41YnTgstrBBMmZQ67oRVJpa-FWJBM-0E1QZ8HatjahO2j9bLWWkPEIZHKSVdwQxlnDmjhdG0kN5PkI7J-BA1_KJMXuviGJPVehz9MX6ONvNhrzvpdvr3x2jL2ymwQ9ITtF69LewpYHw1PQuW_QT90qUa |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2023+International+Conference+on+High+Performance+Big+Data+and+Intelligent+Systems+%28HDIS%29&rft.atitle=DoctorGPT%3A+A+Large+Language+Model+with+Chinese+Medical+Question-Answering+Capabilities&rft.au=Li%2C+Wenqiang&rft.au=Yu%2C+Lina&rft.au=Wu%2C+Min&rft.au=Liu%2C+Jingyi&rft.date=2023-12-06&rft.pub=IEEE&rft.spage=186&rft.epage=193&rft_id=info:doi/10.1109%2FHDIS60872.2023.10499472&rft.externalDocID=10499472 |