DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities

Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries suc...

Full description

Saved in:

Bibliographic Details
Published in	2023 International Conference on High Performance Big Data and Intelligent Systems (HDIS) pp. 186 - 193
Main Authors	Li, Wenqiang, Yu, Lina, Wu, Min, Liu, Jingyi, Hao, Meilan, Li, Yanjie
Format	Conference Proceeding
Language	English
Published	IEEE 06.12.2023
Subjects	artificial intelligence Benchmark testing Big Data Biomedical equipment Data models Drugs Encyclopedias large language model medical question-answering system Medical services
Online Access	Get full text
DOI	10.1109/HDIS60872.2023.10499472

Cover

Loading…

Abstract	Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries such as diagnostic queries and drug recommendations. In this paper, we propose DoctorGPT, a domain-specific large language model tailored for medical question-answering tasks. DoctorGPT leverages the open-source Baichuan2 as its foundational model, undergoes extensive pre-training on medical encyclopedic data to incorporate medical knowledge, and subsequently undergoes fine-tuning on a dataset consisting of two million medical instruction-dialogue pairs to enhance its question-answering capabilities. When compared to general-purpose large models, DoctorGPT demonstrates significant advantages in Chinese medical question-answerinz (O&A) tasks.
AbstractList	Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries such as diagnostic queries and drug recommendations. In this paper, we propose DoctorGPT, a domain-specific large language model tailored for medical question-answering tasks. DoctorGPT leverages the open-source Baichuan2 as its foundational model, undergoes extensive pre-training on medical encyclopedic data to incorporate medical knowledge, and subsequently undergoes fine-tuning on a dataset consisting of two million medical instruction-dialogue pairs to enhance its question-answering capabilities. When compared to general-purpose large models, DoctorGPT demonstrates significant advantages in Chinese medical question-answerinz (O&A) tasks.
Author	Wu, Min Li, Wenqiang Hao, Meilan Liu, Jingyi Li, Yanjie Yu, Lina
Author_xml	– sequence: 1 givenname: Wenqiang surname: Li fullname: Li, Wenqiang email: liwenqiang@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 2 givenname: Lina surname: Yu fullname: Yu, Lina email: yulina@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 3 givenname: Min surname: Wu fullname: Wu, Min email: wumin@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 4 givenname: Jingyi surname: Liu fullname: Liu, Jingyi email: liujingyi@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 5 givenname: Meilan surname: Hao fullname: Hao, Meilan email: mlhao@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083 – sequence: 6 givenname: Yanjie surname: Li fullname: Li, Yanjie email: liyanjie@semi.ac.cn organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083
BookMark	eNo1j81Kw0AUhUfQhda-gWBeIHF-MplcdyHVthBRsbott5M76UCclCSl-PYNqJtzPs7ig3PDLkMXiLF7wRMhODysFuuPjOdGJpJLlQieAqRGXrA5GMiV5gpSkfFr9rXo7Nj1y7fNY1REFfYNTRmaI07w0tXURic_7qNy7wMN00S1t9hG70caRt-FuAjDiXofmqjEA-5860dPwy27ctgONP_rGft8ftqUq7h6Xa7Looq9EDDG0gFJYXIJ6c5KyFNC6VCjJq0V2Iy71GkCK4zJQNU8R6c5WqkFoeAk1Yzd_Xo9EW0Pvf_G_mf7f1edAUEkTws
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/HDIS60872.2023.10499472
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
EISBN	9798350394160
EndPage	193
ExternalDocumentID	10499472
Genre	orig-research
GroupedDBID	6IE 6IL CBEJK RIE RIL
ID	FETCH-LOGICAL-i119t-2f9e2178294bc2984ea2fa5a5e5539c60f4f5e9c177693d08af50ac251ea10e23
IEDL.DBID	RIE
IngestDate	Wed May 01 11:49:10 EDT 2024
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i119t-2f9e2178294bc2984ea2fa5a5e5539c60f4f5e9c177693d08af50ac251ea10e23
PageCount	8
ParticipantIDs	ieee_primary_10499472
PublicationCentury	2000
PublicationDate	2023-Dec.-6
PublicationDateYYYYMMDD	2023-12-06
PublicationDate_xml	– month: 12 year: 2023 text: 2023-Dec.-6 day: 06
PublicationDecade	2020
PublicationTitle	2023 International Conference on High Performance Big Data and Intelligent Systems (HDIS)
PublicationTitleAbbrev	HDIS
PublicationYear	2023
Publisher	IEEE
Publisher_xml	– name: IEEE
Score	1.8560193
Snippet	Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in...
SourceID	ieee
SourceType	Publisher
StartPage	186
SubjectTerms	artificial intelligence Benchmark testing Big Data Biomedical equipment Data models Drugs Encyclopedias large language model medical question-answering system Medical services
Title	DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities
URI	https://ieeexplore.ieee.org/document/10499472
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwEA66J59UnPibPPiamjZN0_g2pnOKjoGb7G2k6QVE6YbrEPzrvaSboiD4Ukoobck1_e4u33dHyHmeucQDHUP8AZaq0rEccZIJh0uviDNnQu3Oh0HWH6d3EzlZidWDFgYAAvkMIn8a9vLLmV36VBmucPTPU4V_3E38zhqx1oqzFXN90b-6fcx4rry-KhHR-uoffVMCbPS2yWD9wIYt8hIt6yKyH79qMf77jXZI-1uhR4df2LNLNqDaI0-IGBhC3wxHl7RD7z3HG49NPpL6pmev1Kddqe-ZDQscajZpaEh6on1Yp1q8h9KEtIsYGmizGEi3ybh3Per22apvAnuOY12zxGnASCNPdFrYROcpmMQZaSRIKbTNuEudBG1j5Rshljw3TnJj0dMBE3NIxD5pVbMKDgjFW2mtXCksF6lw1khreKm8l6CcUPEhaftJmc6b0hjT9Xwc_TF-TLa8bQIfJDshrfptCaeI6nVxFqz5CVR_oc0
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA6iBz2pOPG3OXhNTZukabyN6ex0GwM32W1kaQKidOI6BP96X9JNURC8lBJCW_JIv_devu89hC6y1CUe6AjgjyVcFo5kgJOEOdh60zh1OtTu7PXTfMTvxmK8FKsHLYy1NpDPbORvw1l-MTMLnyqDHQ7-OZfwx90A4OeilmstWVsxVZf5dechpZn0CquERav5PzqnBOBob6P-6pU1X-Q5WlTTyHz8qsb472_aQY1vjR4efKHPLlqz5R56BMyAIPp2MLzCTdz1LG-41hlJ7NuevWCfeMW-a7adw1B9TIND2hMsRJrl_D0UJ8QtQNFAnIVQuoFG7ZthKyfLzgnkKY5VRRKnLMQaWaL41CQq41YnTgstrBBMmZQ67oRVJpa-FWJBM-0E1QZ8HatjahO2j9bLWWkPEIZHKSVdwQxlnDmjhdG0kN5PkI7J-BA1_KJMXuviGJPVehz9MX6ONvNhrzvpdvr3x2jL2ymwQ9ITtF69LewpYHw1PQuW_QT90qUa
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2023+International+Conference+on+High+Performance+Big+Data+and+Intelligent+Systems+%28HDIS%29&rft.atitle=DoctorGPT%3A+A+Large+Language+Model+with+Chinese+Medical+Question-Answering+Capabilities&rft.au=Li%2C+Wenqiang&rft.au=Yu%2C+Lina&rft.au=Wu%2C+Min&rft.au=Liu%2C+Jingyi&rft.date=2023-12-06&rft.pub=IEEE&rft.spage=186&rft.epage=193&rft_id=info:doi/10.1109%2FHDIS60872.2023.10499472&rft.externalDocID=10499472