DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities

Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries suc...

Full description

Saved in:
Bibliographic Details
Published in2023 International Conference on High Performance Big Data and Intelligent Systems (HDIS) pp. 186 - 193
Main Authors Li, Wenqiang, Yu, Lina, Wu, Min, Liu, Jingyi, Hao, Meilan, Li, Yanjie
Format Conference Proceeding
LanguageEnglish
Published IEEE 06.12.2023
Subjects
Online AccessGet full text
DOI10.1109/HDIS60872.2023.10499472

Cover

Loading…
Abstract Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries such as diagnostic queries and drug recommendations. In this paper, we propose DoctorGPT, a domain-specific large language model tailored for medical question-answering tasks. DoctorGPT leverages the open-source Baichuan2 as its foundational model, undergoes extensive pre-training on medical encyclopedic data to incorporate medical knowledge, and subsequently undergoes fine-tuning on a dataset consisting of two million medical instruction-dialogue pairs to enhance its question-answering capabilities. When compared to general-purpose large models, DoctorGPT demonstrates significant advantages in Chinese medical question-answerinz (O&A) tasks.
AbstractList Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries such as diagnostic queries and drug recommendations. In this paper, we propose DoctorGPT, a domain-specific large language model tailored for medical question-answering tasks. DoctorGPT leverages the open-source Baichuan2 as its foundational model, undergoes extensive pre-training on medical encyclopedic data to incorporate medical knowledge, and subsequently undergoes fine-tuning on a dataset consisting of two million medical instruction-dialogue pairs to enhance its question-answering capabilities. When compared to general-purpose large models, DoctorGPT demonstrates significant advantages in Chinese medical question-answerinz (O&A) tasks.
Author Wu, Min
Li, Wenqiang
Hao, Meilan
Liu, Jingyi
Li, Yanjie
Yu, Lina
Author_xml – sequence: 1
  givenname: Wenqiang
  surname: Li
  fullname: Li, Wenqiang
  email: liwenqiang@semi.ac.cn
  organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083
– sequence: 2
  givenname: Lina
  surname: Yu
  fullname: Yu, Lina
  email: yulina@semi.ac.cn
  organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083
– sequence: 3
  givenname: Min
  surname: Wu
  fullname: Wu, Min
  email: wumin@semi.ac.cn
  organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083
– sequence: 4
  givenname: Jingyi
  surname: Liu
  fullname: Liu, Jingyi
  email: liujingyi@semi.ac.cn
  organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083
– sequence: 5
  givenname: Meilan
  surname: Hao
  fullname: Hao, Meilan
  email: mlhao@semi.ac.cn
  organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083
– sequence: 6
  givenname: Yanjie
  surname: Li
  fullname: Li, Yanjie
  email: liyanjie@semi.ac.cn
  organization: Institute of Semiconductors, Chinese Academy of Sciences,AnnLab,Beijing,China,100083
BookMark eNo1j81Kw0AUhUfQhda-gWBeIHF-MplcdyHVthBRsbott5M76UCclCSl-PYNqJtzPs7ig3PDLkMXiLF7wRMhODysFuuPjOdGJpJLlQieAqRGXrA5GMiV5gpSkfFr9rXo7Nj1y7fNY1REFfYNTRmaI07w0tXURic_7qNy7wMN00S1t9hG70caRt-FuAjDiXofmqjEA-5860dPwy27ctgONP_rGft8ftqUq7h6Xa7Looq9EDDG0gFJYXIJ6c5KyFNC6VCjJq0V2Iy71GkCK4zJQNU8R6c5WqkFoeAk1Yzd_Xo9EW0Pvf_G_mf7f1edAUEkTws
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/HDIS60872.2023.10499472
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350394160
EndPage 193
ExternalDocumentID 10499472
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i119t-2f9e2178294bc2984ea2fa5a5e5539c60f4f5e9c177693d08af50ac251ea10e23
IEDL.DBID RIE
IngestDate Wed May 01 11:49:10 EDT 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i119t-2f9e2178294bc2984ea2fa5a5e5539c60f4f5e9c177693d08af50ac251ea10e23
PageCount 8
ParticipantIDs ieee_primary_10499472
PublicationCentury 2000
PublicationDate 2023-Dec.-6
PublicationDateYYYYMMDD 2023-12-06
PublicationDate_xml – month: 12
  year: 2023
  text: 2023-Dec.-6
  day: 06
PublicationDecade 2020
PublicationTitle 2023 International Conference on High Performance Big Data and Intelligent Systems (HDIS)
PublicationTitleAbbrev HDIS
PublicationYear 2023
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8560193
Snippet Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in...
SourceID ieee
SourceType Publisher
StartPage 186
SubjectTerms artificial intelligence
Benchmark testing
Big Data
Biomedical equipment
Data models
Drugs
Encyclopedias
large language model
medical question-answering system
Medical services
Title DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities
URI https://ieeexplore.ieee.org/document/10499472
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwEA66J59UnPibPPiamjZN0_g2pnOKjoGb7G2k6QVE6YbrEPzrvaSboiD4Ukoobck1_e4u33dHyHmeucQDHUP8AZaq0rEccZIJh0uviDNnQu3Oh0HWH6d3EzlZidWDFgYAAvkMIn8a9vLLmV36VBmucPTPU4V_3E38zhqx1oqzFXN90b-6fcx4rry-KhHR-uoffVMCbPS2yWD9wIYt8hIt6yKyH79qMf77jXZI-1uhR4df2LNLNqDaI0-IGBhC3wxHl7RD7z3HG49NPpL6pmev1Kddqe-ZDQscajZpaEh6on1Yp1q8h9KEtIsYGmizGEi3ybh3Per22apvAnuOY12zxGnASCNPdFrYROcpmMQZaSRIKbTNuEudBG1j5Rshljw3TnJj0dMBE3NIxD5pVbMKDgjFW2mtXCksF6lw1khreKm8l6CcUPEhaftJmc6b0hjT9Xwc_TF-TLa8bQIfJDshrfptCaeI6nVxFqz5CVR_oc0
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA6iBz2pOPG3OXhNTZukabyN6ex0GwM32W1kaQKidOI6BP96X9JNURC8lBJCW_JIv_devu89hC6y1CUe6AjgjyVcFo5kgJOEOdh60zh1OtTu7PXTfMTvxmK8FKsHLYy1NpDPbORvw1l-MTMLnyqDHQ7-OZfwx90A4OeilmstWVsxVZf5dechpZn0CquERav5PzqnBOBob6P-6pU1X-Q5WlTTyHz8qsb472_aQY1vjR4efKHPLlqz5R56BMyAIPp2MLzCTdz1LG-41hlJ7NuevWCfeMW-a7adw1B9TIND2hMsRJrl_D0UJ8QtQNFAnIVQuoFG7ZthKyfLzgnkKY5VRRKnLMQaWaL41CQq41YnTgstrBBMmZQ67oRVJpa-FWJBM-0E1QZ8HatjahO2j9bLWWkPEIZHKSVdwQxlnDmjhdG0kN5PkI7J-BA1_KJMXuviGJPVehz9MX6ONvNhrzvpdvr3x2jL2ymwQ9ITtF69LewpYHw1PQuW_QT90qUa
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2023+International+Conference+on+High+Performance+Big+Data+and+Intelligent+Systems+%28HDIS%29&rft.atitle=DoctorGPT%3A+A+Large+Language+Model+with+Chinese+Medical+Question-Answering+Capabilities&rft.au=Li%2C+Wenqiang&rft.au=Yu%2C+Lina&rft.au=Wu%2C+Min&rft.au=Liu%2C+Jingyi&rft.date=2023-12-06&rft.pub=IEEE&rft.spage=186&rft.epage=193&rft_id=info:doi/10.1109%2FHDIS60872.2023.10499472&rft.externalDocID=10499472