Use of a large language model with instruction‐tuning for reliable clinical frailty scoring

Bibliographic Details
Published in Journal of the American Geriatrics Society (JAGS) Vol. 72; no. 12; pp. 3849-3854
Main Authors Kee, Xiang Lee Jamie; Sng, Gerald Gui Ren; Lim, Daniel Yan Zheng; Tung, Joshua Yi Min; Abdullah, Hairil Rizal; Chowdury, Anupama Roy
Format Journal Article
Language English
Published Hoboken, USA John Wiley & Sons, Inc 01.12.2024
Wiley Subscription Services, Inc
ISSN 0002-8614
EISSN 1532-5415
DOI 10.1111/jgs.19114

Abstract
Background Frailty is an important predictor of health outcomes, characterized by increased vulnerability due to physiological decline. The Clinical Frailty Scale (CFS) is commonly used for frailty assessment but may be influenced by rater bias. Use of artificial intelligence (AI), particularly Large Language Models (LLMs), offers a promising method for efficient and reliable frailty scoring.
Methods The study used seven standardized patient scenarios to evaluate the consistency and reliability of CFS scoring by OpenAI's GPT-3.5-turbo model. Two methods were tested: a basic prompt and an instruction-tuned prompt incorporating the CFS definition, a directive for accurate responses, and temperature control. The outputs of the two prompts were compared using the Mann–Whitney U test, and inter-rater reliability was assessed with Fleiss' Kappa. The outputs were also compared with historical human scores of the same scenarios.
Results The LLM's median scores were similar to those of human raters, with differences of no more than one point. Significant differences in score distributions were observed between the basic and instruction-tuned prompts in five out of seven scenarios. The instruction-tuned prompt showed high inter-rater reliability (Fleiss' Kappa of 0.887) and produced consistent responses in all scenarios. Difficulty in scoring was noted in scenarios with less explicit information on activities of daily living (ADLs).
Conclusions This study demonstrates the potential of LLMs to score clinical frailty consistently and with high reliability. It demonstrates that prompt engineering via instruction-tuning can be a simple but effective approach for optimizing LLMs in healthcare applications. The LLM may overestimate frailty scores when less information about ADLs is provided, possibly because it is less subject to implicit assumptions and extrapolation than humans. Future research could explore the integration of LLMs in clinical research and frailty-related outcome prediction.
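As an illustration of the reliability statistic named in the Methods, the following is a minimal sketch of Fleiss' Kappa in plain Python. It assumes that each repeated model response is treated as one "rater" and each scenario as one subject; this record does not state how the authors set up the calculation. The function name `fleiss_kappa`, the variable names, and the scores are all hypothetical and invented for illustration; they are not study data.

```python
from collections import Counter


def fleiss_kappa(ratings, categories):
    """Fleiss' Kappa for a fixed number of ratings per subject.

    ratings:    list of per-subject lists of category labels
                (every subject must have the same number of ratings, >= 2).
    categories: list of all possible category labels.
    """
    n_subjects = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2 or any(len(r) != n_raters for r in ratings):
        raise ValueError("each subject needs the same number (>= 2) of ratings")

    # counts[i][j] = number of ratings placing subject i in category j
    counts = []
    for subject_ratings in ratings:
        tally = Counter(subject_ratings)
        counts.append([tally[c] for c in categories])

    # Observed agreement per subject, then averaged over subjects
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_subjects

    # Chance agreement from the overall share of each category
    total = n_subjects * n_raters
    p_j = [sum(row[j] for row in counts) / total for j in range(len(categories))]
    p_e = sum(p * p for p in p_j)

    if p_e == 1.0:
        # Every rating fell in one category, so Kappa is undefined
        return float("nan")
    return (p_bar - p_e) / (1.0 - p_e)


# Hypothetical CFS scores (1-9): seven scenarios, five repeated responses each.
CFS_LEVELS = list(range(1, 10))
hypothetical_scores = [
    [3, 3, 3, 3, 3],
    [5, 5, 5, 5, 5],
    [6, 6, 6, 6, 7],
    [4, 4, 4, 4, 4],
    [7, 7, 7, 7, 7],
    [2, 2, 2, 2, 2],
    [5, 5, 6, 5, 5],
]

print(f"Fleiss' Kappa: {fleiss_kappa(hypothetical_scores, CFS_LEVELS):.3f}")
```

A value near 1 indicates that repeated responses almost always agree on the same score for a given scenario, while a value near 0 indicates agreement no better than chance. Treating CFS levels as unordered categories, as this statistic does, ignores how far apart two disagreeing scores are.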
Author_xml – sequence: 1
  givenname: Xiang Lee Jamie
  orcidid: 0000-0001-8387-9513
  surname: Kee
  fullname: Kee, Xiang Lee Jamie
  email: drjamiekee@gmail.com
  organization: Singapore General Hospital
– sequence: 2
  givenname: Gerald Gui Ren
  surname: Sng
  fullname: Sng, Gerald Gui Ren
  organization: Singapore General Hospital
– sequence: 3
  givenname: Daniel Yan Zheng
  surname: Lim
  fullname: Lim, Daniel Yan Zheng
  organization: Singapore General Hospital
– sequence: 4
  givenname: Joshua Yi Min
  surname: Tung
  fullname: Tung, Joshua Yi Min
  organization: Singapore General Hospital
– sequence: 5
  givenname: Hairil Rizal
  surname: Abdullah
  fullname: Abdullah, Hairil Rizal
  organization: Singapore General Hospital
– sequence: 6
  givenname: Anupama Roy
  surname: Chowdury
  fullname: Chowdury, Anupama Roy
  organization: Singapore General Hospital
ContentType Journal Article
Copyright 2024 The American Geriatrics Society.
2024 American Geriatrics Society and Wiley Periodicals LLC.
Discipline Medicine
EISSN 1532-5415
EndPage 3854
Genre shortCommunication
Journal Article
IsPeerReviewed true
IsScholarly true
Issue 12
Keywords frailty
geriatrics
artificial intelligence
Language English
License 2024 The American Geriatrics Society.
Notes Xiang Lee Jamie Kee and Gerald Gui Ren Sng contributed equally to this work.
ORCID 0000-0001-8387-9513
PMID 39105505
PageCount 6
PublicationCentury 2000
PublicationDate December 2024
2024-12-00
2024-Dec
20241201
PublicationDateYYYYMMDD 2024-12-01
PublicationDate_xml – month: 12
  year: 2024
  text: December 2024
PublicationDecade 2020
PublicationPlace Hoboken, USA
PublicationPlace_xml – name: Hoboken, USA
– name: United States
– name: New York
PublicationTitle Journal of the American Geriatrics Society (JAGS)
PublicationTitleAlternate J Am Geriatr Soc
PublicationYear 2024
Publisher John Wiley & Sons, Inc
Wiley Subscription Services, Inc
Publisher_xml – name: John Wiley & Sons, Inc
– name: Wiley Subscription Services, Inc
StartPage 3849
SubjectTerms Activities of daily living
Aged
Aged, 80 and over
Artificial Intelligence
Female
Frail Elderly
Frailty
Frailty - diagnosis
Geriatric Assessment - methods
geriatrics
Humans
Large language models
Male
Medical tests
Older people
Reliability
Reproducibility of Results
Title Use of a large language model with instruction‐tuning for reliable clinical frailty scoring
URI https://onlinelibrary.wiley.com/doi/abs/10.1111%2Fjgs.19114
https://www.ncbi.nlm.nih.gov/pubmed/39105505
https://www.proquest.com/docview/3143333577
https://www.proquest.com/docview/3089515161
Volume 72