P-320 WHAT DOES A LARGE LANGUAGE MODEL KNOW ABOUT THE PREVALENCE OF OCCUPATIONALLY-RELATED MEDICAL CONDITIONS? EXPERIMENTS WITH SYNTHETIC AND REAL OCCUPATIONAL MEDICINE DATA

Abstract Introduction We report on experiments with Large Language Models (LLM) to generate synthetic data around occupationally-related medical conditions in a variety of industrial settings. Methods and Results A LLM was programmed to generate 10000 records giving accounts of fictitious patients w...

Full description

Saved in:
Bibliographic Details
Published inOccupational medicine (Oxford) Vol. 74; no. Supplement_1
Main Author Johnson, Mark William
Format Journal Article
LanguageEnglish
Published 05.07.2024
Online AccessGet full text

Cover

Loading…
Abstract Abstract Introduction We report on experiments with Large Language Models (LLM) to generate synthetic data around occupationally-related medical conditions in a variety of industrial settings. Methods and Results A LLM was programmed to generate 10000 records giving accounts of fictitious patients working in a variety of industrial settings with a range of randomised parameters concerning worker characteristics (e.g. age, sex, underlying conditions, type of activity, etc). The generated text was then coded by AI to determine what AI “thought” were the likely clinical outcomes. This data was then compared to the general prevalence of different medical conditions. A second experiment was conducted with historical data from the Health and Occupational Research network at Manchester University (THOR). LLMs were able to extrapolate underlying factors within the data providing contextual richness to the existing case records. Both these experiments result in realistic accounts, and we show how LLMs tend to reflect the prevalence of conditions. Discussion and Conclusion Given that an LLM is an AI tool which predicts text based on “training” derived the relationships between words in a vast corpus of text on the internet, these experiments raise a fundamental question as to what LLMs “know” about the prevalence of occupational medical conditions: indeed, how much knowledge about occupational medicine might be encoded in the structures of language? We suggest that the study of LLMs in this way might be an additional tool not only for identifying, predicting and mitigating risk, but for assisting in the identification of new sentinal cases.
AbstractList Abstract Introduction We report on experiments with Large Language Models (LLM) to generate synthetic data around occupationally-related medical conditions in a variety of industrial settings. Methods and Results A LLM was programmed to generate 10000 records giving accounts of fictitious patients working in a variety of industrial settings with a range of randomised parameters concerning worker characteristics (e.g. age, sex, underlying conditions, type of activity, etc). The generated text was then coded by AI to determine what AI “thought” were the likely clinical outcomes. This data was then compared to the general prevalence of different medical conditions. A second experiment was conducted with historical data from the Health and Occupational Research network at Manchester University (THOR). LLMs were able to extrapolate underlying factors within the data providing contextual richness to the existing case records. Both these experiments result in realistic accounts, and we show how LLMs tend to reflect the prevalence of conditions. Discussion and Conclusion Given that an LLM is an AI tool which predicts text based on “training” derived the relationships between words in a vast corpus of text on the internet, these experiments raise a fundamental question as to what LLMs “know” about the prevalence of occupational medical conditions: indeed, how much knowledge about occupational medicine might be encoded in the structures of language? We suggest that the study of LLMs in this way might be an additional tool not only for identifying, predicting and mitigating risk, but for assisting in the identification of new sentinal cases.
Author Johnson, Mark William
Author_xml – sequence: 1
  givenname: Mark William
  surname: Johnson
  fullname: Johnson, Mark William
BookMark eNqdkE1PwkAQhjcGE0H9BV7mDxS2H1I4mXF3oBuX3aZdrJwagiXxA5D25I_yP9oGDp69zDvJzPscngHr7Q_7irE7nw99Pg1Hh81mV72OPo7rigfhkE-D-IL1_Sj2vUnE73usz6fjwIujCb9ig6Z559wfR5Ogz35SLww4FAk6kJZyQNCYzamdZr7EdllYSRqejC0AH-3SgUsI0oyeUZMRBHYGVohlik5Zg1qvvIw0OpKwIKkEahDWSNVd8wegl5QytSDjciiUSyBfmRbolAA0EjJq___iThBlCCQ6vGGX2_VnU92e85qFM3Ii8Tb1oWnqalt-1W-7df1d-rzszJQnM-XZTNmZCf_X-gXLh2V-
ContentType Journal Article
DBID AAYXX
CITATION
DOI 10.1093/occmed/kqae023.0927
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList CrossRef
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Occupational Therapy & Rehabilitation
EISSN 1471-8405
ExternalDocumentID 10_1093_occmed_kqae023_0927
GroupedDBID ---
-E4
.2P
.I3
.ZR
0R~
123
1TH
2WC
4.4
482
48X
5VS
5WA
5WD
70D
AABZA
AACZT
AAJKP
AAMDB
AAMVS
AAOGV
AAPNW
AAPQZ
AAPXW
AARHZ
AASNB
AAUAY
AAVAP
AAYXX
ABEUO
ABIVO
ABIXL
ABKDP
ABNHQ
ABNKS
ABPTD
ABQLI
ABWST
ABXVV
ABZBJ
ACGFS
ACIWK
ACPRK
ACUFI
ACUTJ
ACUTO
ADBBV
ADEYI
ADEZT
ADGZP
ADHKW
ADHZD
ADIPN
ADJQC
ADOCK
ADQBN
ADRIX
ADRTK
ADVEK
ADYVW
ADZXQ
AEGPL
AEJOX
AEKSI
AEMDU
AENZO
AEPUE
AETBJ
AEWNT
AFFZL
AFIYH
AFOFC
AFRAH
AFXEN
AGINJ
AGKEF
AGQXC
AGSYK
AHMBA
AHXPO
AIJHB
AJEEA
AKWXX
ALMA_UNASSIGNED_HOLDINGS
ALUQC
APIBT
APWMN
ATGXG
AXUDD
BAWUL
BAYMD
BCRHZ
BEYMZ
BHONS
BTRTY
BVRKM
CDBKE
CITATION
CS3
CZ4
DAKXR
DILTD
D~K
EBS
EDH
EE~
F5P
F9B
FLUFQ
FOEOM
FOTVD
FQBLK
GAUVT
GJXCC
H5~
HAR
HW0
HZ~
IOX
J21
KAQDR
KOP
KQ8
KSI
KSN
M-Z
N9A
NGC
NOMLY
NOYVH
O9-
OAWHX
OCZFY
ODMLO
OJQWA
OJZSN
OK1
OPAEJ
OWPYF
P2P
PAFKI
PEELM
PQQKQ
Q1.
Q5Y
R44
RD5
ROL
ROX
ROZ
RUSNO
RW1
RXO
TJX
TR2
WH7
WOQ
X7H
YAYTL
YKOAZ
YXANX
ZKX
~91
ID FETCH-crossref_primary_10_1093_occmed_kqae023_09273
ISSN 0962-7480
IngestDate Thu Sep 12 21:06:31 EDT 2024
IsPeerReviewed true
IsScholarly true
Issue Supplement_1
Language English
LinkModel OpenURL
MergedId FETCHMERGED-crossref_primary_10_1093_occmed_kqae023_09273
ParticipantIDs crossref_primary_10_1093_occmed_kqae023_0927
PublicationCentury 2000
PublicationDate 2024-07-05
PublicationDateYYYYMMDD 2024-07-05
PublicationDate_xml – month: 07
  year: 2024
  text: 2024-07-05
  day: 05
PublicationDecade 2020
PublicationTitle Occupational medicine (Oxford)
PublicationYear 2024
SSID ssj0016482
Score 4.8764186
Snippet Abstract Introduction We report on experiments with Large Language Models (LLM) to generate synthetic data around occupationally-related medical conditions in...
SourceID crossref
SourceType Aggregation Database
Title P-320 WHAT DOES A LARGE LANGUAGE MODEL KNOW ABOUT THE PREVALENCE OF OCCUPATIONALLY-RELATED MEDICAL CONDITIONS? EXPERIMENTS WITH SYNTHETIC AND REAL OCCUPATIONAL MEDICINE DATA
Volume 74
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnZ3BTttAEIZXKZUQF0QpFdBSzaH04hqC107iEzKJTSiJHRKnhJOVOMsFEVoUJMTT8AK8IzO7trOpItT0YkWWNUp2vqxnZmf_ZewbHwruOkPLHNXssWkLzFNqqZOatZpTxmzgmo-vqQ7ZDivNvv1z4AxKpWeta-lhOjpInxbuK_kfr-I99Cvtkl3Cs4VRvIGf0b94RQ_j9Z983DG5VTYum15sNCK_h3_yltc99fEanvZpqagdNfyWcR5Gl4Z3EvVj2eCDg_7La8nSUhQYUb3e72SSuK0rk47QwMmskH2sR2HjTG413ueB4Q86mPiR_n8PU_-4afSuQjQZoytJpqrrUxuRZlCZOQt9o-HFnh4Iz6kb5wv8Uvv0UbXbFwWK7AyvfGNRriOhlyssW7a2OnrdsWKRgKlajBFq1sU3pImZpqNPy-rwngw_ecSprJcmRwsnfiWKdZemt1QjDm7-DAVGIwdlVykPzAtt__UCLNoS1YI8T5SZJDOSkJF37L2FUxk1DZ5fzNapKrY8jqz4UbmulcsPlZFD_ZtosY8WxMQbbD3LPsBTKH1gJTHZZKvtbPg32b7uFoiV4gR8h-6cnvtH9iLJAyIPiDzwQJIHOXkgyQMiDyR5gJjAjDyIAlhMHmTkwYy8Y9C4A-IOCu4AuQPibs4c5NwBcbfFeODH9aaZD0vyW2msJG-4gn9iK5O7idhmgOFrVVTLaSp14-yq66RiPBwLyx3ZHKPTHfZjGcu7yz3-ma3NCP_CVqb3D2IPQ9Tp6Kuk5BXseH6D
link.rule.ids 315,783,787,27938,27939
linkProvider Colorado Alliance of Research Libraries
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=P-320+WHAT+DOES+A+LARGE+LANGUAGE+MODEL+KNOW+ABOUT+THE+PREVALENCE+OF+OCCUPATIONALLY-RELATED+MEDICAL+CONDITIONS%3F+EXPERIMENTS+WITH+SYNTHETIC+AND+REAL+OCCUPATIONAL+MEDICINE+DATA&rft.jtitle=Occupational+medicine+%28Oxford%29&rft.au=Johnson%2C+Mark+William&rft.date=2024-07-05&rft.issn=0962-7480&rft.eissn=1471-8405&rft.volume=74&rft.issue=Supplement_1&rft_id=info:doi/10.1093%2Foccmed%2Fkqae023.0927&rft.externalDBID=n%2Fa&rft.externalDocID=10_1093_occmed_kqae023_0927
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0962-7480&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0962-7480&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0962-7480&client=summon