P-320 WHAT DOES A LARGE LANGUAGE MODEL KNOW ABOUT THE PREVALENCE OF OCCUPATIONALLY-RELATED MEDICAL CONDITIONS? EXPERIMENTS WITH SYNTHETIC AND REAL OCCUPATIONAL MEDICINE DATA
Abstract Introduction We report on experiments with Large Language Models (LLM) to generate synthetic data around occupationally-related medical conditions in a variety of industrial settings. Methods and Results A LLM was programmed to generate 10000 records giving accounts of fictitious patients w...
Saved in:
Published in | Occupational medicine (Oxford) Vol. 74; no. Supplement_1 |
---|---|
Main Author | |
Format | Journal Article |
Language | English |
Published |
05.07.2024
|
Online Access | Get full text |
Cover
Loading…
Abstract | Abstract Introduction We report on experiments with Large Language Models (LLM) to generate synthetic data around occupationally-related medical conditions in a variety of industrial settings. Methods and Results A LLM was programmed to generate 10000 records giving accounts of fictitious patients working in a variety of industrial settings with a range of randomised parameters concerning worker characteristics (e.g. age, sex, underlying conditions, type of activity, etc). The generated text was then coded by AI to determine what AI “thought” were the likely clinical outcomes. This data was then compared to the general prevalence of different medical conditions. A second experiment was conducted with historical data from the Health and Occupational Research network at Manchester University (THOR). LLMs were able to extrapolate underlying factors within the data providing contextual richness to the existing case records. Both these experiments result in realistic accounts, and we show how LLMs tend to reflect the prevalence of conditions. Discussion and Conclusion Given that an LLM is an AI tool which predicts text based on “training” derived the relationships between words in a vast corpus of text on the internet, these experiments raise a fundamental question as to what LLMs “know” about the prevalence of occupational medical conditions: indeed, how much knowledge about occupational medicine might be encoded in the structures of language? We suggest that the study of LLMs in this way might be an additional tool not only for identifying, predicting and mitigating risk, but for assisting in the identification of new sentinal cases. |
---|---|
AbstractList | Abstract Introduction We report on experiments with Large Language Models (LLM) to generate synthetic data around occupationally-related medical conditions in a variety of industrial settings. Methods and Results A LLM was programmed to generate 10000 records giving accounts of fictitious patients working in a variety of industrial settings with a range of randomised parameters concerning worker characteristics (e.g. age, sex, underlying conditions, type of activity, etc). The generated text was then coded by AI to determine what AI “thought” were the likely clinical outcomes. This data was then compared to the general prevalence of different medical conditions. A second experiment was conducted with historical data from the Health and Occupational Research network at Manchester University (THOR). LLMs were able to extrapolate underlying factors within the data providing contextual richness to the existing case records. Both these experiments result in realistic accounts, and we show how LLMs tend to reflect the prevalence of conditions. Discussion and Conclusion Given that an LLM is an AI tool which predicts text based on “training” derived the relationships between words in a vast corpus of text on the internet, these experiments raise a fundamental question as to what LLMs “know” about the prevalence of occupational medical conditions: indeed, how much knowledge about occupational medicine might be encoded in the structures of language? We suggest that the study of LLMs in this way might be an additional tool not only for identifying, predicting and mitigating risk, but for assisting in the identification of new sentinal cases. |
Author | Johnson, Mark William |
Author_xml | – sequence: 1 givenname: Mark William surname: Johnson fullname: Johnson, Mark William |
BookMark | eNqdkE1PwkAQhjcGE0H9BV7mDxS2H1I4mXF3oBuX3aZdrJwagiXxA5D25I_yP9oGDp69zDvJzPscngHr7Q_7irE7nw99Pg1Hh81mV72OPo7rigfhkE-D-IL1_Sj2vUnE73usz6fjwIujCb9ig6Z559wfR5Ogz35SLww4FAk6kJZyQNCYzamdZr7EdllYSRqejC0AH-3SgUsI0oyeUZMRBHYGVohlik5Zg1qvvIw0OpKwIKkEahDWSNVd8wegl5QytSDjciiUSyBfmRbolAA0EjJq___iThBlCCQ6vGGX2_VnU92e85qFM3Ii8Tb1oWnqalt-1W-7df1d-rzszJQnM-XZTNmZCf_X-gXLh2V- |
ContentType | Journal Article |
DBID | AAYXX CITATION |
DOI | 10.1093/occmed/kqae023.0927 |
DatabaseName | CrossRef |
DatabaseTitle | CrossRef |
DatabaseTitleList | CrossRef |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Occupational Therapy & Rehabilitation |
EISSN | 1471-8405 |
ExternalDocumentID | 10_1093_occmed_kqae023_0927 |
GroupedDBID | --- -E4 .2P .I3 .ZR 0R~ 123 1TH 2WC 4.4 482 48X 5VS 5WA 5WD 70D AABZA AACZT AAJKP AAMDB AAMVS AAOGV AAPNW AAPQZ AAPXW AARHZ AASNB AAUAY AAVAP AAYXX ABEUO ABIVO ABIXL ABKDP ABNHQ ABNKS ABPTD ABQLI ABWST ABXVV ABZBJ ACGFS ACIWK ACPRK ACUFI ACUTJ ACUTO ADBBV ADEYI ADEZT ADGZP ADHKW ADHZD ADIPN ADJQC ADOCK ADQBN ADRIX ADRTK ADVEK ADYVW ADZXQ AEGPL AEJOX AEKSI AEMDU AENZO AEPUE AETBJ AEWNT AFFZL AFIYH AFOFC AFRAH AFXEN AGINJ AGKEF AGQXC AGSYK AHMBA AHXPO AIJHB AJEEA AKWXX ALMA_UNASSIGNED_HOLDINGS ALUQC APIBT APWMN ATGXG AXUDD BAWUL BAYMD BCRHZ BEYMZ BHONS BTRTY BVRKM CDBKE CITATION CS3 CZ4 DAKXR DILTD D~K EBS EDH EE~ F5P F9B FLUFQ FOEOM FOTVD FQBLK GAUVT GJXCC H5~ HAR HW0 HZ~ IOX J21 KAQDR KOP KQ8 KSI KSN M-Z N9A NGC NOMLY NOYVH O9- OAWHX OCZFY ODMLO OJQWA OJZSN OK1 OPAEJ OWPYF P2P PAFKI PEELM PQQKQ Q1. Q5Y R44 RD5 ROL ROX ROZ RUSNO RW1 RXO TJX TR2 WH7 WOQ X7H YAYTL YKOAZ YXANX ZKX ~91 |
ID | FETCH-crossref_primary_10_1093_occmed_kqae023_09273 |
ISSN | 0962-7480 |
IngestDate | Thu Sep 12 21:06:31 EDT 2024 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | Supplement_1 |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-crossref_primary_10_1093_occmed_kqae023_09273 |
ParticipantIDs | crossref_primary_10_1093_occmed_kqae023_0927 |
PublicationCentury | 2000 |
PublicationDate | 2024-07-05 |
PublicationDateYYYYMMDD | 2024-07-05 |
PublicationDate_xml | – month: 07 year: 2024 text: 2024-07-05 day: 05 |
PublicationDecade | 2020 |
PublicationTitle | Occupational medicine (Oxford) |
PublicationYear | 2024 |
SSID | ssj0016482 |
Score | 4.8764186 |
Snippet | Abstract Introduction We report on experiments with Large Language Models (LLM) to generate synthetic data around occupationally-related medical conditions in... |
SourceID | crossref |
SourceType | Aggregation Database |
Title | P-320 WHAT DOES A LARGE LANGUAGE MODEL KNOW ABOUT THE PREVALENCE OF OCCUPATIONALLY-RELATED MEDICAL CONDITIONS? EXPERIMENTS WITH SYNTHETIC AND REAL OCCUPATIONAL MEDICINE DATA |
Volume | 74 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnZ3BTttAEIZXKZUQF0QpFdBSzaH04hqC107iEzKJTSiJHRKnhJOVOMsFEVoUJMTT8AK8IzO7trOpItT0YkWWNUp2vqxnZmf_ZewbHwruOkPLHNXssWkLzFNqqZOatZpTxmzgmo-vqQ7ZDivNvv1z4AxKpWeta-lhOjpInxbuK_kfr-I99Cvtkl3Cs4VRvIGf0b94RQ_j9Z983DG5VTYum15sNCK_h3_yltc99fEanvZpqagdNfyWcR5Gl4Z3EvVj2eCDg_7La8nSUhQYUb3e72SSuK0rk47QwMmskH2sR2HjTG413ueB4Q86mPiR_n8PU_-4afSuQjQZoytJpqrrUxuRZlCZOQt9o-HFnh4Iz6kb5wv8Uvv0UbXbFwWK7AyvfGNRriOhlyssW7a2OnrdsWKRgKlajBFq1sU3pImZpqNPy-rwngw_ecSprJcmRwsnfiWKdZemt1QjDm7-DAVGIwdlVykPzAtt__UCLNoS1YI8T5SZJDOSkJF37L2FUxk1DZ5fzNapKrY8jqz4UbmulcsPlZFD_ZtosY8WxMQbbD3LPsBTKH1gJTHZZKvtbPg32b7uFoiV4gR8h-6cnvtH9iLJAyIPiDzwQJIHOXkgyQMiDyR5gJjAjDyIAlhMHmTkwYy8Y9C4A-IOCu4AuQPibs4c5NwBcbfFeODH9aaZD0vyW2msJG-4gn9iK5O7idhmgOFrVVTLaSp14-yq66RiPBwLyx3ZHKPTHfZjGcu7yz3-ma3NCP_CVqb3D2IPQ9Tp6Kuk5BXseH6D |
link.rule.ids | 315,783,787,27938,27939 |
linkProvider | Colorado Alliance of Research Libraries |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=P-320+WHAT+DOES+A+LARGE+LANGUAGE+MODEL+KNOW+ABOUT+THE+PREVALENCE+OF+OCCUPATIONALLY-RELATED+MEDICAL+CONDITIONS%3F+EXPERIMENTS+WITH+SYNTHETIC+AND+REAL+OCCUPATIONAL+MEDICINE+DATA&rft.jtitle=Occupational+medicine+%28Oxford%29&rft.au=Johnson%2C+Mark+William&rft.date=2024-07-05&rft.issn=0962-7480&rft.eissn=1471-8405&rft.volume=74&rft.issue=Supplement_1&rft_id=info:doi/10.1093%2Foccmed%2Fkqae023.0927&rft.externalDBID=n%2Fa&rft.externalDocID=10_1093_occmed_kqae023_0927 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0962-7480&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0962-7480&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0962-7480&client=summon |