Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study

The increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about their accuracy. This study aims to comparatively evaluate the answers provided by 4 LLMs, namely Bard (Google LLC), ChatGPT-3.5 and ChatGPT-4 (...

Full description

Saved in:
Bibliographic Details
Published inJournal of medical Internet research Vol. 25; no. 1; p. e51580
Main Authors Giannakopoulos, Kostis, Kavadella, Argyro, Aaqel Salim, Anas, Stamatopoulos, Vassilis, Kaklamanos, Eleftherios G
Format Journal Article
LanguageEnglish
Published Canada Gunther Eysenbach MD MPH, Associate Professor 28.12.2023
JMIR Publications
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about their accuracy. This study aims to comparatively evaluate the answers provided by 4 LLMs, namely Bard (Google LLC), ChatGPT-3.5 and ChatGPT-4 (OpenAI), and Bing Chat (Microsoft Corp), to clinically relevant questions from the field of dentistry. The LLMs were queried with 20 open-type, clinical dentistry-related questions from different disciplines, developed by the respective faculty of the School of Dentistry, European University Cyprus. The LLMs' answers were graded 0 (minimum) to 10 (maximum) points against strong, traditionally collected scientific evidence, such as guidelines and consensus statements, using a rubric, as if they were examination questions posed to students, by 2 experienced faculty members. The scores were statistically compared to identify the best-performing model using the Friedman and Wilcoxon tests. Moreover, the evaluators were asked to provide a qualitative evaluation of the comprehensiveness, scientific accuracy, clarity, and relevance of the LLMs' answers. Overall, no statistically significant difference was detected between the scores given by the 2 evaluators; therefore, an average score was computed for every LLM. Although ChatGPT-4 statistically outperformed ChatGPT-3.5 (P=.008), Bing Chat (P=.049), and Bard (P=.045), all models occasionally exhibited inaccuracies, generality, outdated content, and a lack of source references. The evaluators noted instances where the LLMs delivered irrelevant information, vague answers, or information that was not fully accurate. This study demonstrates that although LLMs hold promising potential as an aid in the implementation of evidence-based dentistry, their current limitations can lead to potentially harmful health care decisions if not used judiciously. Therefore, these tools should not replace the dentist's critical thinking and in-depth understanding of the subject matter. Further research, clinical validation, and model improvements are necessary for these tools to be fully integrated into dental practice. Dental practitioners must be aware of the limitations of LLMs, as their imprudent use could potentially impact patient care. Regulatory measures should be established to oversee the use of these evolving technologies.
AbstractList BackgroundThe increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about their accuracy. ObjectiveThis study aims to comparatively evaluate the answers provided by 4 LLMs, namely Bard (Google LLC), ChatGPT-3.5 and ChatGPT-4 (OpenAI), and Bing Chat (Microsoft Corp), to clinically relevant questions from the field of dentistry. MethodsThe LLMs were queried with 20 open-type, clinical dentistry–related questions from different disciplines, developed by the respective faculty of the School of Dentistry, European University Cyprus. The LLMs’ answers were graded 0 (minimum) to 10 (maximum) points against strong, traditionally collected scientific evidence, such as guidelines and consensus statements, using a rubric, as if they were examination questions posed to students, by 2 experienced faculty members. The scores were statistically compared to identify the best-performing model using the Friedman and Wilcoxon tests. Moreover, the evaluators were asked to provide a qualitative evaluation of the comprehensiveness, scientific accuracy, clarity, and relevance of the LLMs’ answers. ResultsOverall, no statistically significant difference was detected between the scores given by the 2 evaluators; therefore, an average score was computed for every LLM. Although ChatGPT-4 statistically outperformed ChatGPT-3.5 (P=.008), Bing Chat (P=.049), and Bard (P=.045), all models occasionally exhibited inaccuracies, generality, outdated content, and a lack of source references. The evaluators noted instances where the LLMs delivered irrelevant information, vague answers, or information that was not fully accurate. ConclusionsThis study demonstrates that although LLMs hold promising potential as an aid in the implementation of evidence-based dentistry, their current limitations can lead to potentially harmful health care decisions if not used judiciously. Therefore, these tools should not replace the dentist’s critical thinking and in-depth understanding of the subject matter. Further research, clinical validation, and model improvements are necessary for these tools to be fully integrated into dental practice. Dental practitioners must be aware of the limitations of LLMs, as their imprudent use could potentially impact patient care. Regulatory measures should be established to oversee the use of these evolving technologies.
Background:The increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about their accuracy.Objective:This study aims to comparatively evaluate the answers provided by 4 LLMs, namely Bard (Google LLC), ChatGPT-3.5 and ChatGPT-4 (OpenAI), and Bing Chat (Microsoft Corp), to clinically relevant questions from the field of dentistry.Methods:The LLMs were queried with 20 open-type, clinical dentistry–related questions from different disciplines, developed by the respective faculty of the School of Dentistry, European University Cyprus. The LLMs’ answers were graded 0 (minimum) to 10 (maximum) points against strong, traditionally collected scientific evidence, such as guidelines and consensus statements, using a rubric, as if they were examination questions posed to students, by 2 experienced faculty members. The scores were statistically compared to identify the best-performing model using the Friedman and Wilcoxon tests. Moreover, the evaluators were asked to provide a qualitative evaluation of the comprehensiveness, scientific accuracy, clarity, and relevance of the LLMs’ answers.Results:Overall, no statistically significant difference was detected between the scores given by the 2 evaluators; therefore, an average score was computed for every LLM. Although ChatGPT-4 statistically outperformed ChatGPT-3.5 (P=.008), Bing Chat (P=.049), and Bard (P=.045), all models occasionally exhibited inaccuracies, generality, outdated content, and a lack of source references. The evaluators noted instances where the LLMs delivered irrelevant information, vague answers, or information that was not fully accurate.Conclusions:This study demonstrates that although LLMs hold promising potential as an aid in the implementation of evidence-based dentistry, their current limitations can lead to potentially harmful health care decisions if not used judiciously. Therefore, these tools should not replace the dentist’s critical thinking and in-depth understanding of the subject matter. Further research, clinical validation, and model improvements are necessary for these tools to be fully integrated into dental practice. Dental practitioners must be aware of the limitations of LLMs, as their imprudent use could potentially impact patient care. Regulatory measures should be established to oversee the use of these evolving technologies.
The increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about their accuracy. This study aims to comparatively evaluate the answers provided by 4 LLMs, namely Bard (Google LLC), ChatGPT-3.5 and ChatGPT-4 (OpenAI), and Bing Chat (Microsoft Corp), to clinically relevant questions from the field of dentistry. The LLMs were queried with 20 open-type, clinical dentistry-related questions from different disciplines, developed by the respective faculty of the School of Dentistry, European University Cyprus. The LLMs' answers were graded 0 (minimum) to 10 (maximum) points against strong, traditionally collected scientific evidence, such as guidelines and consensus statements, using a rubric, as if they were examination questions posed to students, by 2 experienced faculty members. The scores were statistically compared to identify the best-performing model using the Friedman and Wilcoxon tests. Moreover, the evaluators were asked to provide a qualitative evaluation of the comprehensiveness, scientific accuracy, clarity, and relevance of the LLMs' answers. Overall, no statistically significant difference was detected between the scores given by the 2 evaluators; therefore, an average score was computed for every LLM. Although ChatGPT-4 statistically outperformed ChatGPT-3.5 (P=.008), Bing Chat (P=.049), and Bard (P=.045), all models occasionally exhibited inaccuracies, generality, outdated content, and a lack of source references. The evaluators noted instances where the LLMs delivered irrelevant information, vague answers, or information that was not fully accurate. This study demonstrates that although LLMs hold promising potential as an aid in the implementation of evidence-based dentistry, their current limitations can lead to potentially harmful health care decisions if not used judiciously. Therefore, these tools should not replace the dentist's critical thinking and in-depth understanding of the subject matter. Further research, clinical validation, and model improvements are necessary for these tools to be fully integrated into dental practice. Dental practitioners must be aware of the limitations of LLMs, as their imprudent use could potentially impact patient care. Regulatory measures should be established to oversee the use of these evolving technologies.
The increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about their accuracy.BACKGROUNDThe increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about their accuracy.This study aims to comparatively evaluate the answers provided by 4 LLMs, namely Bard (Google LLC), ChatGPT-3.5 and ChatGPT-4 (OpenAI), and Bing Chat (Microsoft Corp), to clinically relevant questions from the field of dentistry.OBJECTIVEThis study aims to comparatively evaluate the answers provided by 4 LLMs, namely Bard (Google LLC), ChatGPT-3.5 and ChatGPT-4 (OpenAI), and Bing Chat (Microsoft Corp), to clinically relevant questions from the field of dentistry.The LLMs were queried with 20 open-type, clinical dentistry-related questions from different disciplines, developed by the respective faculty of the School of Dentistry, European University Cyprus. The LLMs' answers were graded 0 (minimum) to 10 (maximum) points against strong, traditionally collected scientific evidence, such as guidelines and consensus statements, using a rubric, as if they were examination questions posed to students, by 2 experienced faculty members. The scores were statistically compared to identify the best-performing model using the Friedman and Wilcoxon tests. Moreover, the evaluators were asked to provide a qualitative evaluation of the comprehensiveness, scientific accuracy, clarity, and relevance of the LLMs' answers.METHODSThe LLMs were queried with 20 open-type, clinical dentistry-related questions from different disciplines, developed by the respective faculty of the School of Dentistry, European University Cyprus. The LLMs' answers were graded 0 (minimum) to 10 (maximum) points against strong, traditionally collected scientific evidence, such as guidelines and consensus statements, using a rubric, as if they were examination questions posed to students, by 2 experienced faculty members. The scores were statistically compared to identify the best-performing model using the Friedman and Wilcoxon tests. Moreover, the evaluators were asked to provide a qualitative evaluation of the comprehensiveness, scientific accuracy, clarity, and relevance of the LLMs' answers.Overall, no statistically significant difference was detected between the scores given by the 2 evaluators; therefore, an average score was computed for every LLM. Although ChatGPT-4 statistically outperformed ChatGPT-3.5 (P=.008), Bing Chat (P=.049), and Bard (P=.045), all models occasionally exhibited inaccuracies, generality, outdated content, and a lack of source references. The evaluators noted instances where the LLMs delivered irrelevant information, vague answers, or information that was not fully accurate.RESULTSOverall, no statistically significant difference was detected between the scores given by the 2 evaluators; therefore, an average score was computed for every LLM. Although ChatGPT-4 statistically outperformed ChatGPT-3.5 (P=.008), Bing Chat (P=.049), and Bard (P=.045), all models occasionally exhibited inaccuracies, generality, outdated content, and a lack of source references. The evaluators noted instances where the LLMs delivered irrelevant information, vague answers, or information that was not fully accurate.This study demonstrates that although LLMs hold promising potential as an aid in the implementation of evidence-based dentistry, their current limitations can lead to potentially harmful health care decisions if not used judiciously. Therefore, these tools should not replace the dentist's critical thinking and in-depth understanding of the subject matter. Further research, clinical validation, and model improvements are necessary for these tools to be fully integrated into dental practice. Dental practitioners must be aware of the limitations of LLMs, as their imprudent use could potentially impact patient care. Regulatory measures should be established to oversee the use of these evolving technologies.CONCLUSIONSThis study demonstrates that although LLMs hold promising potential as an aid in the implementation of evidence-based dentistry, their current limitations can lead to potentially harmful health care decisions if not used judiciously. Therefore, these tools should not replace the dentist's critical thinking and in-depth understanding of the subject matter. Further research, clinical validation, and model improvements are necessary for these tools to be fully integrated into dental practice. Dental practitioners must be aware of the limitations of LLMs, as their imprudent use could potentially impact patient care. Regulatory measures should be established to oversee the use of these evolving technologies.
Author Kaklamanos, Eleftherios G
Aaqel Salim, Anas
Stamatopoulos, Vassilis
Giannakopoulos, Kostis
Kavadella, Argyro
Author_xml – sequence: 1
  givenname: Kostis
  orcidid: 0000-0001-7008-7306
  surname: Giannakopoulos
  fullname: Giannakopoulos, Kostis
– sequence: 2
  givenname: Argyro
  orcidid: 0009-0003-0560-8373
  surname: Kavadella
  fullname: Kavadella, Argyro
– sequence: 3
  givenname: Anas
  orcidid: 0000-0002-7731-6666
  surname: Aaqel Salim
  fullname: Aaqel Salim, Anas
– sequence: 4
  givenname: Vassilis
  orcidid: 0000-0002-9044-796X
  surname: Stamatopoulos
  fullname: Stamatopoulos, Vassilis
– sequence: 5
  givenname: Eleftherios G
  orcidid: 0000-0002-0513-5110
  surname: Kaklamanos
  fullname: Kaklamanos, Eleftherios G
BackLink https://www.ncbi.nlm.nih.gov/pubmed/38009003$$D View this record in MEDLINE/PubMed
BookMark eNpdkt2K2zAQhU3Z0v3pvkIRlEKhm1ayZVvq3SZN00BCF3Z7bcbWOFFwJK8kh-bV-nS1k3QpuZGGmY8zM8y5ji6MNRhFt4x-jpnMvqQsFfRVdMV4IkZC5Oziv_gyuvZ-Q2lMuWRvostEUCopTa6iP9MdNB0EbQ2xNQlrJA_oauu2YCocUjM06Hpgh-R-ThbgVti_ZtVBHyytwsaTyRrC7OHpjsysXTVIxuDUHQGjyFJXznpbBzLWZnUAiTbksWtb68KQmu60wr7XaAweFfmGJmgf3P4rmdhtC6fWS_27Ly4xrK3y5DF0av82el1D4_H29N9Ev75PnyY_Roufs_nkfjGqeCbCqBYihbLEiseizlDIChMqqxh5DlDJNJaxkFmZpzXlgucK0poLpjLGZQacseQmmh91lYVN0Tq9BbcvLOjikLBuVUC_StVgkSqhVIapGNqxrJSqKqXI85qDEmWS9Vofj1qts88d-lBsta-wacCg7XzRj8KTjMV0QN-foRvbOdNvWsSS5Vks83ig3p2ortyiehnv34V74MMRGO7gHdYvCKPF4Jzi4Jye-3TGVTocfBEc6OaM_gvUOsKt
CitedBy_id crossref_primary_10_37990_medr_1446396
crossref_primary_10_1111_edt_13041
crossref_primary_10_1111_edt_13042
crossref_primary_10_37349_edht_2024_00032
crossref_primary_10_1093_jamia_ocae086
crossref_primary_10_1007_s40368_025_01012_x
crossref_primary_10_1177_14604582241304679
crossref_primary_10_1186_s12903_025_05732_w
crossref_primary_10_3145_thinkepi_2024_e18a04
crossref_primary_10_1186_s12909_024_05630_9
crossref_primary_10_1016_j_pec_2025_108672
crossref_primary_10_2196_54580
crossref_primary_10_3389_fdmed_2024_1456208
crossref_primary_10_34248_bsengineering_1544165
crossref_primary_10_1038_s41746_024_01258_7
crossref_primary_10_1111_jcpe_14101
crossref_primary_10_3390_diagnostics14161779
crossref_primary_10_1111_jep_14084
crossref_primary_10_1177_10497315241313071
crossref_primary_10_3390_bioengineering11111145
crossref_primary_10_1016_j_ijmedinf_2025_105787
crossref_primary_10_1111_odi_15082
crossref_primary_10_4239_wjd_v16_i3_98408
crossref_primary_10_1002_lary_31434
crossref_primary_10_1038_s41598_025_94576_z
crossref_primary_10_1002_hsr2_70300
crossref_primary_10_3390_app142310802
crossref_primary_10_1111_edt_13020
crossref_primary_10_1002_jdd_13882
crossref_primary_10_3389_fmed_2024_1406842
crossref_primary_10_3389_frai_2024_1379297
crossref_primary_10_1111_jcal_70004
crossref_primary_10_1186_s12911_024_02757_z
crossref_primary_10_1016_j_ijmedinf_2024_105474
crossref_primary_10_1111_jnu_13036
crossref_primary_10_1016_j_resmer_2024_101091
crossref_primary_10_1159_000538538
crossref_primary_10_1007_s10067_024_07154_5
crossref_primary_10_1016_j_prosdent_2025_02_008
crossref_primary_10_1002_ca_24244
crossref_primary_10_1097_JS9_0000000000001775
crossref_primary_10_2196_59258
crossref_primary_10_2196_60083
crossref_primary_10_1111_eje_13069
crossref_primary_10_2196_55933
crossref_primary_10_1253_circrep_CR_24_0019
crossref_primary_10_1016_j_prosdent_2024_10_020
crossref_primary_10_2147_CCID_S478309
crossref_primary_10_2196_52746
crossref_primary_10_2196_22769
crossref_primary_10_1111_edt_12999
crossref_primary_10_1111_jerd_13447
Cites_doi 10.48550/arXiv.2212.14882
10.1038/s41415-023-5926-2
10.1155/2022/1410448
10.1016/j.diii.2023.02.006
10.1259/dmfr.20190107
10.1111/jerd.13046
10.1101/2023.02.02.23285399
10.7759/cureus.38317
10.1016/j.dsx.2023.102744
10.1001/jamahealthforum.2023.1938
10.3390/healthcare11060887
10.1016/j.legalmed.2020.101826
10.1016/j.jebdp.2005.12.011
10.1016/j.adaj.2022.07.012
10.2196/48568
10.1038/s41368-023-00239-y
10.1002/jdd.13010
10.1021/acs.jchemed.3c00087
10.1016/S2589-7500(23)00021-3
10.1016/j.oooo.2023.09.010
10.1016/j.jdent.2021.103849
10.2196/20346
10.37074/jalt.2023.6.1.23
10.1016/j.ijom.2023.09.005
10.1038/s41746-021-00464-x
10.3390/jcm9113579
10.1016/j.jormas.2023.101471
10.1016/j.adaj.2023.06.003
10.1016/j.jmir.2020.08.011
10.1111/jerd.12844
10.1038/sj.bdj.4801062
10.1016/S2589-7500(23)00023-7
10.1016/j.jclinepi.2010.07.015
10.1038/s41746-023-00819-6
10.7759/cureus.42133
10.1038/s41415-023-5928-0
10.3390/healthcare11101480
ContentType Journal Article
Copyright Kostis Giannakopoulos, Argyro Kavadella, Anas Aaqel Salim, Vassilis Stamatopoulos, Eleftherios G Kaklamanos. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 28.12.2023.
2023. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: Kostis Giannakopoulos, Argyro Kavadella, Anas Aaqel Salim, Vassilis Stamatopoulos, Eleftherios G Kaklamanos. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 28.12.2023.
– notice: 2023. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
3V.
7QJ
7RV
7X7
7XB
8FI
8FJ
8FK
ABUWG
AFKRA
ALSLI
AZQEC
BENPR
CCPQU
CNYFK
DWQXO
E3H
F2A
FYUFA
GHDGH
K9.
KB0
M0S
M1O
NAPCQ
PHGZM
PHGZT
PIMPY
PKEHL
PPXIY
PQEST
PQQKQ
PQUKI
PRQQA
7X8
DOA
DOI 10.2196/51580
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
ProQuest Central (Corporate)
Applied Social Sciences Index & Abstracts (ASSIA)
Nursing & Allied Health Database
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
Social Science Premium Collection
ProQuest Central Essentials
ProQuest Central
ProQuest One Community College
Library & Information Science Collection
ProQuest Central Korea
Library & Information Sciences Abstracts (LISA)
Library & Information Science Abstracts (LISA)
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Health & Medical Complete (Alumni)
Nursing & Allied Health Database (Alumni Edition)
ProQuest Health & Medical Collection
Library Science Database
Nursing & Allied Health Premium
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest One Social Sciences
MEDLINE - Academic
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
ProQuest One Academic Middle East (New)
Library and Information Science Abstracts (LISA)
ProQuest Central Essentials
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
Applied Social Sciences Index and Abstracts (ASSIA)
ProQuest Central
ProQuest Library Science
Health Research Premium Collection
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Library & Information Science Collection
ProQuest Central (New)
Social Science Premium Collection
ProQuest One Social Sciences
ProQuest One Academic Eastern Edition
ProQuest Nursing & Allied Health Source
ProQuest Hospital Collection
Health Research Premium Collection (Alumni)
ProQuest Hospital Collection (Alumni)
Nursing & Allied Health Premium
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
ProQuest Nursing & Allied Health Source (Alumni)
ProQuest One Academic
ProQuest One Academic (New)
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList
Publicly Available Content Database
MEDLINE
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: DOA
  name: Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 4
  dbid: BENPR
  name: ProQuest Central
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Library & Information Science
Dentistry
EISSN 1438-8871
ExternalDocumentID oai_doaj_org_article_5d8dd6e58bec416b9dcb9877f4ad8b36
38009003
10_2196_51580
Genre Research Support, Non-U.S. Gov't
Journal Article
GroupedDBID ---
.4I
.DC
29L
2WC
36B
53G
5GY
5VS
77K
7RV
7X7
8FI
8FJ
AAFWJ
AAKPC
AAWTL
AAYXX
ABDBF
ABIVO
ABUWG
ACGFO
ADBBV
AEGXH
AENEX
AFKRA
AFPKN
AIAGR
ALIPV
ALMA_UNASSIGNED_HOLDINGS
ALSLI
AOIJS
BAWUL
BCNDV
BENPR
CCPQU
CITATION
CNYFK
CS3
DIK
DU5
DWQXO
E3Z
EAP
EBD
EBS
EJD
ELW
EMB
EMOBN
ESX
F5P
FRP
FYUFA
GROUPED_DOAJ
GX1
HMCUK
HYE
IAO
ICO
IEA
IHR
INH
ISN
ITC
KQ8
M1O
M48
NAPCQ
OK1
OVT
P2P
PGMZT
PHGZM
PHGZT
PIMPY
PQQKQ
RNS
RPM
SJN
SV3
TR2
UKHRP
XSB
CGR
CUY
CVF
ECM
EIF
NPM
3V.
7QJ
7XB
8FK
ACUHS
AZQEC
E3H
F2A
K9.
PKEHL
PPXIY
PQEST
PQUKI
PRQQA
7X8
PUEGO
ID FETCH-LOGICAL-c468t-f885abbec428f6e89ce309c2e47aac95292896b75f04847da5f481d61496a4113
IEDL.DBID M48
ISSN 1438-8871
IngestDate Wed Aug 27 01:26:08 EDT 2025
Fri Jul 11 15:38:31 EDT 2025
Fri Jul 25 10:10:39 EDT 2025
Wed Feb 19 02:08:38 EST 2025
Thu Apr 24 23:02:06 EDT 2025
Tue Jul 01 02:06:12 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords generative pretrained transformers
evidence-based dentistry
dental practice
dental professional
AI
clinical practice
artificial intelligence
clinical practice guidelines
Microsoft Bing
ChatGPT
large language models
clinical decision-making
Google Bard
Language English
License Kostis Giannakopoulos, Argyro Kavadella, Anas Aaqel Salim, Vassilis Stamatopoulos, Eleftherios G Kaklamanos. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 28.12.2023.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c468t-f885abbec428f6e89ce309c2e47aac95292896b75f04847da5f481d61496a4113
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0001-7008-7306
0000-0002-9044-796X
0009-0003-0560-8373
0000-0002-7731-6666
0000-0002-0513-5110
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.2196/51580
PMID 38009003
PQID 2917629726
PQPubID 2033121
ParticipantIDs doaj_primary_oai_doaj_org_article_5d8dd6e58bec416b9dcb9877f4ad8b36
proquest_miscellaneous_2894361206
proquest_journals_2917629726
pubmed_primary_38009003
crossref_primary_10_2196_51580
crossref_citationtrail_10_2196_51580
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2023-12-28
PublicationDateYYYYMMDD 2023-12-28
PublicationDate_xml – month: 12
  year: 2023
  text: 2023-12-28
  day: 28
PublicationDecade 2020
PublicationPlace Canada
PublicationPlace_xml – name: Canada
– name: Toronto
PublicationTitle Journal of medical Internet research
PublicationTitleAlternate J Med Internet Res
PublicationYear 2023
Publisher Gunther Eysenbach MD MPH, Associate Professor
JMIR Publications
Publisher_xml – name: Gunther Eysenbach MD MPH, Associate Professor
– name: JMIR Publications
References Eggmann, F (ref12) 2023; 44
ref13
ref57
ref56
ref15
ref59
ref14
ref58
ref53
ref52
ref11
ref55
ref10
ref54
ref17
ref16
ref19
ref18
ref51
ref50
ref46
ref45
ref48
ref47
ref42
ref41
ref44
ref43
ref49
ref8
ref7
ref9
ref4
ref3
ref6
ref5
ref40
ref35
ref34
ref37
ref36
ref31
ref30
ref33
ref32
ref2
ref1
ref39
ref38
ref24
ref23
ref26
ref25
ref20
ref22
ref21
ref28
ref27
ref29
References_xml – ident: ref37
– ident: ref1
– ident: ref42
  doi: 10.48550/arXiv.2212.14882
– ident: ref47
  doi: 10.1038/s41415-023-5926-2
– ident: ref24
– ident: ref35
  doi: 10.1155/2022/1410448
– ident: ref27
– ident: ref57
  doi: 10.1016/j.diii.2023.02.006
– ident: ref4
  doi: 10.1259/dmfr.20190107
– ident: ref30
  doi: 10.1111/jerd.13046
– ident: ref9
– ident: ref22
  doi: 10.1101/2023.02.02.23285399
– ident: ref53
– ident: ref34
– ident: ref20
  doi: 10.7759/cureus.38317
– volume: 44
  start-page: 220
  issue: 4
  year: 2023
  ident: ref12
  publication-title: Compend Contin Educ Dent
– ident: ref2
– ident: ref13
  doi: 10.1016/j.dsx.2023.102744
– ident: ref49
  doi: 10.1001/jamahealthforum.2023.1938
– ident: ref14
  doi: 10.3390/healthcare11060887
– ident: ref5
  doi: 10.1016/j.legalmed.2020.101826
– ident: ref43
  doi: 10.1016/j.jebdp.2005.12.011
– ident: ref50
– ident: ref26
– ident: ref33
  doi: 10.1016/j.adaj.2022.07.012
– ident: ref17
  doi: 10.2196/48568
– ident: ref56
  doi: 10.1038/s41368-023-00239-y
– ident: ref7
  doi: 10.1002/jdd.13010
– ident: ref15
  doi: 10.1021/acs.jchemed.3c00087
– ident: ref54
– ident: ref16
– ident: ref41
  doi: 10.1016/S2589-7500(23)00021-3
– ident: ref45
  doi: 10.1016/j.oooo.2023.09.010
– ident: ref11
  doi: 10.1016/j.jdent.2021.103849
– ident: ref58
– ident: ref21
  doi: 10.2196/20346
– ident: ref36
  doi: 10.37074/jalt.2023.6.1.23
– ident: ref29
– ident: ref48
  doi: 10.1016/j.ijom.2023.09.005
– ident: ref28
  doi: 10.1038/s41746-021-00464-x
– ident: ref25
– ident: ref6
  doi: 10.3390/jcm9113579
– ident: ref51
– ident: ref44
  doi: 10.1016/j.jormas.2023.101471
– ident: ref46
  doi: 10.1016/j.adaj.2023.06.003
– ident: ref19
– ident: ref32
– ident: ref55
  doi: 10.1016/j.jmir.2020.08.011
– ident: ref3
  doi: 10.1111/jerd.12844
– ident: ref10
  doi: 10.1038/sj.bdj.4801062
– ident: ref23
  doi: 10.1016/S2589-7500(23)00023-7
– ident: ref31
  doi: 10.1016/j.jclinepi.2010.07.015
– ident: ref40
  doi: 10.1038/s41746-023-00819-6
– ident: ref38
  doi: 10.7759/cureus.42133
– ident: ref52
– ident: ref59
  doi: 10.1038/s41415-023-5928-0
– ident: ref8
– ident: ref18
– ident: ref39
  doi: 10.3390/healthcare11101480
SSID ssj0020491
Score 2.639428
Snippet The increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about...
Background:The increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises...
BackgroundThe increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises...
SourceID doaj
proquest
pubmed
crossref
SourceType Open Website
Aggregation Database
Index Database
Enrichment Source
StartPage e51580
SubjectTerms Accuracy
Answers
Artificial Intelligence
Bans
Chatbots
Clinical decision making
Clinical medicine
Clinical research
Critical thinking
Decision making
Dentistry
Dentists
Evidence-Based Dentistry
Generative artificial intelligence
Health care
Humans
Internet access
Language
Large language models
Maxillofacial surgery
Medical libraries
Medical screening
Natural language
Neural networks
Oral hygiene
Orthodontics
Patients
Periodontics
Personal information
Privacy
Professional Role
Professionals
Prosthodontics
Scientific evidence
Search Engine
Textbooks
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1La9wwEBYlh1AooUkf2TQJUwg5xWRXlmUpt-w2j5ZsySGB3IyeaWGxQ3YX2r_WX9cZW-uEQumll8VI47XwjDTfWKNvGDuwQhnybFk00mci2jyzeTAZ9yEoE6wyhgLF6Vd5eSu-3BV3z0p9UU5YRw_cvbjjwivvZSgUPgzBg9XeWYyTyyiMVzZvybbR562CqRRqIe4drbNXlOiMJnaMTpt4H595npag_--osvUu56_ZRoKFcNoNZ5O9CPUW20uHCuAQ0qkheouQpuMWW5-mjfE37NdZz9oNTQREdXD9dCSAmjp-aVrc4PQzXFECOP52HyuBKqLN5jD5ZhYX1zdHcNE097MAYzSfIzC1hynl7c1xyYYx-rpWEL7jUJYPhN-paVWeNBujX_TwibKQqJDcCUyeCMbxf35g57QtWz0HSmL8-Zbdnp_dTC6zVJYhc0KqRRaVKowldXAVZVDahXyoHQ-iNMbpgmsM4qQti4irgyi9KaJAVIw4QEsjRqP8HVurmzpsM5BuaDFi0x7vEEMnlaXd9ODxuoy6dAN2sFJZ5RJnOZXOmFUYu5Bmq1azA7bfiz10JB1_CoxJ330ncWq3DWhpVbK06l-WNmC7K2up0kSfVxzDXcl1ybH7Y9-NU5T2XUwdmiXKEMc9IskhyrzvrKwfSY6AnT4m7_yPEX5gLzliMMq24WqXrS0el2EPMdPC7rfT4zc1tBg9
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: Library Science Database
  dbid: M1O
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9NAEF5BkQoS4hFegbYapIpT3cb2er3LBTWhDxCBHlqpN2tfLlUjOzSJBPw0fh0z9sZBleDAxYp219Za-Tzzze7sN4xtGy41ebao1MJFvDRpZFKvo8R5L7U3UmsKFMefxfEZ_3ienYcFt1lIq1zaxMZQu9rSGvlegnGFSFSeiHfTbxFVjaLd1VBC4za7g0SZ04c5jr90ARey33id3ad0ZwTaHrpuUn_8w_80Mv1_55aNjzl8yIrl7NrUkqvdxdzs2p83hBv_f_qP2INAP2G_xctjdstXPXb3PaUMUdW3HtsMBxngDYSTSvTPQTABPbY-DpvxT9ivg04pHOoSkEnCyeoYAjW1mtZkUGH_A3yipHO8tgukQFXYJjMYfdXzo5PTHTiq64uJhyFCdgd05WBMuYIzdBMwRP_aDIRLnMpiSjEDNS1LokZD9MUOutd4C6OVqDk-5zt2jptS2TOgxMkfT9nZ4cHp6DgKpSAiy4WcR6WUmTaIN4yWSuGlsj4dKJt4nmttVZYoDByFybMSLRLPnc5KjkwcuYcSmsdx-oytVXXlXzAQdmAwSlQO7-ADK6ShHXzv8Hdeqtz22fYSIIUNOulUrmNSYLxEOCoaHPXZVjds2gqD3BwwJHR1naTj3TTU1xdFMAtF5qRzwmeSXi0WRjlrlMzzkmsnTSr6bGMJpyIYl1mxwlKfve660SzQXo-ufL3AMaSrj-x1gGOet5juZpJikEAL2C___fBX7F6CjI5ydxK5wdbm1wu_iQxsbraaz-w394A1Pg
  priority: 102
  providerName: ProQuest
Title Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study
URI https://www.ncbi.nlm.nih.gov/pubmed/38009003
https://www.proquest.com/docview/2917629726
https://www.proquest.com/docview/2894361206
https://doaj.org/article/5d8dd6e58bec416b9dcb9877f4ad8b36
Volume 25
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1ta9RAEF60hSqI6Pl22h4jFD81ekk2uxtBpDmvrWLqIT3ot7Cb3bSFI6n3Au1f89c5k-RSK-qXEHYnIdmZ2Xlmd3aGsV3DlSbL5hVaWI8XJvRM6LQXWOeUdkZpTY5ieiyOpvzLafRbNGE7gIu_unZUT2o6n729-nH9ERX-A4UxowC9Q5Os0GvfRGMkqYhByruNhAABsL_FHtwivWWC6kz9_4aXtZk5eMQetvgQ9huGPmZ3XNlj9z5RTA-VZeuxnfakAbyB9igRDS20OtpjW2m7W_6E_Rx3qbyhKgChHkxuzglQU5N0mmY82P8MXykqHK_NCiZQmbTZAkbnenk4OdmDw6o6mzlIUKb2QJcWUgrmW-A8DgkawJoQLvBTVpc0ntS0rlnqJWgsLXS_8R5GN1nH8T1X2JnWtawXQJGN10_Z9GB8Mjry2loNXs6FWnqFUpE2KBDozhTCqTh34TDOA8el1nkcBTF6dsLIqMApg0uro4IjVEZwEAvNfT98xjbKqnQvGIh8aNCNiy0-wYe5UIa22J3Fe1nEMu-z3TX7srxNZE71NGYZOjTE5azmcp8NOrLLJnPHnwQJ8b7rpETbdUM1P8tavc0iq6wVLlL0a74wsc1NrKQsuLbKhKLPtteSk62FNwvQBxZBLAPsft11o97SZowuXbVCGkp8j_ByiDTPG4nrviREFE8rzC____JX7H6AkIuCawK1zTaW85XbQYi0NAN2V57KAdtMxseT74N6oQGvqf9tUKvHLyyUFv4
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1fa9swED-6FNrCGFv2L1vbadDtqaaJLcvyYIwmTZusSQgjhb65kiW3gxBnTcLWL7UPsE-3O__LGGxvfTFGOguZO939TjrdARxoLhVZNidRwjg80Z6jPasc11grldVSKXIUhyPRu-CfL_3LDfhZ3oWhsMpSJ2aK2qQx7ZEfuehXCDcMXPFp_s2hqlF0ulqW0MjF4tzefUeXbfGxf4L8fee6p91Jp-cUVQWcmAu5dBIpfaVx6gi8E2FlGFuvGcau5YFScei7IfogQgd-gsLNA6P8hCOoQzMWCsVbLQ_HfQCb3BNNtwab7e5o_KVy8RBvt7bgIQVYo2gfIVigfJN_WLysMMC_0Wxm1U4fw6MCjrLjXH6ewIad1WH7hEKIqApcHfaKiw3sPStuLhEnWaES6rA1LA7nn8KvbpU5nKUJQ2TJxutrCdSU57gmBcuO-2xAQej4zDdMGVVlmy5Y50Ytz8aTQ3aWptdTy9oowodMzQwbUuzgAs0Ga6O9zQjZV5zKak4-BDWVJVKdNtpmw6rf-MA66yTnOM4P7BxmpbMXjAIp757Bxb0w8TnUZunMvgQm4qZGrzE0-AVvxkJqOtG3Bt-DJAziBhyU7IviIm86le-YRug_EZejjMsN2K_I5nmikL8J2sT7qpPyemcN6e11VKiJyDfSGGF9Sb_WEjo0sQ5lECRcGak90YDdUnKiQtksovXSaMDbqhvVBJ39qJlNV0hDefYRzTaR5kUucdVMPHQaaEP71f8HfwPbvclwEA36o_PXsOMi2qO4HlfuQm15u7J7iM6Wer9YEgyu7nsV_gbL70_c
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3da9swED-6FLLBGFv20Wxtp0G3p5ok_pDlwRjNV5u1CWG00DdPsuR2EOKsSdj6r_Wv250tO2OwvfXFGOksLO50dz_pdAdwoHwhybI5qeTa8VPlOcoz0nG1MUIaJaQkoDie8JML_8tlcLkFd-VdGAqrLHVirqh1ltAeectFXMHdKHR5K7VhEdP-8PPih0MVpOiktSynUYjIqbn9ifBt-WnUR16_d93h4Lx34tgKA07ic7FyUiECqXAa6ISn3IgoMV47Slzjh1ImUeBGiEe4CoMUBd0PtQxSHx08NGkRl36n4-G4D2A7JFRUg-3uYDL9WsE99L07dXhMwdYo5i10HCj35B_WLy8S8G_PNrdww6fwxLqm7KiQpWewZeYNeNincCKqCNeAPXvJgX1g9hYTcZVZ9dCA-tge1D-Hu0GVRZxlKUMvk003VxSoqch3TcqWHY3YGQWk47PYPGVUoW22ZL1ruTqenh-y4yy7mhnWRXE-ZHKu2ZjiCJdoQlgXbW9OyL7jr6wXhCeoqSyX6nTRTmtWTeMj620SnuM4v7BznJfRXjIKqrx9ARf3wsSXUJtnc7MDjCdthQgy0viF3064UHS6bzS-h2kUJk04KNkXJzaHOpXymMWIpYjLcc7lJuxXZIsiacjfBF3ifdVJOb7zhuzmKrYqIw600JqbQNDUOlxFOlGRCMPUl1oojzdht5Sc2CqeZbxZJk14V3WjyqBzIDk32RppKOc-erZtpHlVSFz1Jx4CCNrcfv3_wd9CHVdffDaanL6BRy46fhTi44pdqK1u1mYPHbWV2rcrgsG3-16EvwFPmFQR
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Evaluation+of+the+Performance+of+Generative+AI+Large+Language+Models+ChatGPT%2C+Google+Bard%2C+and+Microsoft+Bing+Chat+in+Supporting+Evidence-Based+Dentistry%3A+Comparative+Mixed+Methods+Study&rft.jtitle=Journal+of+medical+Internet+research&rft.au=Giannakopoulos%2C+Kostis&rft.au=Kavadella%2C+Argyro&rft.au=Salim%2C+Anas+Aaqel&rft.au=Stamatopoulos%2C+Vassilis&rft.date=2023-12-28&rft.pub=Gunther+Eysenbach+MD+MPH%2C+Associate+Professor&rft.eissn=1438-8871&rft.volume=25&rft.issue=1&rft.spage=e51580&rft_id=info:doi/10.2196%2F51580&rft.externalDBID=HAS_PDF_LINK
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1438-8871&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1438-8871&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1438-8871&client=summon