Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study

Bibliographic Details
Published in	Journal of Medical Internet Research, Vol. 25, no. 1, p. e51580
Main Authors	Giannakopoulos, Kostis; Kavadella, Argyro; Aaqel Salim, Anas; Stamatopoulos, Vassilis; Kaklamanos, Eleftherios G
Format	Journal Article
Language	English
Published	Canada: Gunther Eysenbach MD MPH, Associate Professor; JMIR Publications, 28.12.2023
Subjects	Accuracy; Answers; Artificial Intelligence; Bans; Chatbots; Clinical decision making; Clinical medicine; Clinical research; Critical thinking; Decision making; Dentistry; Dentists; Evidence-Based Dentistry; Generative artificial intelligence; Health care; Humans; Internet access; Language; Large language models; Maxillofacial surgery; Medical libraries; Medical screening; Natural language; Neural networks; Oral hygiene; Orthodontics; Patients; Periodontics; Personal information; Privacy; Professional Role; Professionals; Prosthodontics; Scientific evidence; Search Engine; Textbooks; generative pretrained transformers; evidence-based dentistry; dental practice; dental professional; AI; clinical practice; artificial intelligence; clinical practice guidelines; Microsoft Bing; ChatGPT; large language models; clinical decision-making; Google Bard

Abstract	The increasing application of generative artificial intelligence large language models (LLMs) in various fields, including dentistry, raises questions about their accuracy. This study aims to comparatively evaluate the answers provided by 4 LLMs, namely Bard (Google LLC), ChatGPT-3.5 and ChatGPT-4 (OpenAI), and Bing Chat (Microsoft Corp), to clinically relevant questions from the field of dentistry. The LLMs were queried with 20 open-type, clinical dentistry-related questions from different disciplines, developed by the respective faculty of the School of Dentistry, European University Cyprus. The LLMs' answers were graded 0 (minimum) to 10 (maximum) points against strong, traditionally collected scientific evidence, such as guidelines and consensus statements, using a rubric, as if they were examination questions posed to students, by 2 experienced faculty members. The scores were statistically compared to identify the best-performing model using the Friedman and Wilcoxon tests. Moreover, the evaluators were asked to provide a qualitative evaluation of the comprehensiveness, scientific accuracy, clarity, and relevance of the LLMs' answers. Overall, no statistically significant difference was detected between the scores given by the 2 evaluators; therefore, an average score was computed for every LLM. Although ChatGPT-4 statistically outperformed ChatGPT-3.5 (P=.008), Bing Chat (P=.049), and Bard (P=.045), all models occasionally exhibited inaccuracies, generality, outdated content, and a lack of source references. The evaluators noted instances where the LLMs delivered irrelevant information, vague answers, or information that was not fully accurate. This study demonstrates that although LLMs hold promising potential as an aid in the implementation of evidence-based dentistry, their current limitations can lead to potentially harmful health care decisions if not used judiciously. 
Therefore, these tools should not replace the dentist's critical thinking and in-depth understanding of the subject matter. Further research, clinical validation, and model improvements are necessary for these tools to be fully integrated into dental practice. Dental practitioners must be aware of the limitations of LLMs, as their imprudent use could potentially impact patient care. Regulatory measures should be established to oversee the use of these evolving technologies.
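
The statistical step described in the abstract (averaging the 2 evaluators' grades, then comparing the models with the Friedman test) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the scores are hypothetical and are not data from the study, the function names are invented here, and the statistic is computed without a tie correction. In practice a library routine such as `scipy.stats.friedmanchisquare` (followed by `scipy.stats.wilcoxon` for pairwise comparisons) would typically be used and may give a slightly different value when ties are present.

```python
from statistics import mean


def average_ranks(values):
    """Rank values ascending (1 = lowest); tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for pos in range(i, j + 1):
            ranks[order[pos]] = shared
        i = j + 1
    return ranks


def friedman_statistic(score_table):
    """Friedman chi-square statistic (no tie correction).

    score_table holds one row per question and one column per model.
    """
    n = len(score_table)        # number of questions (blocks)
    k = len(score_table[0])     # number of models (treatments)
    rank_sums = [0.0] * k
    for row in score_table:
        for col, rank in enumerate(average_ranks(row)):
            rank_sums[col] += rank
    return 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3.0 * n * (k + 1)


# Hypothetical 0-10 grades for 5 questions and 4 models; NOT data from the study.
evaluator_a = [[7, 5, 6, 4], [8, 6, 5, 7], [9, 7, 8, 6], [6, 4, 5, 3], [8, 5, 7, 6]]
evaluator_b = [[8, 5, 6, 5], [8, 7, 5, 6], [9, 6, 8, 7], [7, 4, 6, 3], [9, 5, 6, 7]]

# Average the two evaluators' grades for each answer, as the abstract describes.
averaged = [
    [mean(pair) for pair in zip(row_a, row_b)]
    for row_a, row_b in zip(evaluator_a, evaluator_b)
]

q = friedman_statistic(averaged)
print(f"Friedman statistic: {q:.2f} (compare with chi-square, {len(averaged[0]) - 1} df)")
```

The statistic is referred to a chi-square distribution with k - 1 degrees of freedom (3 for 4 models); a significant result would then justify pairwise Wilcoxon signed-rank tests between models.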
Author	Kaklamanos, Eleftherios G; Aaqel Salim, Anas; Stamatopoulos, Vassilis; Giannakopoulos, Kostis; Kavadella, Argyro
Author_xml	– sequence: 1 givenname: Kostis orcidid: 0000-0001-7008-7306 surname: Giannakopoulos fullname: Giannakopoulos, Kostis – sequence: 2 givenname: Argyro orcidid: 0009-0003-0560-8373 surname: Kavadella fullname: Kavadella, Argyro – sequence: 3 givenname: Anas orcidid: 0000-0002-7731-6666 surname: Aaqel Salim fullname: Aaqel Salim, Anas – sequence: 4 givenname: Vassilis orcidid: 0000-0002-9044-796X surname: Stamatopoulos fullname: Stamatopoulos, Vassilis – sequence: 5 givenname: Eleftherios G orcidid: 0000-0002-0513-5110 surname: Kaklamanos fullname: Kaklamanos, Eleftherios G
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/38009003$$D View this record in MEDLINE/PubMed
CitedBy_id	crossref_primary_10_37990_medr_1446396 crossref_primary_10_1111_edt_13041 crossref_primary_10_1111_edt_13042 crossref_primary_10_37349_edht_2024_00032 crossref_primary_10_1093_jamia_ocae086 crossref_primary_10_1007_s40368_025_01012_x crossref_primary_10_1177_14604582241304679 crossref_primary_10_1186_s12903_025_05732_w crossref_primary_10_3145_thinkepi_2024_e18a04 crossref_primary_10_1186_s12909_024_05630_9 crossref_primary_10_1016_j_pec_2025_108672 crossref_primary_10_2196_54580 crossref_primary_10_3389_fdmed_2024_1456208 crossref_primary_10_34248_bsengineering_1544165 crossref_primary_10_1038_s41746_024_01258_7 crossref_primary_10_1111_jcpe_14101 crossref_primary_10_3390_diagnostics14161779 crossref_primary_10_1111_jep_14084 crossref_primary_10_1177_10497315241313071 crossref_primary_10_3390_bioengineering11111145 crossref_primary_10_1016_j_ijmedinf_2025_105787 crossref_primary_10_1111_odi_15082 crossref_primary_10_4239_wjd_v16_i3_98408 crossref_primary_10_1002_lary_31434 crossref_primary_10_1038_s41598_025_94576_z crossref_primary_10_1002_hsr2_70300 crossref_primary_10_3390_app142310802 crossref_primary_10_1111_edt_13020 crossref_primary_10_1002_jdd_13882 crossref_primary_10_3389_fmed_2024_1406842 crossref_primary_10_3389_frai_2024_1379297 crossref_primary_10_1111_jcal_70004 crossref_primary_10_1186_s12911_024_02757_z crossref_primary_10_1016_j_ijmedinf_2024_105474 crossref_primary_10_1111_jnu_13036 crossref_primary_10_1016_j_resmer_2024_101091 crossref_primary_10_1159_000538538 crossref_primary_10_1007_s10067_024_07154_5 crossref_primary_10_1016_j_prosdent_2025_02_008 crossref_primary_10_1002_ca_24244 crossref_primary_10_1097_JS9_0000000000001775 crossref_primary_10_2196_59258 crossref_primary_10_2196_60083 crossref_primary_10_1111_eje_13069 crossref_primary_10_2196_55933 crossref_primary_10_1253_circrep_CR_24_0019 crossref_primary_10_1016_j_prosdent_2024_10_020 crossref_primary_10_2147_CCID_S478309 crossref_primary_10_2196_52746 
crossref_primary_10_2196_22769 crossref_primary_10_1111_edt_12999 crossref_primary_10_1111_jerd_13447
Cites_doi	10.48550/arXiv.2212.14882 10.1038/s41415-023-5926-2 10.1155/2022/1410448 10.1016/j.diii.2023.02.006 10.1259/dmfr.20190107 10.1111/jerd.13046 10.1101/2023.02.02.23285399 10.7759/cureus.38317 10.1016/j.dsx.2023.102744 10.1001/jamahealthforum.2023.1938 10.3390/healthcare11060887 10.1016/j.legalmed.2020.101826 10.1016/j.jebdp.2005.12.011 10.1016/j.adaj.2022.07.012 10.2196/48568 10.1038/s41368-023-00239-y 10.1002/jdd.13010 10.1021/acs.jchemed.3c00087 10.1016/S2589-7500(23)00021-3 10.1016/j.oooo.2023.09.010 10.1016/j.jdent.2021.103849 10.2196/20346 10.37074/jalt.2023.6.1.23 10.1016/j.ijom.2023.09.005 10.1038/s41746-021-00464-x 10.3390/jcm9113579 10.1016/j.jormas.2023.101471 10.1016/j.adaj.2023.06.003 10.1016/j.jmir.2020.08.011 10.1111/jerd.12844 10.1038/sj.bdj.4801062 10.1016/S2589-7500(23)00023-7 10.1016/j.jclinepi.2010.07.015 10.1038/s41746-023-00819-6 10.7759/cureus.42133 10.1038/s41415-023-5928-0 10.3390/healthcare11101480
ContentType	Journal Article
Copyright	Kostis Giannakopoulos, Argyro Kavadella, Anas Aaqel Salim, Vassilis Stamatopoulos, Eleftherios G Kaklamanos. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 28.12.2023. 2023. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID	AAYXX CITATION CGR CUY CVF ECM EIF NPM 3V. 7QJ 7RV 7X7 7XB 8FI 8FJ 8FK ABUWG AFKRA ALSLI AZQEC BENPR CCPQU CNYFK DWQXO E3H F2A FYUFA GHDGH K9. KB0 M0S M1O NAPCQ PHGZM PHGZT PIMPY PKEHL PPXIY PQEST PQQKQ PQUKI PRQQA 7X8 DOA
DOI	10.2196/51580
DatabaseName	CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed ProQuest Central (Corporate) Applied Social Sciences Index & Abstracts (ASSIA) Nursing & Allied Health Database Health & Medical Collection ProQuest Central (purchase pre-March 2016) Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest Central UK/Ireland Social Science Premium Collection ProQuest Central Essentials ProQuest Central ProQuest One Community College Library & Information Science Collection ProQuest Central Korea Library & Information Sciences Abstracts (LISA) Library & Information Science Abstracts (LISA) Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Health & Medical Complete (Alumni) Nursing & Allied Health Database (Alumni Edition) ProQuest Health & Medical Collection Library Science Database Nursing & Allied Health Premium ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest One Social Sciences MEDLINE - Academic DOAJ Directory of Open Access Journals
DatabaseTitle	CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Publicly Available Content Database ProQuest One Academic Middle East (New) Library and Information Science Abstracts (LISA) ProQuest Central Essentials ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing Applied Social Sciences Index and Abstracts (ASSIA) ProQuest Central ProQuest Library Science Health Research Premium Collection Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Library & Information Science Collection ProQuest Central (New) Social Science Premium Collection ProQuest One Social Sciences ProQuest One Academic Eastern Edition ProQuest Nursing & Allied Health Source ProQuest Hospital Collection Health Research Premium Collection (Alumni) ProQuest Hospital Collection (Alumni) Nursing & Allied Health Premium ProQuest Health & Medical Complete ProQuest One Academic UKI Edition ProQuest Nursing & Allied Health Source (Alumni) ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) MEDLINE - Academic
DatabaseTitleList	Publicly Available Content Database MEDLINE MEDLINE - Academic
Database_xml	– sequence: 1 dbid: DOA name: Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 4 dbid: BENPR name: ProQuest Central url: https://www.proquest.com/central sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine Library & Information Science Dentistry
EISSN	1438-8871
ExternalDocumentID	oai_doaj_org_article_5d8dd6e58bec416b9dcb9877f4ad8b36 38009003 10_2196_51580
Genre	Research Support, Non-U.S. Gov't Journal Article
GroupedDBID	--- .4I .DC 29L 2WC 36B 53G 5GY 5VS 77K 7RV 7X7 8FI 8FJ AAFWJ AAKPC AAWTL AAYXX ABDBF ABIVO ABUWG ACGFO ADBBV AEGXH AENEX AFKRA AFPKN AIAGR ALIPV ALMA_UNASSIGNED_HOLDINGS ALSLI AOIJS BAWUL BCNDV BENPR CCPQU CITATION CNYFK CS3 DIK DU5 DWQXO E3Z EAP EBD EBS EJD ELW EMB EMOBN ESX F5P FRP FYUFA GROUPED_DOAJ GX1 HMCUK HYE IAO ICO IEA IHR INH ISN ITC KQ8 M1O M48 NAPCQ OK1 OVT P2P PGMZT PHGZM PHGZT PIMPY PQQKQ RNS RPM SJN SV3 TR2 UKHRP XSB CGR CUY CVF ECM EIF NPM 3V. 7QJ 7XB 8FK ACUHS AZQEC E3H F2A K9. PKEHL PPXIY PQEST PQUKI PRQQA 7X8 PUEGO
ID	FETCH-LOGICAL-c468t-f885abbec428f6e89ce309c2e47aac95292896b75f04847da5f481d61496a4113
IEDL.DBID	M48
ISSN	1438-8871
IngestDate	Wed Aug 27 01:26:08 EDT 2025 Fri Jul 11 15:38:31 EDT 2025 Fri Jul 25 10:10:39 EDT 2025 Wed Feb 19 02:08:38 EST 2025 Thu Apr 24 23:02:06 EDT 2025 Tue Jul 01 02:06:12 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	1
Keywords	generative pretrained transformers; evidence-based dentistry; dental practice; dental professional; AI; clinical practice; artificial intelligence; clinical practice guidelines; Microsoft Bing; ChatGPT; large language models; clinical decision-making; Google Bard
Language	English
License	Kostis Giannakopoulos, Argyro Kavadella, Anas Aaqel Salim, Vassilis Stamatopoulos, Eleftherios G Kaklamanos. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 28.12.2023.
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c468t-f885abbec428f6e89ce309c2e47aac95292896b75f04847da5f481d61496a4113
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ORCID	0000-0001-7008-7306 0000-0002-9044-796X 0009-0003-0560-8373 0000-0002-7731-6666 0000-0002-0513-5110
OpenAccessLink	http://journals.scholarsportal.info/openUrl.xqy?doi=10.2196/51580
PMID	38009003
PQID	2917629726
PQPubID	2033121
ParticipantIDs	doaj_primary_oai_doaj_org_article_5d8dd6e58bec416b9dcb9877f4ad8b36 proquest_miscellaneous_2894361206 proquest_journals_2917629726 pubmed_primary_38009003 crossref_primary_10_2196_51580 crossref_citationtrail_10_2196_51580
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2023-12-28
PublicationDateYYYYMMDD	2023-12-28
PublicationDate_xml	– month: 12 year: 2023 text: 2023-12-28 day: 28
PublicationDecade	2020
PublicationPlace	Canada
PublicationPlace_xml	– name: Canada – name: Toronto
PublicationTitle	Journal of Medical Internet Research
PublicationTitleAlternate	J Med Internet Res
PublicationYear	2023
Publisher	Gunther Eysenbach MD MPH, Associate Professor JMIR Publications
Publisher_xml	– name: Gunther Eysenbach MD MPH, Associate Professor – name: JMIR Publications
SSID	ssj0020491
SourceID	doaj proquest pubmed crossref
SourceType	Open Website Aggregation Database Index Database Enrichment Source
StartPage	e51580
SubjectTerms	Accuracy; Answers; Artificial Intelligence; Bans; Chatbots; Clinical decision making; Clinical medicine; Clinical research; Critical thinking; Decision making; Dentistry; Dentists; Evidence-Based Dentistry; Generative artificial intelligence; Health care; Humans; Internet access; Language; Large language models; Maxillofacial surgery; Medical libraries; Medical screening; Natural language; Neural networks; Oral hygiene; Orthodontics; Patients; Periodontics; Personal information; Privacy; Professional Role; Professionals; Prosthodontics; Scientific evidence; Search Engine; Textbooks
Title	Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study
URI	https://www.ncbi.nlm.nih.gov/pubmed/38009003 https://www.proquest.com/docview/2917629726 https://www.proquest.com/docview/2894361206 https://doaj.org/article/5d8dd6e58bec416b9dcb9877f4ad8b36
Volume	25
DOI	10.2196/51580
EISSN	1438-8871