Pharmacy Faculty Scholarship

Let's Have a Chat: How Well Does an Artificial Intelligence Chatbot Answer Clinical Infectious Diseases Pharmacotherapy Questions?

Wesley D. Kufel, Binghamton University–SUNY
Kathleen D. Hanrahan
Robert W. Seabury
Katie A. Parsels
Jason C. Gallagher
Conan MacDougall
Elizabeth W. Covington
Elias B. Chahine
Rachel S. Britt
Jeffrey M. Steele

Document Type

Article

Publication Date

10-25-2024

Keywords

artificial intelligence, chatbot, ChatGPT, infectious diseases, pharmacist

Abstract

Background. It is unknown whether ChatGPT provides quality responses to infectious diseases (ID) pharmacotherapy questions. This study surveyed ID pharmacist subject matter experts (SMEs) to assess the quality of ChatGPT version 3.5 (GPT-3.5) responses.

Methods. The primary outcome was the percentage of GPT-3.5 responses considered useful by SME rating. Secondary outcomes were SMEs’ ratings of correctness, completeness, and safety. Rating definitions were based on literature review. One hundred ID pharmacotherapy questions were entered into GPT-3.5 without custom instructions or additional prompts, and responses were recorded. A 0–10 rating scale for correctness, completeness, and safety was developed and validated for interrater reliability. Continuous and categorical variables were assessed for interrater reliability via the average-measures intraclass correlation coefficient and Fleiss multirater kappa, respectively. SMEs’ responses were compared by the Kruskal-Wallis test for continuous variables and the chi-square test for categorical variables.
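
For readers unfamiliar with the Fleiss multirater kappa named in the Methods, the following is a minimal sketch of the standard formula in plain Python. It is not the authors' analysis code; the function name `fleiss_kappa` and the toy rating counts are illustrative assumptions, and the study's own data are not reproduced here.

```python
from typing import Sequence


def fleiss_kappa(counts: Sequence[Sequence[int]]) -> float:
    """Fleiss' kappa for a table where counts[i][j] is the number of
    raters who assigned item i to category j.

    Hypothetical helper for illustration; every item must be rated by
    the same number of raters (at least two).
    """
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each item needs the same number (>= 2) of ratings")
    n_categories = len(counts[0])

    # Share of all ratings that fall in each category.
    p_j = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    # Observed agreement among rater pairs, per item.
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_items          # mean observed agreement
    p_e = sum(p * p for p in p_j)       # agreement expected by chance
    if p_e == 1.0:
        raise ValueError("kappa is undefined when all ratings share one category")
    return (p_bar - p_e) / (1 - p_e)


# Toy data (made up): 4 responses, 3 raters, categories = [useful, not useful].
toy_counts = [[3, 0], [2, 1], [1, 2], [0, 3]]
toy_kappa = fleiss_kappa(toy_counts)
```

In this toy table two responses have unanimous ratings and two are split 2 to 1, so observed agreement is only modestly above the chance level of 0.5.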

Results. SMEs considered 41.8% of responses useful. Median (IQR) ratings for correctness, completeness, and safety were 7 (4–9), 5 (3–8), and 8 (4–10), respectively. The Fleiss multirater kappa for usefulness was 0.379 (95% CI, .317–.441), indicating fair agreement, and intraclass correlation coefficients were 0.820 (95% CI, .758–.870), 0.745 (95% CI, .656–.816), and 0.833 (95% CI, .775–.880) for correctness, completeness, and safety, respectively, indicating at least substantial agreement. No significant difference was observed among SME responses for percentage of responses considered useful.
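
The agreement labels in the Results ("fair", "at least substantial") match the wording of the commonly used Landis and Koch bands. The abstract does not say which scale the authors applied, so the sketch below assumes those bands; the function name `landis_koch_label` is hypothetical, and applying the same bands to intraclass correlation coefficients is likewise an assumption.

```python
def landis_koch_label(coefficient: float) -> str:
    """Map an agreement coefficient to a Landis and Koch descriptor.

    Assumption: the abstract's labels follow these bands; the source
    does not name the scale it used.
    """
    if coefficient < 0:
        return "poor"
    # (upper bound of band, label), checked in ascending order.
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper, label in bands:
        if coefficient <= upper:
            return label
    raise ValueError("agreement coefficients cannot exceed 1")


# Point estimates reported in the Results.
reported = {
    "usefulness (Fleiss kappa)": 0.379,
    "correctness (ICC)": 0.820,
    "completeness (ICC)": 0.745,
    "safety (ICC)": 0.833,
}
labels = {name: landis_koch_label(value) for name, value in reported.items()}
```

Under this assumed scale, the usefulness kappa falls in the "fair" band and all three intraclass correlation coefficients fall in "substantial" or higher, which is consistent with the abstract's wording.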

Conclusions. Fewer than 50% of GPT-3.5 responses were considered useful by SMEs. Responses were mostly considered correct and safe but were often incomplete, suggesting that GPT-3.5 responses may not replace an ID pharmacist’s responses.

Comments

https://doi.org/10.1093/ofid/ofae641

Publisher Attribution

© The Author(s) 2024. Published by Oxford University Press on behalf of Infectious Diseases Society of America. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Recommended Citation

Kufel, Wesley D.; Hanrahan, Kathleen D.; Seabury, Robert W.; Parsels, Katie A.; Gallagher, Jason C.; MacDougall, Conan; Covington, Elizabeth W.; Chahine, Elias B.; Britt, Rachel S.; and Steele, Jeffrey M., "Let's Have a Chat: How Well Does an Artificial Intelligence Chatbot Answer Clinical Infectious Diseases Pharmacotherapy Questions?" (2024). Pharmacy Faculty Scholarship. 52.
https://orb.binghamton.edu/pharmacy_fac/52

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.
