Bing chat for kidney stone management questions based on the AUA guidelines: a comparison of chatbot conversation style modes

Whiles BB, Bird VG, Canales BK, DiBianco JM, Terry RS (2023) Caution! AI bot has entered the patient chat: chatgpt has limitations in providing accurate urologic healthcare advice. Urology 180:278–284. https://doi.org/10.1016/j.urology.2023.07.010

Article PubMed Google Scholar

Davis R, Eppler M, Ayo-Ajibola O et al (2023) Evaluating the effectiveness of artificial intelligence–powered large language models application in disseminating appropriate and readable health information in urology. J Urol 210(4):688–694. https://doi.org/10.1097/JU.0000000000003615

Article PubMed Google Scholar

Cocci A, Pezzoli M, Lo Re M et al (2024) Quality of information and appropriateness of ChatGPT outputs for urology patients. Prostate Cancer Prostatic Dis 27(1):103–108. https://doi.org/10.1038/s41391-023-00705-y

Article PubMed Google Scholar

Assimos D, Krambeck A, Miller NL et al (2016) Surgical management of stones: american urological association/endourological society guideline. PART II J Urol 196(4):1161–1169. https://doi.org/10.1016/j.juro.2016.05.091

Article PubMed Google Scholar

Charnock D, Shepperd S, Needham G, Gann R (1999) DISCERN: an instrument for judging the quality of written consumer health information on treatment choices. J Epidemiol Commun Health 53(2):105–111. https://doi.org/10.1136/jech.53.2.105

Article CAS Google Scholar

Maynez J, Narayan S, Bohnet B, McDonald R. On faithfulness and factuality in abstractive summarization. Association for Computational Linguistics. 2020; 1906–19.

Musheyev D, Pan A, Loeb S, Kabarriti AE (2024) How well do artificial intelligence chatbots respond to the top search queries about urological malignancies? Eur Urol 85(1):13–16. https://doi.org/10.1016/j.eururo.2023.07.004

Article PubMed Google Scholar

Eckrich J, Ellinger J, Cox A, Stein J, Ritter M, Blaikie A, Kuhn S, Buhr CR (2024) Urology consultants versus large language models: potentials and hazards for medical advice in urology. BJUI Compass 5(5):438–444

Article PubMed PubMed Central Google Scholar

IBM. What is model drift? Published online on July 2024. https://www.ibm.com/think/topics/model-drift. Accessed on Feb 3rd, 2025.

Heilmeyer F, Böhringer D, Reinhard T, Arens S, Lyssenko L, Haverkamp C (2024) Viability of open large language models for clinical documentation in german health care: real-world model evaluation study. JMIR Med Inform 28(12):e59617. https://doi.org/10.2196/59617

Article Google Scholar

Thirunavukarasu AJ, Ting DSJ, Elangovan K, Gutierrez L, Tan TF, Ting DSW (2023) Large language models in medicine. Nat Med 29(8):1930–1940. https://doi.org/10.1038/s41591-023-02448-8. (Epub 2023 Jul 17 PMID: 37460753)

Article CAS PubMed Google Scholar

Goh E, Gallo R, Hom J, Strong E, Weng Y, Kerman H, Cool JA, Kanjee Z, Parsons AS, Ahuja N, Horvitz E, Yang D, Milstein A, Olson APJ, Rodman A, Chen JH (2024) Large language model influence on diagnostic reasoning: a randomized clinical trial. JAMA Netw Open 7(10):e2440969. https://doi.org/10.1001/jamanetworkopen.2024.40969

Article PubMed PubMed Central Google Scholar

Van Veen D, Van Uden C, Blankemeier L, Delbrouck JB, Aali A, Bluethgen C, Pareek A, Polacin M, Reis EP, Seehofnerová A, Rohatgi N, Hosamani P, Collins W, Ahuja N, Langlotz CP, Hom J, Gatidis S, Pauly J, Chaudhari AS (2024) Adapted large language models can outperform medical experts in clinical text summarization. Nat Med 30(4):1134–1142. https://doi.org/10.1038/s41591-024-02855-5

Article CAS PubMed PubMed Central Google Scholar

Sezgin E (2024) Redefining virtual assistants in health care: the future with large language models. J Med Internet Res 19(26):e53225. https://doi.org/10.2196/53225

Article Google Scholar

Gilson A, Safranek CW, Huang T, Socrates V, Chi L, Taylor RA, Chartash D (2023) How does ChatGPT perform on the united states medical licensing examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment. JMIR Med Educ 8(9):e45312.

Article Google Scholar

Bhayana R (2024) Chatbots and large language models in radiology: a practical primer for clinical and research applications. Radiology 310(1):e232756. https://doi.org/10.1148/radiol.232756. (PMID: 38226883)

Article PubMed Google Scholar

Preiksaitis C, Ashenburg N, Bunney G, Chu A, Kabeer R, Riley F, Ribeira R, Rose C (2024) The role of large language models in transforming emergency medicine: scoping review. JMIR Med Inform 10(12):e53787.

Article Google Scholar

Ong JCL, Chang SY, William W, Butte AJ, Shah NH, Chew LST, Liu N, Doshi-Velez F, Lu W, Savulescu J, Ting DSW (2024) Ethical and regulatory challenges of large language models in medicine. Lancet Digit Health 6(6):e428–e432. https://doi.org/10.1016/S2589-7500(24)00061-X. (Epub 2024 Apr 23 PMID: 38658283)

Article CAS PubMed Google Scholar

Eraslan G, Avsec Ž, Gagneur J, Theis FJ (2019) Deep learning: new computational modelling techniques for genomics. Nat Rev Genet 20(7):389–403. https://doi.org/10.1038/s41576-019-0122-6. (PMID: 30971806)

Article CAS PubMed Google Scholar

Chen L, Zaharia M, Zou J. How is ChatGPT’s behavior changing over time? Published online 2023. https://doi.org/10.48550/ARXIV.2307.09009

View original article

WORLD JOURNAL OF UROLOGY

Like

Share Bookmark

0 0 0 0 0 0 0

More from this channel

Bing chat for kidney stone management questions based on the AUA guidelines: a comparison of chatbot conversation style modes

Comments (0)