Bing chat for kidney stone management questions based on the AUA guidelines: a comparison of chatbot conversation style modes

Whiles BB, Bird VG, Canales BK, DiBianco JM, Terry RS (2023) Caution! AI bot has entered the patient chat: ChatGPT has limitations in providing accurate urologic healthcare advice. Urology 180:278–284. https://doi.org/10.1016/j.urology.2023.07.010

Davis R, Eppler M, Ayo-Ajibola O et al (2023) Evaluating the effectiveness of artificial intelligence–powered large language models application in disseminating appropriate and readable health information in urology. J Urol 210(4):688–694. https://doi.org/10.1097/JU.0000000000003615

Cocci A, Pezzoli M, Lo Re M et al (2024) Quality of information and appropriateness of ChatGPT outputs for urology patients. Prostate Cancer Prostatic Dis 27(1):103–108. https://doi.org/10.1038/s41391-023-00705-y

Assimos D, Krambeck A, Miller NL et al (2016) Surgical management of stones: American Urological Association/Endourological Society guideline, PART II. J Urol 196(4):1161–1169. https://doi.org/10.1016/j.juro.2016.05.091

Charnock D, Shepperd S, Needham G, Gann R (1999) DISCERN: an instrument for judging the quality of written consumer health information on treatment choices. J Epidemiol Commun Health 53(2):105–111. https://doi.org/10.1136/jech.53.2.105

Maynez J, Narayan S, Bohnet B, McDonald R (2020) On faithfulness and factuality in abstractive summarization. Association for Computational Linguistics, pp 1906–1919

Musheyev D, Pan A, Loeb S, Kabarriti AE (2024) How well do artificial intelligence chatbots respond to the top search queries about urological malignancies? Eur Urol 85(1):13–16. https://doi.org/10.1016/j.eururo.2023.07.004

Eckrich J, Ellinger J, Cox A, Stein J, Ritter M, Blaikie A, Kuhn S, Buhr CR (2024) Urology consultants versus large language models: potentials and hazards for medical advice in urology. BJUI Compass 5(5):438–444

IBM (2024) What is model drift? Published online July 2024. https://www.ibm.com/think/topics/model-drift. Accessed 3 Feb 2025

Heilmeyer F, Böhringer D, Reinhard T, Arens S, Lyssenko L, Haverkamp C (2024) Viability of open large language models for clinical documentation in German health care: real-world model evaluation study. JMIR Med Inform 28(12):e59617. https://doi.org/10.2196/59617

Thirunavukarasu AJ, Ting DSJ, Elangovan K, Gutierrez L, Tan TF, Ting DSW (2023) Large language models in medicine. Nat Med 29(8):1930–1940. https://doi.org/10.1038/s41591-023-02448-8. (Epub 2023 Jul 17 PMID: 37460753)

Goh E, Gallo R, Hom J, Strong E, Weng Y, Kerman H, Cool JA, Kanjee Z, Parsons AS, Ahuja N, Horvitz E, Yang D, Milstein A, Olson APJ, Rodman A, Chen JH (2024) Large language model influence on diagnostic reasoning: a randomized clinical trial. JAMA Netw Open 7(10):e2440969. https://doi.org/10.1001/jamanetworkopen.2024.40969

Van Veen D, Van Uden C, Blankemeier L, Delbrouck JB, Aali A, Bluethgen C, Pareek A, Polacin M, Reis EP, Seehofnerová A, Rohatgi N, Hosamani P, Collins W, Ahuja N, Langlotz CP, Hom J, Gatidis S, Pauly J, Chaudhari AS (2024) Adapted large language models can outperform medical experts in clinical text summarization. Nat Med 30(4):1134–1142. https://doi.org/10.1038/s41591-024-02855-5

Sezgin E (2024) Redefining virtual assistants in health care: the future with large language models. J Med Internet Res 19(26):e53225. https://doi.org/10.2196/53225

Gilson A, Safranek CW, Huang T, Socrates V, Chi L, Taylor RA, Chartash D (2023) How does ChatGPT perform on the United States Medical Licensing Examination (USMLE)? The implications of large language models for medical education and knowledge assessment. JMIR Med Educ 8(9):e45312

Bhayana R (2024) Chatbots and large language models in radiology: a practical primer for clinical and research applications. Radiology 310(1):e232756. https://doi.org/10.1148/radiol.232756. (PMID: 38226883)

Preiksaitis C, Ashenburg N, Bunney G, Chu A, Kabeer R, Riley F, Ribeira R, Rose C (2024) The role of large language models in transforming emergency medicine: scoping review. JMIR Med Inform 10(12):e53787

Ong JCL, Chang SY, William W, Butte AJ, Shah NH, Chew LST, Liu N, Doshi-Velez F, Lu W, Savulescu J, Ting DSW (2024) Ethical and regulatory challenges of large language models in medicine. Lancet Digit Health 6(6):e428–e432. https://doi.org/10.1016/S2589-7500(24)00061-X. (Epub 2024 Apr 23 PMID: 38658283)

Eraslan G, Avsec Ž, Gagneur J, Theis FJ (2019) Deep learning: new computational modelling techniques for genomics. Nat Rev Genet 20(7):389–403. https://doi.org/10.1038/s41576-019-0122-6. (PMID: 30971806)

Chen L, Zaharia M, Zou J (2023) How is ChatGPT’s behavior changing over time? Published online 2023. https://doi.org/10.48550/ARXIV.2307.09009
