Skip to main content

Main menu

  • HOME
  • ARTICLES
    • Current Issue
    • Ahead of Print
    • Archives
    • Abstracts In Press
    • Special Issue Archive
    • Subject Collections
  • INFO FOR
    • Authors
    • Reviewers
    • Call For Papers
    • Subscribers
    • Advertisers
  • SUBMIT
    • Manuscript
    • Peer Review
  • ABOUT
    • The JABFM
    • The Editing Fellowship
    • Editorial Board
    • Indexing
    • Editors' Blog
  • CLASSIFIEDS
  • Other Publications
    • abfm

User menu

Search

  • Advanced search
American Board of Family Medicine
  • Other Publications
    • abfm
American Board of Family Medicine

American Board of Family Medicine

Advanced Search

  • HOME
  • ARTICLES
    • Current Issue
    • Ahead of Print
    • Archives
    • Abstracts In Press
    • Special Issue Archive
    • Subject Collections
  • INFO FOR
    • Authors
    • Reviewers
    • Call For Papers
    • Subscribers
    • Advertisers
  • SUBMIT
    • Manuscript
    • Peer Review
  • ABOUT
    • The JABFM
    • The Editing Fellowship
    • Editorial Board
    • Indexing
    • Editors' Blog
  • CLASSIFIEDS
  • JABFM on Bluesky
  • JABFM On Facebook
  • JABFM On Twitter
  • JABFM On YouTube
Research ArticleOriginal Research

Performance Evaluation of the Generative Pre-trained Transformer (GPT-4) on the Family Medicine In-Training Examination

Ting Wang, Arch G. Mainous, Keith Stelter, Thomas R. O’Neill and Warren P. Newton
The Journal of the American Board of Family Medicine August 2024, jabfm.2023.230433R1; DOI: https://doi.org/10.3122/jabfm.2023.230433R1
Ting Wang
From the American Board of Family Medicine, Lexington, KY (TW, KS, TRO, WPN); Department of Health Services Research, Management and Policy, University of Florida, Gainesville, FL (AGM); Department of Community Health and Family Medicine, University of Florida, Gainesville, FL (AGM); Mayo Clinic Health System, Mankato, MN (KS); Department of Family Medicine, University of North Carolina, NC (WPN).
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Arch G. Mainous III
From the American Board of Family Medicine, Lexington, KY (TW, KS, TRO, WPN); Department of Health Services Research, Management and Policy, University of Florida, Gainesville, FL (AGM); Department of Community Health and Family Medicine, University of Florida, Gainesville, FL (AGM); Mayo Clinic Health System, Mankato, MN (KS); Department of Family Medicine, University of North Carolina, NC (WPN).
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Keith Stelter
From the American Board of Family Medicine, Lexington, KY (TW, KS, TRO, WPN); Department of Health Services Research, Management and Policy, University of Florida, Gainesville, FL (AGM); Department of Community Health and Family Medicine, University of Florida, Gainesville, FL (AGM); Mayo Clinic Health System, Mankato, MN (KS); Department of Family Medicine, University of North Carolina, NC (WPN).
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Thomas R. O’Neill
From the American Board of Family Medicine, Lexington, KY (TW, KS, TRO, WPN); Department of Health Services Research, Management and Policy, University of Florida, Gainesville, FL (AGM); Department of Community Health and Family Medicine, University of Florida, Gainesville, FL (AGM); Mayo Clinic Health System, Mankato, MN (KS); Department of Family Medicine, University of North Carolina, NC (WPN).
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Warren P. Newton
From the American Board of Family Medicine, Lexington, KY (TW, KS, TRO, WPN); Department of Health Services Research, Management and Policy, University of Florida, Gainesville, FL (AGM); Department of Community Health and Family Medicine, University of Florida, Gainesville, FL (AGM); Mayo Clinic Health System, Mankato, MN (KS); Department of Family Medicine, University of North Carolina, NC (WPN).
MD, MPH
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • References
  • Info & Metrics
  • PDF
Loading

References

  1. 1.
    1. Arora A,
    2. Arora A
    . The promise of large language models in health care. Lancet 2023;401:641.
    OpenUrlCrossRef
  2. 2.
    1. Sallam M
    . The utility of ChatGPT as an example of large language models in healthcare education, research and practice: Systematic review on the future perspectives and potential limitations. medRxiv 2023. Published online 2023–02.
  3. 3.
    1. Abbasian M,
    2. Azimi I,
    3. Rahmani AM,
    4. Jain R
    . Conversational health agents: a personalized LLM-powered agent framework. Published online January 23, 2024. Accessed January 30, 2024. Available at: http://arxiv.org/abs/2310.02374.
  4. 4.
    1. Zhu L,
    2. Mou W,
    3. Chen R
    . Can the ChatGPT and other large language models with internet-connected database solve the questions and concerns of patient with prostate cancer and help democratize medical knowledge? J Transl Med 2023;21:269.
    OpenUrlCrossRef
  5. 5.
    1. Rao AS,
    2. Pang M,
    3. Kim J,
    4. et al
    . Assessing the utility of ChatGPT throughout the entire clinical workflow. medRxiv 2023. Published online 2023–02.
  6. 6.
    1. Shaheen MY
    . Applications of artificial intelligence (AI) in healthcare: A review. ScienceOpen Preprints. Published online 2021. Accessed January 30, 2024. Available at: https://www.scienceopen.com/hosted-document?doi=10.14293/S2199-1006.1.SOR-.PPVRY8K.v1.
  7. 7.
    1. Patil S,
    2. Shankar H
    . Transforming healthcare: harnessing the power of AI in the modern era. International Journal of Multidisciplinary Sciences and Arts 2023;2:60–70.
    OpenUrl
  8. 8.
    1. Hanna K
    . Exploring the applications of ChatGPT in family medicine education: five innovative ways for faculty integration. PRiMER 2023;7:26. Accessed January 30, 2024. Available at: https://journals.stfm.org/primer/2023/lte-hanna-2023-0053/.
    OpenUrl
  9. 9.
    1. Han ER,
    2. Yeo S,
    3. Kim MJ,
    4. Lee YH,
    5. Park KH,
    6. Roh H
    . Medical education trends for future physicians in the era of advanced technology and artificial intelligence: an integrative review. BMC Med Educ 2019;19:1–15.
    OpenUrl
  10. 10.
    1. Nori H,
    2. King N,
    3. McKinney SM,
    4. Carignan D,
    5. Horvitz E
    . Capabilities of gpt-4 on medical challenge problems. arXiv preprint 2023; arXiv:230313375. Published online.
  11. 11.
    1. Brin D,
    2. Sorin V,
    3. Vaid A,
    4. et al
    . Comparing ChatGPT and GPT-4 performance in USMLE soft skill assessments. Sci Rep 2023;13:16492.
    OpenUrl
  12. 12.
    1. Ali R,
    2. Tang OY,
    3. Connolly ID,
    4. et al
    . Performance of ChatGPT and GPT-4 on neurosurgery written board examinations. Neurosurgery 2022. Published online10–1227.
  13. 13.
    1. Hofmann HL,
    2. Guerra GA,
    3. Le JL,
    4. et al
    . The rapid development of artificial intelligence: GPT-4’s performance on orthopedic surgery board questions. Orthopedics 2023;47:1–5. Published online September 27.
    OpenUrl
  14. 14.
    GPT-4. Available at: https://openai.com/research/gpt-4.
  15. 15.
    1. Lee P,
    2. Bubeck S,
    3. Petro J
    . Benefits, limits, and risks of GPT-4 as an AI chatbot for medicine. Drazen JM, Kohane IS, Leong TY, eds. N Engl J Med 2023;388:1233–9.
    OpenUrlCrossRefPubMed
  16. 16.
    1. Ito N,
    2. Kadomatsu S,
    3. Fujisawa M,
    4. et al
    . The accuracy and potential racial and ethnic biases of GPT-4 in the diagnosis and triage of health conditions: evaluation study. JMIR Med Educ 2023;9:e47532.
    OpenUrl
  17. 17.
    1. Gertz RJ,
    2. Bunck AC,
    3. Lennartz S,
    4. et al
    . GPT-4 for automated determination of radiologic study and protocol based on radiology request forms: a feasibility study. Radiology 2023;307:e230877.
    OpenUrlCrossRef
  18. 18.
    1. Fink MA,
    2. Bischoff A,
    3. Fink CA,
    4. et al
    . Potential of ChatGPT and GPT-4 for data mining of free-text CT reports on lung cancer. Radiology 2023;308:e231362.
    OpenUrlCrossRefPubMed
  19. 19.
    1. Zhou J,
    2. He X,
    3. Sun L,
    4. et al
    . SkinGPT-4: an interactive dermatology diagnostic system with visual large language model. Published online 2023. Accessed January 30, 2024. Available at: https://repository.kaust.edu.sa/handle/10754/692649.
  20. 20.
    1. Han C,
    2. Kim DW,
    3. Kim S,
    4. et al
    . Evaluation of GPT-4 for 10-year cardiovascular risk prediction: insights from the UK Biobank and KoGES data. iScience 2023. Accessed January 30, 2024. Available at: https://www.cell.com/iscience/pdf/S2589-0042(24)00243-8.pdf.
  21. 21.
    1. Cheng S,
    2. Chang C,
    3. Chang W,
    4. et al
    . The now and future of ChatGPT and GPT in psychiatry. Psychiatry Clin Neurosci 2023;77:592–6.
    OpenUrl
  22. 22.
    1. Jee SH,
    2. Cabana MD
    . Indices for continuity of care: a systematic review of the literature. Med Care Res Rev 2006;63:158–88.
    OpenUrlCrossRefPubMedWeb of Science
  23. 23.
    1. Fernández-Ruiz I
    . Low-dose colchicine shows promise in chronic coronary disease. Nat Rev Cardiol 2020;17:680–1.
    OpenUrl
Previous
Back to top

In this issue

The Journal of the American Board of Family     Medicine: 38 (1)
The Journal of the American Board of Family Medicine
Vol. 38, Issue 1
January-February 2025
  • Table of Contents
  • Table of Contents (PDF)
  • Cover (PDF)
  • Index by author
  • Back Matter (PDF)
  • Front Matter (PDF)
Print
Download PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on American Board of Family Medicine.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Performance Evaluation of the Generative Pre-trained Transformer (GPT-4) on the Family Medicine In-Training Examination
(Your Name) has sent you a message from American Board of Family Medicine
(Your Name) thought you would like to see the American Board of Family Medicine web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
2 + 6 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.
Citation Tools
Performance Evaluation of the Generative Pre-trained Transformer (GPT-4) on the Family Medicine In-Training Examination
Ting Wang, Arch G. Mainous, Keith Stelter, Thomas R. O’Neill, Warren P. Newton
The Journal of the American Board of Family Medicine Aug 2024, jabfm.2023.230433R1; DOI: 10.3122/jabfm.2023.230433R1

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
Performance Evaluation of the Generative Pre-trained Transformer (GPT-4) on the Family Medicine In-Training Examination
Ting Wang, Arch G. Mainous, Keith Stelter, Thomas R. O’Neill, Warren P. Newton
The Journal of the American Board of Family Medicine Aug 2024, jabfm.2023.230433R1; DOI: 10.3122/jabfm.2023.230433R1
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Introduction
    • Methods
    • Results
    • Discussion
    • Conclusions
    • Appendix
    • Notes
    • References
  • Figures & Data
  • References
  • Info & Metrics
  • PDF

Related Articles

  • No related articles found.
  • PubMed
  • Google Scholar

Cited By...

  • Clinically Relevant Family Medicine Research: Board Certification Updates
  • Artificial Intelligence and Family Medicine
  • Google Scholar

More in this TOC Section

  • Integrating Adverse Childhood Experiences and Social Risks Screening in Adult Primary Care
  • A Pilot Comparison of Clinical Data Collection Methods Using Paper, Electronic Health Record Prompt, and a Smartphone Application
  • Associations Between Modifiable Preconception Care Indicators and Pregnancy Outcomes
Show more Original Research

Similar Articles

Keywords

  • Continuing Education
  • Family Medicine
  • Medical Education

Navigate

  • Home
  • Current Issue
  • Past Issues

Authors & Reviewers

  • Info For Authors
  • Info For Reviewers
  • Submit A Manuscript/Review

Other Services

  • Get Email Alerts
  • Classifieds
  • Reprints and Permissions

Other Resources

  • Forms
  • Contact Us
  • ABFM News

© 2025 American Board of Family Medicine

Powered by HighWire