Harnessing Natural Language Processing to Support Decisions Around Workplace-Based Assessment: Machine Learning Study of Competency-Based Medical Education

Item request has been placed!

Item request cannot be made.

Processing Request

اقرأ أكثر حفظ في قائمتي

المؤلفون: Yilmaz, Yusuf; Nunez, Alma Jurado; Ariaeinejad, Ali; Lee, Mark; Sherbino, Jonathan; Chan, Teresa M.
الموضوع:
natural language processing; machine learning algorithms; competency-based medical education; assessment; medical education; medical residents; machine learning; work performance; prediction models; Modular Assessment Program; Performance; Residents; Committees; Reliability; Cognition; Scores; Mcmap
نوع التسجيلة:
article in journal/newspaper
اللغة:
English

معلومة اضافية
- بيانات النشر:
  Jmir Publications, Inc
- الموضوع:
  2022
- Collection:
  Ege University Institutional Repository
- نبذة مختصرة :
  Background: Residents receive a numeric performance rating (eg, 1-7 scoring scale) along with a narrative (ie, qualitative) feedback based on their performance in each workplace-based assessment (WBA). Aggregated qualitative data from WBA can be overwhelming to process and fairly adjudicate as part of a global decision about learner competence. Current approaches with qualitative data require a human rater to maintain attention and appropriately weigh various data inputs within the constraints of working memory before rendering a global judgment of performance. Objective: This study explores natural language processing (NLP) and machine learning (ML) applications for identifying trainees at risk using a large WBA narrative comment data set associated with numerical ratings. Methods: NLP was performed retrospectively on a complete data set of narrative comments (ie, text-based feedback to residents based on their performance on a task) derived from WBAs completed by faculty members from multiple hospitals associated with a single, large, residency program at McMaster University, Canada. Narrative comments were vectorized to quantitative ratings using the bag-of-n-grams technique with 3 input types: unigram, bigrams, and trigrams. Supervised ML models using linear regression were trained with the quantitative ratings, performed binary classification, and output a prediction of whether a resident fell into the category of at risk or not at risk. Sensitivity, specificity, and accuracy metrics are reported. Results: The database comprised 7199 unique direct observation assessments, containing both narrative comments and a rating between 3 and 7 in imbalanced distribution (scores 3-5: 726 ratings; and scores 6-7: 4871 ratings). A total of 141 unique raters from 5 different hospitals and 45 unique residents participated over the course of 5 academic years. When comparing the 3 different input types for diagnosing if a trainee would be rated low (ie, 1-5) or high (ie, 6 or 7), our accuracy for trigrams was 87%, bigrams ...
- Relation:
  Jmir Medical Education; Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı; https://doi.org/10.2196/30537; https://hdl.handle.net/11454/78188; WOS:000848716700003; Q1; N/A
- الرقم المعرف:
  10.2196/30537
- الدخول الالكتروني :
  https://hdl.handle.net/11454/78188
  https://doi.org/10.2196/30537
- Rights:
  info:eu-repo/semantics/openAccess
- الرقم المعرف:
  edsbas.496677F0

تعليقات

No Comments.

Harnessing Natural Language Processing to Support Decisions Around Workplace-Based Assessment: Machine Learning Study of Competency-Based Medical Education

اتصل بنا

اتبع