Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Preventing dataset shift from breaking machine-learning biomarkers

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • Contributors:
      McGill University = Université McGill [Montréal, Canada]; Modelling brain structure, function and variability based on high-field MRI data (PARIETAL); Service NEUROSPIN (NEUROSPIN); Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)); Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)); Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Inria Saclay - Ile de France; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria); This work was partially funded by DirtyData - ANR-17-CE23-0018, the National Institutes of Health (NIH) NIH-NIBIB P41 EB019936 (ReproNim) NIH-NIMH R01 MH083320 (CANDIShare) and NIH RF1 MH120021 (NIDM), the National Institute Of Mental Health under Award Number R01MH096906 (Neurosynth), the Canada First Research Excellence Fund, awarded to McGill University for the Healthy Brains for Healthy Lives initiative and the Brain Canada Foundation with support from Health Canada, Health Canada, through the Canada Brain Research Fund in partnership with the Montreal Neurological Institute.; ANR-17-CE23-0018,DirtyData,Intégration et nettoyage de données pour l'analyse statistique(2017); Inria Saclay - Ile de France; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Service NEUROSPIN (NEUROSPIN); Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)
    • بيانات النشر:
      Oxford University Press, 2021.
    • الموضوع:
      2021
    • نبذة مختصرة :
      Machine learning brings the hope of finding new biomarkers extracted from cohorts with rich biomedical measurements. A good biomarker is one that gives reliable detection of the corresponding condition. However, biomarkers are often extracted from a cohort that differs from the target population. Such a mismatch, known as a dataset shift, can undermine the application of the biomarker to new individuals. Dataset shifts are frequent in biomedical research, e.g. because of recruitment biases. When a dataset shift occurs, standard machine-learning techniques do not suffice to extract and validate biomarkers. This article provides an overview of when and how dataset shifts breaks machine-learning extracted biomarkers, as well as detection and correction strategies.
      Comment: GigaScience, BioMed Central, In press
    • ISSN:
      2047-217X
    • Rights:
      OPEN
    • الرقم المعرف:
      edsair.doi.dedup.....d31a0ad6e09debb1c6ffff82ff5bd2d2