Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Impact of synthetic data on training a deep learning model for lesion detection and classification in contrast-enhanced mammography

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • الموضوع:
      2025
    • Collection:
      Maastricht University Research Publications
    • نبذة مختصرة :
      PURPOSE: Predictive models for contrast-enhanced mammography often perform better at detecting and classifying enhancing masses than (non-enhancing) microcalcification clusters. We aim to investigate whether incorporating synthetic data with simulated microcalcification clusters during training can enhance model performance. APPROACH: Microcalcification clusters were simulated in low-energy images of lesion-free breasts from 782 patients, considering local texture features. Enhancement was simulated in the corresponding recombined images. A deep learning (DL) model for lesion detection and classification was trained with varying ratios of synthetic and real (850 patients) data. In addition, a handcrafted radiomics classifier was trained using delineations and class labels from real data, and predictions from both models were ensembled. Validation was performed on internal (212 patients) and external (279 patients) real datasets. RESULTS: The DL model trained exclusively with synthetic data detected over 60% of malignant lesions. Adding synthetic data to smaller real training sets improved detection sensitivity for malignant lesions but decreased precision. Performance plateaued at a detection sensitivity of 0.80. The ensembled DL and radiomics models performed worse than the standalone DL model, decreasing the area under this receiver operating characteristic curve from 0.75 to 0.60 on the external validation set, likely due to falsely detected suspicious regions of interest. CONCLUSIONS: Synthetic data can enhance DL model performance, provided model setup and data distribution are optimized. The possibility to detect malignant lesions without real data present in the training set confirms the utility of synthetic data. It can serve as a helpful tool, especially when real data are scarce, and it is most effective when complementing real data.
    • Relation:
      info:eu-repo/semantics/altIdentifier/pissn/2329-4302; info:eu-repo/semantics/altIdentifier/eissn/2329-4310
    • الرقم المعرف:
      10.1117/1.JMI.12.S2.S22006
    • الدخول الالكتروني :
      https://cris.maastrichtuniversity.nl/en/publications/ac15ab1e-6746-4e89-94e3-c49abae52b72
      https://doi.org/10.1117/1.JMI.12.S2.S22006
    • Rights:
      info:eu-repo/semantics/openAccess ; http://creativecommons.org/licenses/by/4.0/
    • الرقم المعرف:
      edsbas.E63BA3F0