Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Chemically Aware Model Builder (camb): an R package for property and bioactivity modelling of small molecules

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • Contributors:
      UM, Maya; University of Cambridge [UK] (CAM); Bioinformatique structurale - Structural Bioinformatics; Institut Pasteur [Paris] (IP)-Centre National de la Recherche Scientifique (CNRS); European Bioinformatics Institute [Hinxton] (EMBL-EBI); EMBL Heidelberg; Unilever Research Port Sunlight Laboratory Bebington L63 3JW Wirral UK; ICC thanks the Paris-Pasteur International PhD Programme and Institut Pasteur for funding. TM thanks CNRS and Institut Pasteur for funding. DSM and RCG thanks Unilever for funding. GvW thanks EMBL (EIPOD) and Marie Curie (COFUND) for funding. AB thanks Unilever and the European Research Commission (Starting Grant ERC-2013-StG 336159 MIXTURE) for funding.; European Project: 336159,EC:FP7:ERC,ERC-2013-StG,MIXTURE(2014); Institut Pasteur [Paris]-Centre National de la Recherche Scientifique (CNRS); Apollo - University of Cambridge Repository
    • بيانات النشر:
      Springer Science and Business Media LLC, 2015.
    • الموضوع:
      2015
    • نبذة مختصرة :
      BACKGROUND: In silico predictive models have proved to be valuable for the optimisation of compound potency, selectivity and safety profiles in the drug discovery process. RESULTS: camb is an R package that provides an environment for the rapid generation of quantitative Structure-Property and Structure-Activity models for small molecules (including QSAR, QSPR, QSAM, PCM) and is aimed at both advanced and beginner R users. camb's capabilities include the standardisation of chemical structure representation, computation of 905 one-dimensional and 14 fingerprint type descriptors for small molecules, 8 types of amino acid descriptors, 13 whole protein sequence descriptors, filtering methods for feature selection, generation of predictive models (using an interface to the R package caret), as well as techniques to create model ensembles using techniques from the R package caretEnsemble). Results can be visualised through high-quality, customisable plots (R package ggplot2). CONCLUSIONS: Overall, camb constitutes an open-source framework to perform the following steps: (1) compound standardisation, (2) molecular and protein descriptor calculation, (3) descriptor pre-processing and model training, visualisation and validation, and (4) bioactivity/property prediction for new molecules. camb aims to speed model generation, in order to provide reproducibility and tests of robustness. QSPR and proteochemometric case studies are included which demonstrate camb's application.Graphical abstractFrom compounds and data to models: a complete model building workflow in one package.
    • File Description:
      application/pdf
    • ISSN:
      1758-2946
    • الرقم المعرف:
      10.1186/s13321-015-0086-2
    • الرقم المعرف:
      10.17863/cam.22398
    • Rights:
      CC BY
      CC BY SA
    • الرقم المعرف:
      edsair.doi.dedup.....c85d64095569f8b3ddb8a264cd299cbb