Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Microseek: A Protein-Based Metagenomic Pipeline for Virus Diagnostic and Discovery

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • Contributors:
      Institut Pasteur Paris (IP); Découverte de pathogènes – Pathogen discovery; Institut Pasteur Paris (IP)-Université Paris Cité (UPCité); Hub Bioinformatique et Biostatistique - Bioinformatics and Biostatistics HUB; École nationale vétérinaire d'Alfort (ENVA); This research received no external funding.; We thank Pascal Campagne for his critical review.
    • بيانات النشر:
      HAL CCSD
      MDPI
    • الموضوع:
      2022
    • Collection:
      Institut Pasteur: HAL
    • نبذة مختصرة :
      International audience ; We present Microseek, a pipeline for virus identification and discovery based on RVDB-prot, a comprehensive, curated and regularly updated database of viral proteins. Microseek analyzes metagenomic Next Generation Sequencing (mNGS) raw data by performing quality steps, de novo assembly, and by scoring the Lowest Common Ancestor (LCA) from translated reads and contigs. Microseek runs on a local computer. The outcome of the pipeline is displayed through a user-friendly and dynamic graphical interface. Based on two representative mNGS datasets de-rived from human tissue and plasma specimens, we illustrate how Microseek works, and we report its performances. In silico spikes of known viral sequences, but also spikes of fake Neo-pneumovirus viral sequences generated with variable evolutionary distances from known mem-bers of the Pneumoviridae family, were used. Results were compared to Chan Zuckerberg ID (CZ ID), a reference cloud-based mNGS pipeline. We show that Microseek reliably identifies known viral sequences and performs well for the detection of distant pseudoviral sequences, especially in complex samples such as in human plasma, while minimizing non-relevant hits.
    • Relation:
      pasteur-03773653; https://pasteur.hal.science/pasteur-03773653; https://pasteur.hal.science/pasteur-03773653/document; https://pasteur.hal.science/pasteur-03773653/file/viruses-14-01990-v2.pdf
    • الرقم المعرف:
      10.3390/v14091990
    • الدخول الالكتروني :
      https://pasteur.hal.science/pasteur-03773653
      https://pasteur.hal.science/pasteur-03773653/document
      https://pasteur.hal.science/pasteur-03773653/file/viruses-14-01990-v2.pdf
      https://doi.org/10.3390/v14091990
    • Rights:
      http://creativecommons.org/licenses/by/ ; info:eu-repo/semantics/OpenAccess
    • الرقم المعرف:
      edsbas.CA8EEBDF