Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

MG-Digger: An Automated Pipeline to Search for Giant Virus-Related Sequences in Metagenomes

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • Contributors:
      Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE); Institut de Recherche pour le Développement (IRD)-Aix Marseille Université (AMU)-Institut National de la Santé et de la Recherche Médicale (INSERM)-IFR48; Institut des sciences biologiques - CNRS Biologie (INSB-CNRS)-Institut des sciences biologiques - CNRS Biologie (INSB-CNRS)-Centre National de la Recherche Scientifique (CNRS)
    • بيانات النشر:
      CCSD
      Frontiers Media
    • الموضوع:
      2016
    • Collection:
      Aix-Marseille Université: HAL
    • نبذة مختصرة :
      International audience ; The number of metagenomic studies conducted each year is growing dramatically. Storage and analysis of such big data is difficult and time-consuming. Interestingly, analysis shows that environmental and human metagenomes include a significant amount of non-annotated sequences, representing a `dark matter.' We established a bioinformatics pipeline that automatically detects metagenome reads matching query sequences from a given set and applied this tool to the detection of sequences matching large and giant DNA viral members of the proposed order Megavirales or virophages. A total of 1,045 environmental and human metagenomes (approximate to Terabase) were collected, processed, and stored on our bioinformatics server. In addition, nucleotide and protein sequences from 93 Megavirales representatives, including 19 giant viruses of amoeba, and 5 virophages, were collected. The pipeline was generated by scripts written in Python language and entitled MG-Digger. Metagenomes previously found to contain megavirus-like sequences were tested as controls. MG-Digger was able to annotate 100s of metagenome sequences as best matching those of giant viruses. These sequences were most often found to be similar to phycodnavirus or mimivirus sequences, but included reads related to recently available pandoraviruses, Pithovirus sibericum, and faustoviruses. Compared to other tools, MG-Digger combined stand-alone use on Linux or Windows operating systems through a user-friendly interface, implementation of ready-to-use customized metagenome databases and query sequence databases, adjustable parameters for BLAST searches, and creation of output files containing selected reads with best match identification. Compared to Metavir 2, a reference tool in viral metagenome analysis. MG-Digger detected 8% more true positive Megavirales-related reads in a control metagenome. The present work shows that massive, automated and recurrent analyses of metagenomes are effective in improving knowledge about the presence and ...
    • الرقم المعرف:
      10.3389/fmicb.2016.00428
    • الدخول الالكتروني :
      https://hal.science/hal-01459566
      https://hal.science/hal-01459566v1/document
      https://hal.science/hal-01459566v1/file/fmicb-07-00428.pdf
      https://doi.org/10.3389/fmicb.2016.00428
    • Rights:
      http://creativecommons.org/licenses/by/ ; info:eu-repo/semantics/OpenAccess
    • الرقم المعرف:
      edsbas.49AEB5F5