Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Entourage: all-in-one sequence analysis software for genome assembly, virus detection, virus discovery, and intrasample variation profiling.

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • المصدر:
      Publisher: BioMed Central Country of Publication: England NLM ID: 100965194 Publication Model: Electronic Cited Medium: Internet ISSN: 1471-2105 (Electronic) Linking ISSN: 14712105 NLM ISO Abbreviation: BMC Bioinformatics Subsets: MEDLINE
    • بيانات النشر:
      Original Publication: [London] : BioMed Central, 2000-
    • الموضوع:
    • نبذة مختصرة :
      Background: Pan-virus detection, and virome investigation in general, can be challenging, mainly due to the lack of universally conserved genetic elements in viruses. Metagenomic next-generation sequencing can offer a promising solution to this problem by providing an unbiased overview of the microbial community, enabling detection of any viruses without prior target selection. However, a major challenge in utilising metagenomic next-generation sequencing for virome investigation is that data analysis can be highly complex, involving numerous data processing steps.
      Results: Here, we present Entourage to address this challenge. Entourage enables short-read sequence assembly, viral sequence search with or without reference virus targets using contig-based approaches, and intrasample sequence variation quantification. Several workflows are implemented in Entourage to facilitate end-to-end virus sequence detection analysis through a single command line, from read cleaning, sequence assembly, to virus sequence searching. The results generated are comprehensive, allowing for thorough quality control, reliability assessment, and interpretation. We illustrate Entourage's utility as a streamlined workflow for virus detection by employing it to comprehensively search for target virus sequences and beyond in raw sequence read data generated from HeLa cell culture samples spiked with viruses. Furthermore, we showcase its flexibility and performance on a real-world dataset by analysing a preassembled Tara Oceans dataset. Overall, our results show that Entourage performs well even with low virus sequencing depth in single digits, and it can be used to discover novel viruses effectively. Additionally, by using sequence data generated from a patient with chronic SARS-CoV-2 infection, we demonstrate Entourage's capability to quantify virus intrasample genetic variations, and generate publication-quality figures illustrating the results.
      Conclusions: Entourage is an all-in-one, versatile, and streamlined bioinformatics software for virome investigation, developed with a focus on ease of use. Entourage is available at https://codeberg.org/CENMIG/Entourage under the MIT license.
      (© 2024. The Author(s).)
    • References:
      Nat Commun. 2016 Apr 13;7:11257. (PMID: 27071849)
      Microbiome. 2019 Jan 28;7(1):12. (PMID: 30691529)
      Genome Biol. 2019 Oct 22;20(1):217. (PMID: 31640809)
      Virology. 2014 Dec;471-473:54-60. (PMID: 25461531)
      Methods. 2016 Jun 1;102:3-11. (PMID: 27012178)
      Genome Res. 2017 May;27(5):824-834. (PMID: 28298430)
      Elife. 2015 Dec 11;4:. (PMID: 26652000)
      Nucleic Acids Res. 2012 Dec;40(22):11189-201. (PMID: 23066108)
      F1000Res. 2021 Jan 18;10:33. (PMID: 34035898)
      PLoS One. 2013;8(2):e57355. (PMID: 23468974)
      PeerJ. 2015 May 28;3:e985. (PMID: 26038737)
      Nat Biotechnol. 2021 May;39(5):578-585. (PMID: 33349699)
      PLoS Biol. 2023 Feb 13;21(2):e3001922. (PMID: 36780432)
      Bioinformatics. 2014 Aug 1;30(15):2114-20. (PMID: 24695404)
      Virus Evol. 2020 Aug 25;6(2):veaa065. (PMID: 33365150)
      Virus Evol. 2020 Dec 02;6(2):veaa091. (PMID: 33408878)
      Bioinformatics. 2021 Sep 29;37(18):3029-3031. (PMID: 33734313)
      Bioinformatics. 2018 Sep 1;34(17):i884-i890. (PMID: 30423086)
      Virology. 2018 Oct;523:74-88. (PMID: 30098450)
      Sci Rep. 2019 Mar 1;9(1):3219. (PMID: 30824715)
      Bioinformatics. 2010 Oct 1;26(19):2460-1. (PMID: 20709691)
      Annu Rev Biochem. 2012;81:795-822. (PMID: 22482909)
      BMC Genomics. 2015 Mar 25;16:236. (PMID: 25879410)
      Hepatology. 2015 Jun;61(6):1842-50. (PMID: 25645961)
      Int J Infect Dis. 2021 Mar;104:306-314. (PMID: 33444750)
      Genome Med. 2021 Feb 22;13(1):30. (PMID: 33618765)
      Bioinformatics. 2017 Jun 01;33(11):1730-1732. (PMID: 28130230)
      Bioinformatics. 2019 Mar 1;35(5):871-873. (PMID: 30124794)
      J Microbiol Methods. 2007 May;69(2):330-9. (PMID: 17391789)
      mSphere. 2017 Sep 13;2(5):. (PMID: 28932815)
      BMC Evol Biol. 2013 Jul 17;13:154. (PMID: 23865988)
      Microbiome. 2017 Jul 6;5(1):69. (PMID: 28683828)
      Virol J. 2013 Apr 12;10:116. (PMID: 23587185)
      Annu Rev Virol. 2020 Sep 29;7(1):63-81. (PMID: 32511081)
      Bioinformatics. 2012 Jun 1;28(11):1420-8. (PMID: 22495754)
      Proc Natl Acad Sci U S A. 2012 Apr 17;109(16):6241-6. (PMID: 22454494)
      Front Microbiol. 2017 Oct 31;8:2110. (PMID: 29163404)
      Nat Commun. 2018 Nov 19;9(1):4881. (PMID: 30451857)
      Nat Methods. 2015 Jan;12(1):59-60. (PMID: 25402007)
      Microbiol Mol Biol Rev. 2020 Mar 4;84(2):. (PMID: 32132243)
      Sci Rep. 2016 Mar 30;6:23774. (PMID: 27026381)
      PLoS Comput Biol. 2023 Aug 28;19(8):e1011422. (PMID: 37639475)
      Gigascience. 2019 Jun 1;8(6):. (PMID: 31220250)
      Bioinformatics. 2009 Aug 15;25(16):2078-9. (PMID: 19505943)
      J Gen Virol. 1999 Jul;80 ( Pt 7):1725-1733. (PMID: 10423141)
      Genome Biol. 2019 Jan 8;20(1):8. (PMID: 30621750)
      Genome Biol. 2019 Nov 28;20(1):257. (PMID: 31779668)
      PeerJ. 2017 Sep 21;5:e3817. (PMID: 28948103)
      Bioinformatics. 2018 Dec 15;34(24):4287-4289. (PMID: 29982281)
      Nat Microbiol. 2022 Feb;7(2):327-336. (PMID: 34972821)
      BMC Bioinformatics. 2009 Dec 15;10:421. (PMID: 20003500)
      Genome Res. 2014 Jul;24(7):1180-92. (PMID: 24899342)
      Virology. 2017 Mar;503:21-30. (PMID: 28110145)
      Annu Rev Pathol. 2019 Jan 24;14:319-338. (PMID: 30355154)
    • Grant Information:
      JRA-CO-2563-12568-TH National Science and Technology Development Agency; HSRI. 66-142 Health Systems Research Institute
    • Contributed Indexing:
      Keywords: Bioinformatics pipeline; Intrasample variation; Metagenome; Virome; Virus detection; Virus discovery
    • الموضوع:
      Date Created: 20240624 Date Completed: 20240625 Latest Revision: 20240627
    • الموضوع:
      20240627
    • الرقم المعرف:
      PMC11197340
    • الرقم المعرف:
      10.1186/s12859-024-05846-y
    • الرقم المعرف:
      38914932