Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Identification and control for the effects of bioinformatic globin depletion on human RNA-seq differential expression analysis.

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • معلومة اضافية
    • المصدر:
      Publisher: Nature Publishing Group Country of Publication: England NLM ID: 101563288 Publication Model: Electronic Cited Medium: Internet ISSN: 2045-2322 (Electronic) Linking ISSN: 20452322 NLM ISO Abbreviation: Sci Rep Subsets: MEDLINE
    • بيانات النشر:
      Original Publication: London : Nature Publishing Group, copyright 2011-
    • الموضوع:
    • نبذة مختصرة :
      When profiling blood samples by RNA-sequencing (RNA-seq), RNA from haemoglobin (Hgb) can account for up to 70% of the transcriptome. Due to considerations of sequencing depth and power to detect biological variation, Hgb RNA is typically depleted prior to sequencing by hybridisation-based methods; an alternative approach is to deplete reads arising from Hgb RNA bioinformatically. In the present study, we compared the impact of these two approaches on the outcome of differential gene expression analysis performed using RNA-seq data from 58 human tuberculosis (TB) patient or contact whole blood samples-29 globin kit-depleted and 29 matched non-depleted-a subset of which were taken at TB diagnosis and at six months post-TB treatment from the same patient. Bioinformatic depletion of Hgb genes from the non-depleted samples (bioinformatic-depleted) substantially reduced library sizes (median = 57.24%) and fewer long non-coding, micro, small nuclear and small nucleolar RNAs were captured in these libraries. Profiling published TB gene signatures across all samples revealed inferior correlation between kit-depleted and bioinformatic-depleted pairs when the proportion of reads mapping to Hgb genes was higher in the non-depleted sample, particularly at the TB diagnosis time point. A set of putative "globin-fingerprint" genes were identified by directly comparing kit-depleted and bioinformatic-depleted samples at each timepoint. Two TB treatment response signatures were also shown to have decreased differential performance when comparing samples at TB diagnosis to six months post-TB treatment when profiled on the bioinformatic-depleted samples compared with their kit-depleted counterparts. These results demonstrate that failure to deplete Hgb RNA prior to sequencing has a negative impact on the sensitivity to detect disease-relevant gene expression changes even when bioinformatic removal is performed.
      (© 2023. The Author(s).)
    • References:
      Field, L. A. et al. Functional identity of genes detectable in expression profiling assays following globin mRNA reduction of peripheral blood samples. Clin. Biochem. 40, 499–502. https://doi.org/10.1016/j.clinbiochem.2007.01.004 (2007). (PMID: 10.1016/j.clinbiochem.2007.01.004)
      Debey, S. et al. Comparison of different isolation techniques prior gene expression profiling of blood derived cells: Impact on physiological responses, on overall expression and the role of different cell types. Pharmacogen. J 4, 193–207. https://doi.org/10.1038/sj.tpj.6500240 (2004). (PMID: 10.1038/sj.tpj.6500240)
      Jang, J. S. et al. Comparative evaluation for the globin gene depletion methods for mRNA sequencing using the whole blood-derived total RNAs. BMC Genomics 21, 890. https://doi.org/10.1186/s12864-020-07304-4 (2020). (PMID: 10.1186/s12864-020-07304-4)
      Harrington, C. A. et al. RNA-seq of human whole blood: Evaluation of globin RNA depletion on Ribo-Zero library method. Sci. Rep. 10, 6271. https://doi.org/10.1038/s41598-020-62801-6 (2020). (PMID: 10.1038/s41598-020-62801-6)
      Mastrokolias, A., den Dunnen, J. T., van Ommen, G. B., t’ Hoen, P. A. & van Roon-Mom, W. M. (2012) Increased sensitivity of next generation sequencing-based expression profiling after globin reduction in human blood RNA. BMC Genomics 13, 28 https://doi.org/10.1186/1471-2164-13-28.
      Shin, H. et al. Variation in RNA-seq transcriptome profiles of peripheral whole blood from healthy individuals with and without globin depletion. PLoS One 9, e91041. https://doi.org/10.1371/journal.pone.0091041 (2014). (PMID: 10.1371/journal.pone.0091041)
      Murphy, W. G. The sex difference in haemoglobin levels in adults—Mechanisms, causes, and consequences. Blood Rev. 28, 41–47. https://doi.org/10.1016/j.blre.2013.12.003 (2014). (PMID: 10.1016/j.blre.2013.12.003)
      Jenkins, D., Zhao, Y., Johnson, W. E., Odom, A. & Love, C. TBSignatureProfiler: Profile RNA-seq data using TB pathway signatures, https://github.com/compbiomed/TBSignatureProfiler (2020).
      Bloom, C. I. et al. Detectable changes in the blood transcriptome are present after two weeks of anti-tuberculosis therapy. PLoS One 7, e46191. https://doi.org/10.1371/journal.pone.0046191 (2012). (PMID: 10.1371/journal.pone.0046191)
      Tabone, O. et al. Blood transcriptomics reveal the evolution and resolution of the immune response in tuberculosis. J. Exp. Med. 218, e20210915. https://doi.org/10.1084/jem.20210915 (2021). (PMID: 10.1084/jem.20210915)
      Raghavachari, N., Xu, X., Munson, P. J. & Gladwin, M. T. Characterization of whole blood gene expression profiles as a sequel to globin mRNA reduction in patients with sickle cell disease. PLoS One 4, e6484. https://doi.org/10.1371/journal.pone.0006484 (2009). (PMID: 10.1371/journal.pone.0006484)
      Yip, R., Johnson, C. & Dallman, P. R. Age-related changes in laboratory values used in the diagnosis of anemia and iron deficiency. Am. J. Clin. Nutr. 39, 427–436. https://doi.org/10.1093/ajcn/39.3.427 (1984). (PMID: 10.1093/ajcn/39.3.427)
      Vahlquist, B. The cause of the sexual differences in erythrocyte hemoglobin and serum iron levels in human adults. Blood 5, 874–875 (1950). (PMID: 10.1182/blood.V5.9.874.874)
      Tilling, L. et al. Endothelial function does not relate to haemoglobin or serum erythropoietin concentrations and these do not explain the gender difference in endothelial function in healthy middle-aged men and women. Eur. J. Clin. Invest. 43, 225–230. https://doi.org/10.1111/eci.12033 (2013). (PMID: 10.1111/eci.12033)
      Sico, J. J. et al. Association between admission haematocrit and mortality among men with acute ischaemic stroke. Stroke Vasc. Neurol. 3, 160. https://doi.org/10.1136/svn-2018-000149 (2018). (PMID: 10.1136/svn-2018-000149)
      Luo, M., Chen, Y., Cheng, Y., Li, N. & Qing, H. Association between hematocrit and the 30-days mortality of patients with sepsis: A retrospective analysis based on the large-scale clinical database MIMIC-IV. PLoS One 17, e0265758. https://doi.org/10.1371/journal.pone.0265758 (2022). (PMID: 10.1371/journal.pone.0265758)
      Modiano, D. et al. Haemoglobin C protects against clinical Plasmodium falciparum malaria. Nature 414, 305–308. https://doi.org/10.1038/35104556 (2001). (PMID: 10.1038/35104556)
      Agarwal, A. et al. Hemoglobin C associated with protection from severe malaria in the Dogon of Mali, a West African population with a low prevalence of hemoglobin S. Blood 96, 2358–2363 (2000). (PMID: 10.1182/blood.V96.7.2358)
      Mockenhaupt, F. P. et al. Hemoglobin C and resistance to severe malaria in Ghanaian children. J. Infect. Dis. 190, 1006–1009. https://doi.org/10.1086/422847 (2004). (PMID: 10.1086/422847)
      Holik, A. Z. et al. RNA-seq mixology: Designing realistic control experiments to compare protocols and analysis methods. Nucleic Acids Res. 45, e30–e30. https://doi.org/10.1093/nar/gkw1063 (2017). (PMID: 10.1093/nar/gkw1063)
      Gallego Romero, I., Pai, A. A., Tung, J. & Gilad, Y. RNA-seq: Impact of RNA degradation on transcript quantification. BMC Biol. 12, 42. https://doi.org/10.1186/1741-7007-12-42 (2014). (PMID: 10.1186/1741-7007-12-42)
      Liao, Y., Smyth, G. K. & Shi, W. The R package R subread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Res. 47, e47–e47. https://doi.org/10.1093/nar/gkz114 (2019). (PMID: 10.1093/nar/gkz114)
      Liao, Y., Smyth, G. K. & Shi, W. FeatureCounts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930. https://doi.org/10.1093/bioinformatics/btt656 (2014). (PMID: 10.1093/bioinformatics/btt656)
      RStudio Team. RStudio: Integrated Development Environment for R (RStudio, Inc, Boston, 2022).
      Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140. https://doi.org/10.1093/bioinformatics/btp616 (2010). (PMID: 10.1093/bioinformatics/btp616)
      Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184–1191. https://doi.org/10.1038/nprot.2009.97 (2009). (PMID: 10.1038/nprot.2009.97)
      Chen, Y., Lun, A. T. & Smyth, G. K. From reads to genes to pathways: Differential expression analysis of RNA-Seq experiments using R subread and the edge R quasi-likelihood pipeline. F1000Res 5, 1438 (2016).
      Hänzelmann, S., Castelo, R. & Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-Seq data. BMC Bioinform. 14, 7. https://doi.org/10.1186/1471-2105-14-7 (2013). (PMID: 10.1186/1471-2105-14-7)
    • Grant Information:
      United Kingdom WT_ Wellcome Trust; R01 HL106804 United States HL NHLBI NIH HHS; FC0010218 United Kingdom MRC_ Medical Research Council
    • الرقم المعرف:
      0 (Hemoglobins)
      63231-63-0 (RNA)
      0 (RNA, Messenger)
    • الموضوع:
      Date Created: 20230201 Date Completed: 20230211 Latest Revision: 20240409
    • الموضوع:
      20250114
    • الرقم المعرف:
      PMC9892020
    • الرقم المعرف:
      10.1038/s41598-023-28218-7
    • الرقم المعرف:
      36725870