Access the full text.
Sign up today, get DeepDyve free for 14 days.
Jun Li, R. Tibshirani (2013)
Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq dataStatistical Methods in Medical Research, 22
Jong Kim, J. Marioni (2013)
Inferring the kinetics of stochastic gene expression from single-cell RNA-sequencing dataGenome Biology, 14
Xiaohong Li, N. Cooper, T. O’Toole, E. Rouchka (2020)
Choice of library size normalization and statistical methods for differential gene expression analysis in balanced two-group comparisons for RNA-seq studiesBMC Genomics, 21
Chen Jia, L. Wang, G. Yin, Michael Zhang (2019)
Single-cell stochastic gene expression kinetics with coupled positive-plus-negative feedback.Physical review. E, 100 5-1
M. Robinson, Davis McCarthy, G. Smyth (2009)
edgeR: a Bioconductor package for differential expression analysis of digital gene expression dataBioinformatics, 26
C. Soneson, M. Delorenzi (2013)
A comparison of methods for differential expression analysis of RNA-seq dataBMC Bioinformatics, 14
Lei Wei, Jianmin Wang, E. Lampert, S. Schlanger, Adam DePriest, Q. Hu, E. Gomez, Mitsuko Murakam, S. Glenn, J. Conroy, C. Morrison, G. Azabdaftari, J. Mohler, Song Liu, H. Heemers (2017)
Intratumoral and Intertumoral Genomic Heterogeneity of Multifocal Localized Prostate Cancer Impacts Molecular Classifications and Genomic Prognosticators.European urology, 71 2
R. Stark, Marta Grzelak, J. Hadfield (2019)
RNA sequencing: the teenage yearsNature Reviews Genetics, 20
M. Dillies, A. Rau, J. Aubert, C. Hennequet-Antier, M. Jeanmougin, N. Servant, C. Keime, G. Marot, D. Castel, J. Estellé, G. Guernec, B. Jagla, L. Jouneau, D. Laloë, C. Gall, B. Schaeffer, S. Crom, M. Guedj, F. Jaffrézic (2013)
A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysisBriefings in bioinformatics, 14 6
R. Burrell, C. Swanton (2014)
Tumour heterogeneity and the evolution of polyclonal drug resistanceMolecular Oncology, 8
E. Maza (2016)
In Papyro Comparison of TMM (edgeR), RLE (DESeq2), and MRN Normalization Methods for a Simple Two-Conditions-Without-Replicates RNA-Seq Experimental DesignFrontiers in Genetics, 7
P. Moulos, P. Hatzis (2014)
Systematic integration of RNA-Seq statistical algorithms for accurate detection of differential gene expression patternsNucleic Acids Research, 43
Fatemeh Seyednasrollah, A. Laiho, L. Elo (2013)
Comparison of software packages for detecting differential expression in RNA-seq studiesBriefings in Bioinformatics, 16
A. Marusyk, K. Polyak (2010)
Tumor heterogeneity: causes and consequences.Biochimica et biophysica acta, 1805 1
I. Dagogo-Jack, A. Shaw (2018)
Tumour heterogeneity and resistance to cancer therapiesNature Reviews Clinical Oncology, 15
T. Ching, Sijia Huang, L. Garmire (2014)
Power analysis and sample size estimation for RNA-Seq differential expressionRNA, 20
John Quackenbush (2002)
Microarray data normalization and transformationNature Genetics, 32 Suppl 2
Marek Gierlinski, Christian Cole, Pietá Schofield, N. Schurch, A. Sherstnev, Vijender Singh, Nicola Wrobel, Karim Gharbi, Gordon Simpson, Tom Owen-Hughes, Mark Blaxter, Geoffrey Barton (2015)
Statistical models for RNA-seq data derived from a two-condition 48-replicate experimentBioinformatics, 31
Fatih Ozsolak, P. Milos (2011)
RNA sequencing: advances, challenges and opportunitiesNature Reviews Genetics, 12
Zhong Wang, M. Gerstein, M. Snyder (2009)
RNA-Seq: a revolutionary tool for transcriptomicsNature Reviews Genetics, 10
N. Schurch, Pietá Schofield, M. Gierliński, C. Cole, A. Sherstnev, Vijender Singh, N. Wrobel, K. Gharbi, G. Simpson, T. Owen-Hughes, M. Blaxter, G. Barton (2015)
How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?RNA, 22
Zong-Hong Zhang, D. Jhaveri, Vikki Marshall, Denis Bauer, J. Edson, Ramesh Narayanan, GregoryJ Robinson, Andreas Lundberg, P. Bartlett, N. Wray, Qiongyi Zhao (2014)
A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq DataPLoS ONE, 9
A. Conesa, Pedro Madrigal, Sonia Tarazona, D. Gómez-Cabrero, A. Cervera, A. McPherson, M. Szcześniak, Daniel Gaffney, L. Elo, Xuegong Zhang, A. Mortazavi (2016)
A survey of best practices for RNA-seq data analysisGenome Biology, 17
Franck Rapaport, R. Khanin, Yupu Liang, Mono Pirun, A. Krek, P. Zumbo, C. Mason, N. Socci, D. Betel (2013)
Comprehensive evaluation of differential gene expression analysis methods for RNA-seq dataGenome Biology, 14
J. Sampson, S. Boca, S. Moore, R. Heller (2018)
FWER and FDR control when testing multiple mediatorsBioinformatics, 34
K. Hansen, Zhijin Wu, R. Irizarry, J. Leek (2011)
Sequencing technology does not eliminate biological variabilityNature biotechnology, 29
Sahar Seesi, Y. Tiagueu, A. Zelikovsky, I. Măndoiu (2014)
Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicatesBMC Genomics, 15
L. Maretty, J. Sibbesen, Anders Krogh (2014)
METHOD Open Access
A. Esteve-Codina, O. Arpí, M. Martínez-García, E. Pineda, M. Mallo, M. Gut, C. Carrato, Anna Rovira, R. Lopez, A. Tortosa, Marc Dabad, S. Barco, S. Heath, S. Bagué, T. Ribalta, F. Alameda, N. Iglesia, C. Balañà (2017)
A Comparison of RNA-Seq Results from Paired Formalin-Fixed Paraffin-Embedded and Fresh-Frozen Glioblastoma Tissue SamplesPLoS ONE, 12
Yuwen Liu, Jie Zhou, K. White (2014)
RNA-seq differential expression studies: more sequence or more replication?Bioinformatics, 30 3
M.D. Robinson, A. Oshlack (2010)
A scaling normalization method for differential expression analysis of RNA‐seq data, 11
L. Yin, L. Xiao, Yong Gao, Guangfu Wang, Hao Gao, Yunpeng Peng, Xiaole Zhu, Jishu Wei, Y. Miao, K. Jiang, Zipeng Lu (2020)
Comparative bioinformatical analysis of pancreatic head cancer and pancreatic body/tail cancerMedical Oncology, 37
K. Tomczak, Patrycja Czerwińska, M. Wiznerowicz (2015)
The Cancer Genome Atlas (TCGA): an immeasurable source of knowledgeContemporary Oncology, 19
M. Love, Wolfgang Huber, Simon Anders (2014)
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2Genome Biology, 15
Sophie Lamarre, P. Frasse, M. Zouine, Delphine Labourdette, Elise Sainderichin, Guojian Hu, V. Berre-Anton, M. Bouzayen, E. Maza (2018)
Optimization of an RNA-Seq Differential Gene Expression Analysis Depending on Biological Replicate Number and Library SizeFrontiers in Plant Science, 9
Yanzhu Lin, Kseniya Golovnina, Zhen-Xia Chen, Hang Lee, Y. Negron, Hina Sultana, Brian Oliver, Susan Harbison (2016)
Comparison of normalization and differential expression analyses using RNA-Seq data from 726 individual Drosophila melanogasterBMC Genomics, 17
A. Poplawski, H. Binder (2017)
Feasibility of sample size calculation for RNA‐seq studiesBriefings in Bioinformatics, 19
José Robles, Sumaira Qureshi, S. Stephen, Susan Wilson, C. Burden, Jennifer Taylor (2012)
Efficient experimental design and analysis strategies for the detection of differential expression using RNA-SequencingBMC Genomics, 13
Bingqing Lin, Zhen Pang (2019)
Stability of methods for differential expression analysis of RNA-seq dataBMC Genomics, 20
Jun Li, D. Witten, I. Johnstone, R. Tibshirani (2012)
Normalization, testing, and false discovery rate estimation for RNA-sequencing data.Biostatistics, 13 3
Weitong Cui, Huaru Xue, Lei Wei, Jinghua Jin, Xuewen Tian, Qinglu Wang (2021)
High heterogeneity undermines generalization of differential expression results in RNA-Seq analysisHuman Genomics, 15
L. Kwong, M. Macedo, L. Haydu, Aron Joon, M. Tetzlaff, T. Calderone, Chiang-Jun Wu, M. Kwong, J. Roszik, K. Hess, M. Davies, A. Lazar, J. Gershenwald (2018)
Biological Validation of RNA Sequencing Data from Formalin-Fixed Paraffin-Embedded Primary Melanomas.JCO precision oncology, 2018
Danni Yu, W. Huber, O. Vitek (2013)
Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample sizeBioinformatics, 29
Great efforts have been made on the algorithms that deal with RNA‐seq data to enhance the accuracy and efficiency of differential expression (DE) analysis. However, no consensus has been reached on the proper threshold values of fold change and adjusted p‐value for filtering differentially expressed genes (DEGs). It is generally believed that the more stringent the filtering threshold, the more reliable the result of a DE analysis. Nevertheless, by analyzing the impact of both adjusted p‐value and fold change thresholds on DE analyses, with RNA‐seq data obtained for three different cancer types from the Cancer Genome Atlas (TCGA) database, we found that, for a given sample size, the reproducibility of DE results became poorer when more stringent thresholds were applied. No matter which threshold level was applied, the overlap rates of DEGs were generally lower for small sample sizes than for large sample sizes. The raw read count analysis demonstrated that the transcript expression of the same gene in different samples, whether in tumor groups or in normal groups, showed high variations, which resulted in a drastic fluctuation in fold change values and adjustedp‐values when different sets of samples were used. Overall, more stringent thresholds did not yield more reliable DEGs due to high variations in transcript expression; the reliability of DEGs obtained with small sample sizes was more susceptible to these variations. Therefore, less stringent thresholds are recommended for screening DEGs. Moreover, large sample sizes should be considered in RNA‐seq experimental designs to reduce the interfering effect of variations in transcript expression on DEG identification.
Annals of Human Genetics – Wiley
Published: Nov 1, 2021
Keywords: Differential expression; false discovery rate; fold change; RNA‐seq; sample size; threshold
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.