DESeq: filter out low count reads
2
0
Entering edit mode
ccheung • 0
@ccheung-7248
Last seen 9.1 years ago
European Union

Hi Everybody,

I 44 samples and >76000 transcripts in my RNA Seq data which was counted using FeatureCounts.

I'd like to reduce the number of transcripts by filtering out the transcripts in which 80% of the samples (35 samples) have >10 reads.

Can anyone suggest how I may do so?

Thanx.

carol

deseq filter reads • 3.3k views
ADD COMMENT
2
Entering edit mode
@mikelove
Last seen 9 hours ago
United States

hi Carol,

In DESeq2, filtering of low count reads is handled automatically in the results() step. This uses software from the genefilter package on Bioconductor, to optimize the number of adjusted p-values less than a given value, say 0.1. We discuss the logic behind this here in the Independent Filtering section of the new paper:

http://genomebiology.com/2014/15/12/550

So you can either switch to DESeq2 (recommended) and this is taken care of for you, or you can use the genefilter package with DESeq.

Note: you should use gene counts, not transcript/isoform counts with DESeq or DESeq2. Search the support site for "DESeq transcript-level" for many discussion on why this is the case.

If you absolutely need to get the rows that satisfy the above criterion, it would be:

use = apply(counts(dds), 1, function(k) mean(k > 10)) > 0.8
ADD COMMENT
0
Entering edit mode
ccheung • 0
@ccheung-7248
Last seen 9.1 years ago
European Union

Thank you Michael!

ADD COMMENT

Login before adding your answer.

Traffic: 627 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6