hello all,
I am using featurecount for differential expression analysis. After running feature count I found out there are very less number of reads assigned successfully (33%). where as my SAM file (aligned by STAR) showing 82% mapped reads. I tried both counting by exon and gene feature. Featurecount code that I have used -
featureCounts -M -p -O -t exon -a /home/amol/work/Cuffcompare/cuffcmp.combined.gtf -o Bcyt.count3 /home/amol/work/STAR/Bcyt/Aligned.nsorted.bam
Summery of feature count -
Input files : 1 BAM file ||
|| P /home/amol/work/STAR/Bcyt/Aligned.nsorted.bam ||
|| ||
|| Output file : Bcyt.count3 ||
|| Annotations : /home/amol/work/Cuffcompare/cuffcmp.combined ... ||
|| ||
|| Threads : 1 ||
|| Level : meta-feature level ||
|| Paired-end : yes ||
|| Strand specific : no ||
|| Multimapping reads : counted ||
|| Multi-overlapping reads : counted ||
|| ||
|| Chimeric reads : counted ||
|| Both ends mapped : not required ||
|| ||
\\===================== http://subread.sourceforge.net/ ======================//
//================================= Running ==================================\\
|| ||
|| Load annotation file /home/amol/work/Cuffcompare/cuffcmp.combined.gtf ... ||
|| Features : 84237 ||
|| Meta-features : 16590 ||
|| Chromosomes : 4940 ||
|| ||
|| Process BAM file /home/amol/work/STAR/Bcyt/Aligned.nsorted.bam... ||
|| Paired-end reads are included. ||
|| Assign fragments (read pairs) to features... ||
|| Found reads that are not properly paired. ||
|| (missing mate or the mate is not the next read) ||
|| 2580055 reads have missing mates. ||
|| Input was converted to a format accepted by featureCounts. ||
|| Total fragments : 5314957 ||
|| Successfully assigned fragments : 1771088 (33.3%) ||
|| Running time : 2.54 minutes
can somebody please tell me whats going wrong?
thank you
amol
Perhaps I'm being pedantic, but this isn't really a Bioconductor question. It seems like you're using the standalone C version of
featureCounts
, rather than that inRsubread
. A more appropriate forum might be the Subread mailing list at https://groups.google.com/forum/#!forum/subread.Thanks Aaron for directing me!
But does this really matters because background is same, process is same. and you are understanding why it is happening. whether it is C version or R version how would it be different?
From what I understand, there could be some slight differences with versioning and/or default parameter settings between the
Rsubread
and standalone C implementations. So, it's worthwhile posting to the appropriate forum.