I am using RNA-seq data processed by RSEM given by someone else. The data has this format:
sample1 sample2 sample3 sample4 sample5 sample6
A1BG 94.64 278.03 94.07 96.00 55.00 64.03
A1BG-AS1 64.28 114.08 98.88 109.05 95.32 73.11
A1CF 0.00 2.00 1.00 1.00 0.00 1.00
A2M 24.00 2.00 18.00 4.00 35.00 8.00
A2M-AS1 1.00 1.00 1.00 0.00 0.00 0.00
A2ML1 0.84 2.89 0.00 1.00 2.00 3.11
But seems like DESeq2 can't handle integers. I tried tximport but hasn't been able to implement it for it to work, also I don't understand how/whether tximport would help in this situation. Can I just use round(data) instead?
Anyone has better suggestion to deal with this kind of data?
Thanks so much!
The DESeq2 manuals says: "As input, the DESeq2 package expects count data as obtained", which means just handles whole numbers. I doubt that "round" is a good approach here, because you change the counts. Where does the e.g. 0.64 count from Sample1 A1BG come from?
I think it comes from the way RSEM assign their counts to certain genes since RSEM estimates abundances for transcript/genes.