Print a matrix from edgeR that is normalized by CPM and by spike-in
R ▴ 40
@r-5604
I have a small RNA dataset and want to create an expression matrix that is normalized by CPM and also normalized to the expression of my spike-in controls. What is the best way to do this?


# Small RNA counts, with sample groups
d.RNA <- edgeR::DGEList(counts = round(counts), group = group)
# Spike-in control counts
d.Spike <- edgeR::DGEList(counts = s)

 


On a side note, you no longer have to round your counts to the nearest whole number to use edgeR, but you should still ensure that they are on a raw count scale, even if they represent estimated or split counts.

@james-w-macdonald-5106

You can't do both, because algebraically the library sizes cancel out. If you normalize by CPM and then normalize by housekeeping genes, the library sizes cancel (because the housekeeping genes have been normalized by CPM as well), so the result is the same as if you had normalized by the housekeeping genes alone.
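To make the cancellation concrete, here is a short worked sketch. The notation is introduced only for illustration and is not from the original posts: $y_{gj}$ is the raw count for feature $g$ in sample $j$, $N_j$ is the library size of sample $j$, and $H_j$ is the total raw count of the reference features (housekeeping genes or spike-ins) in sample $j$. Summarizing the reference features by their sum is an assumption made for simplicity; other summaries that scale linearly with the counts should behave the same way.

```latex
% CPM for feature g in sample j
\mathrm{CPM}_{gj} = \frac{10^{6}\, y_{gj}}{N_j}

% CPM of the reference (housekeeping or spike-in) total in sample j
\mathrm{CPM}^{\mathrm{ref}}_{j} = \frac{10^{6}\, H_j}{N_j}

% Normalizing the CPM values by the reference CPM: the library size N_j cancels
\frac{\mathrm{CPM}_{gj}}{\mathrm{CPM}^{\mathrm{ref}}_{j}}
  = \frac{10^{6}\, y_{gj} / N_j}{10^{6}\, H_j / N_j}
  = \frac{y_{gj}}{H_j}
```

The final expression no longer contains $N_j$, so applying CPM first changes nothing: the result is the raw count divided by the reference total.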

In my experience (here 'experience' means working with NanoString data, using CodeSets that varied from ~200 to ~800 genes), once you get past about 400-500 genes, normalizing to library size is a better way to go (where 'better' is defined by lower variance of technical replicates).

 
