Question

[DESeq2] sequin [ANAQUIN] correction, too many factors in model

0

Entering edit mode

chaitra.sathyaprakash • 0

@chaitrasathyaprakash-20948

Last seen 3.9 years ago

Hello,

I have an RNA seq dataset obtained from mRNA ribosomal pulldown (TRAP) from a mutated and control cell line. I also have input samples for each of these. Due to the nature of TRAP, these samples are strongly enriched in mRNA, and the inputs are similar to bulk RNA sequencing, with a more complex library. Each sample was spiked with sequins (http://sequins.xyz) from mixture A or B, which I want to use to normalise my data. I have additional potential sources of batch effect including lane and differentiation day. How should I decide what to include in my final DESeq GLM?

What is the best way of checking what the greatest batch effect on this kind of data is? So far I have not been able to obtain a nice correlation of the observed LFC with expected LFC for the sequins. I have tried only using the TRAP samples in the DESeq object in case the samples are too different to analyse together, but this does not fix the problem either.

Many thanks, Chaitra

deseq2 ruvseq anaquin sequins • 913 views

ADD COMMENT • link updated 5.7 years ago by Michael Love 43k • written 5.7 years ago by chaitra.sathyaprakash • 0

score 0 · Answer 1 · 2019-06-19

0

Entering edit mode

Michael Love 43k

@mikelove

Last seen 1 day ago

United States

Going in depth into how to normalize here is beyond the software support I can provide at the moment. If you have spike ins, you can use these to normalize with controlGenes described in ?estimateSizeFactors. And I like to use MultiQC and PCA plots (see vignette and workflow) to examine batch effects.

ADD COMMENT • link 5.7 years ago Michael Love 43k