Question

Is it advisable to remove X and Y chromosome genes in mouse bulk RNA-seq data at the level of the count matrix?

0

Entering edit mode

charlesgwellem ▴ 10

@charlesgwellem-16293

Last seen 3.0 years ago

Germany

From this link remove X and Y chromosome genes in RNA-seq data using DESeq2 pipeline , I have learned that depending on context, it is perfectly valid to remove X and Y chromosomal genes in RNA-seq data before doing differential expression analysis. However I only have access to the count matrix and not the bam files of the RNA seq data I am analysing. Is it advisable to do the X and Y chromosome gene removal at the level of the count matrix? I am talking about removing all rows that contain X and Y chromosomal genes from the .csv count matrix file read in to R before proceeding to create the DESeq2 object.

Thank you in advance for your kind response.

deseq2 • 2.0k views

ADD COMMENT • link updated 6.0 years ago by Luiz ▴ 10 • written 6.0 years ago by charlesgwellem ▴ 10

score 2 · Answer 1 · 2019-04-09

2

Entering edit mode

Michael Love 43k

@mikelove

Last seen 15 hours ago

United States

Let me clarify that it's statistically valid. I don't know if it makes sense for your biologically question to do this.

If you have reason to remove X or Y genes, it is fine to do this on the count matrix, either the CSV or in R, before running DESeq().

ADD COMMENT • link 6.0 years ago Michael Love 43k

score 1 · Answer 2 · 2019-04-09

1

Entering edit mode

Luiz ▴ 10

@luiz-20465

Last seen 5.9 years ago

Brazil / São Paulo / UNIFESP

This is perfectly normal. If you want to 'optimize' your non coding RNA analysis for example, you can also remove your mRNA genes.

ADD COMMENT • link 6.0 years ago Luiz ▴ 10

2

Entering edit mode

I'm less comfortable with removing something like a majority or ~50% of genes, but as long as MA plots look reasonable, and it's not violating some known biology (e.g. all remaining features are DE), I can see this being statistically valid.

ADD REPLY • link 6.0 years ago Michael Love 43k