DESeq2 for differential gene expression on GTEx dataset
vikram • 0
@vikram-14432

Hi,

I'm new to the field, and I'm trying to do a differential gene expression analysis on the GTEx dataset. My aim is to identify sets of genes which (with some confidence) identify each of the 50-odd tissue types in that dataset. The dataset is bulk RNA-seq with ~50k genes and ~12k samples. The resource I have at hand has ~50 CPUs, each with 12 cores, and plenty of RAM.

I have

1) browsed through the DESeq2 vignettes, and I feel it may be a good fit,

2) removed housekeeping genes, in the hope that it makes the task of the software a little easier, and

3) started running the code.

I was wondering if

1) My choice of algorithm is advisable, and

2) anyone has an estimate of how much time it may take the code to run

I'd be glad to give more details, if you need it.

Thanks for reading through. :-)

rnaseq deseq2 gtex • 2.1k views
@mikelove
United States

Hi,

With hundreds of samples per condition or group, and thousands of samples overall, I personally tend to switch to faster linear models, such as limma-voom, for differential expression. The GLM has to do a lot of computation to iteratively find the solution (a beta coefficient for each tissue) for each gene. The linear model is much faster.
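
To make the speed difference concrete, here is a toy sketch in plain Python. It is not how DESeq2 or limma-voom are implemented: the function names `linear_fit` and `poisson_glm_fit` are hypothetical, the GLM is a simple Poisson log-link model rather than DESeq2's negative binomial model with dispersion estimation, and voom's precision weights are left out. It assumes one coefficient per tissue and at least one nonzero count in every tissue. For that design the least-squares solution on log-scale values is just the per-tissue mean, computed in a single pass, while the GLM coefficients are reached by repeated Newton updates.

```python
import math


def linear_fit(log_values, groups):
    """Least squares with one coefficient per group: the closed-form
    solution is the mean of the log-scale values within each group."""
    sums, sizes = {}, {}
    for y, g in zip(log_values, groups):
        sums[g] = sums.get(g, 0.0) + y
        sizes[g] = sizes.get(g, 0) + 1
    return {g: sums[g] / sizes[g] for g in sums}


def poisson_glm_fit(counts, groups, tol=1e-8, max_iter=100):
    """Newton iterations for a log-link Poisson GLM with one beta per group.
    Simplified stand-in for an iterative GLM fit (assumption: no dispersion,
    every group has a nonzero count). Returns (betas, iterations used)."""
    labels = sorted(set(groups))
    # Start every coefficient at the log of the overall mean count.
    start = math.log(sum(counts) / len(counts) + 0.5)
    beta = {g: start for g in labels}
    for iteration in range(1, max_iter + 1):
        score = {g: 0.0 for g in labels}  # gradient of the log-likelihood
        info = {g: 0.0 for g in labels}   # Fisher information (diagonal here)
        for y, g in zip(counts, groups):
            mu = math.exp(beta[g])
            score[g] += y - mu
            info[g] += mu
        step = {g: score[g] / info[g] for g in labels}
        for g in labels:
            beta[g] += step[g]
        if max(abs(s) for s in step.values()) < tol:
            return beta, iteration
    return beta, max_iter


# Toy data for a single gene: three samples in each of two tissues.
counts = [10, 12, 8, 100, 110, 90]
tissues = ["liver", "liver", "liver", "lung", "lung", "lung"]

log_counts = [math.log2(c + 1) for c in counts]
print("linear model (one pass):", linear_fit(log_counts, tissues))

betas, n_iter = poisson_glm_fit(counts, tissues)
print("GLM betas:", betas, "after", n_iter, "Newton iterations")
```

With ~50 tissues, ~12k samples and ~50k genes, the iterative fit repeats that inner loop several times per gene, which is roughly where the extra run time comes from.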


Thanks a lot for the reply. I'll try limma-voom.
