i have raw read counts of miRNA gene of lung cancer downloaded from GEO. i want to analyze and want to find which genes are differentailly expressed genes among my 90 samples using DESeq2 package. i read about DESeq2 package from bioconductor tutorial and gone through the workflow. but, i didn't understand from where to start with my raw read counts? (also calculated p values)
did you have the normal and and disease(e.g, cancer) dataset or not?
i hve only single file, which has this type of data
To be honest, I suspect that you might be unclear on what "differentially expressed" means.
A gene is called "differentially expressed", if it's expression strength in one type or group of samples is higher than in another type or group. So, you need at least two different groups of samples, so that you can compare one with the other.
Yeah, like @Simon said, you must have at least two types, normally, it includes normal samples and disease samples.
thank you @Simon @ huynguyen96.dnu now i havetwo file, normal file (contain 96 samples), and disease file( samples 96).