Hi Everyone!!
I recently started with Single Cell preprocessing step and I have cells in two batches. Day 16 (control vs Knockout) and Day 40 (control vs Knockout). I plan to use seurat
after preprocessing.
My queries are as follows:
Should I separately apply dropletutils on each control case like
p16C_sce <- read10xCounts(p16C, col.names=TRUE) #dim: 9250 6794880 p16K_sce <- read10xCounts(p16K, col.names=TRUE) #dim: 9250 6794880 p40C_sce <- read10xCounts(p40C, col.names=TRUE) #dim: 9250 6794880 p40K_sce <- read10xCounts(p40K, col.names=TRUE) #dim: 9250 6794880
or should I combine like it has been done here? This is because, when I apply Dropletutils separately, I get
sce
object with different dimensions and I am not sure if that can be ingested by Seurat.One of my Batch showed ~10% of barcodes with high mitochondrial gene expression, higher than the threshold both in Control and Knockout. Should I retain those cells too because we see similar pattern in both control and knockout?
Yes, I was referring to the usage of the
emptyDrops()
function on the combined sce object where we read both batches together (control-case) usingread10xCounts()
. We observed that that rescued nearly ~3000 more cells in total (control+case) than when they were preprocessed separately. But now I know that's not the right way. Thanks for your prompt response sire!!