Hello!
I am currently working on a 10x dataset using the SingleCellExperiment package and have made a new SCE isolating the CD4 T helper cells for analysis. I did this by identifying and separating the clusters that highly expressed particular genes such as CD4/CD3 etc. However, now that I am clustering these cells I've noticed a small amount (a few 100) B cell expressing genes (MS4A1 and CD19) which I am assuming are B cells, on the end of one cluster of T cells (I have run a doublet finder so not sure how they got in the new SCE).
My first question is, is there a way to remove these cells in particular that are highly expressing a certain gene? I can easily remove the gene but not the cells that express it.
Secondly, was clustering the best way to isolate the CD4 expressing cells for further analysis?
Thank you so much for your help!
If you're still early on in the analysis, I would suggest switching to Seurat. It appears to offer more features that are frequently used, in a more user-friendly manner. Seurat works great with 10x data, and they have a pretty clear and easy to follow tutorial as well.
Question 1:
What you would like to do, to remove cells can be easily done in Seurat. As suggested here: https://satijalab.org/seurat/articles/essential_commands.html
There's a lot you can do with subset, e.g https://github.com/satijalab/seurat/issues/1782
Question 2:
Yes, if there's a particular cluster you're interested. In Seurat, just select the cluster and run Scaling + clustering again. Skip the Normalization step, which presumably you already did at the beginning of the analysis. https://github.com/satijalab/seurat/issues/6115