I had a question regarding the appropriate distribution for modeling scater-normalized UMI data. I normalized the UMI dataset (a SingleCellExperiment) to the ERCC spike-ins to capture total differences in RNA content. I then used the convertTo() function in scran to export to Monocle CellDataSet so I could perform some additional analysis there. As I understand it, scater returns counts divided by calculated size factors (not log-transformed). I wasn't sure if the convertTo() function specified a Monocle expressionFamily to use as the distribution (it looked like it was defaulting to negative binomial) - but should I be treating this data as a tobit()? Additionally, I assumed that once in Monocle I shouldn't be recalculating size factors but rather using the size factors calculated by scater?
Thanks!