Hi,
I'm trying to calculate TPM from raw counts for bulk RNA-seq data. Does anyone know how transcript lengths are retrieved when using the function "calculateTPM"? Or, if the lengths have to be provided, how do I calculate them? ref: https://www.rdocumentation.org/packages/scater/versions/1.0.4/topics/calculateTPM
Best
In the help file ?calculateTPM you'll see an argument
As for how to retrieve the lengths: use information from a gene annotation source such as Ensembl/BioMart, a TxDb object, etc.
Ideally, you would have a transcript length per gene, per sample, something like what RSEM outputs. But if it's single-cell data, that won't be possible for most kinds of sequencing protocols that sequence only the 5' end or the 3' end.
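To make it concrete why the lengths matter, here is a minimal sketch of the standard TPM arithmetic for a single sample, written in plain Python rather than R so it does not depend on any package. The function name `tpm` and the toy counts and lengths are made up for illustration; this is not scater's implementation, and it assumes one fixed length (in bp) per gene rather than per-sample effective lengths.

```python
def tpm(counts, lengths_bp):
    """Convert raw counts for one sample to TPM, given one length (bp) per gene.

    Hypothetical helper for illustration only; not the scater function.
    """
    if len(counts) != len(lengths_bp):
        raise ValueError("counts and lengths_bp must have the same length")
    if any(length <= 0 for length in lengths_bp):
        raise ValueError("all lengths must be positive")

    # Step 1: length-normalise, giving reads per kilobase for each gene.
    rates = [c / (length / 1000) for c, length in zip(counts, lengths_bp)]

    # Step 2: rescale so the values in the sample sum to one million.
    total = sum(rates)
    if total == 0:
        return [0.0] * len(rates)
    return [r / total * 1e6 for r in rates]


# Toy data (assumed): three genes whose counts scale exactly with their length.
counts = [100, 200, 300]
lengths_bp = [1000, 2000, 3000]
tpm_values = tpm(counts, lengths_bp)
```

In this toy case the three genes end up with the same TPM even though their raw counts differ, because the counts are proportional to the lengths. That is the effect the length argument is there to capture, and it is also why the lengths cannot be inferred from the count matrix alone.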