Note sure where else to ask this, but I have been using tximport
for importing Salmon quantifications against custom transcriptomes into R.
I don't always use salmon mapping mode for quantification, so by default I run compute_fasta_digest
as was recommended to get the txomes Seq and Name hashes for when I generate a TxDb, but upon further inspection this does not seem to always work. Either the SeqHash or the NameHash from FastaDigest seems different than the salmon index, meaning my prebuilt linkedTxome and TxDb don't always work.
Does anybody know how to reliably compute the salmon hash values without fully indexing?
Salmon v1.10.0
fasta_digest v0.1.2
tximeta v1.16.0
R v4.2.3
I use the
fastaDigest
tool from COMBINE-lab/fastaDigest, which has the functioncompute_fasta_digest
, to make what I thought were the transcriptome hash values that salmon generates when indexing.I will do some more testing to see in what scenarios the mismatches occur, but I have at least one transcriptome where the
fastaDigest
output has a differentseq_hash
value, and at least one scenario where thefastaDigest
output has a differentname_hash
value.example:
result from
compute_fasta_digest
:info.json
in salmon index dir of same transcriptome fasta: