I'm pretty new to RNAseq, and I have been following this tutorial to get my data RNAseq data DESeq2 compatible (https://bioconductor.github.io/BiocWorkshops/rna-seq-data-analysis-with-deseq2.html). I get my sf file via Salmon. However, I'm getting this error once I try to call the tximport command:
txi command:
txi = tximport(files, type="salmon", tx2gene=tx2gene, ignoreTxVersion= TRUE, ignoreAfterBar=TRUE)
error message:
"Error in summarizeToGene(txi, tx2gene, varReduce, ignoreTxVersion, ignoreAfterBar, : None of the transcripts in the quantification files are present in the first column of tx2gene. Check to see that you are using the same annotation for both. Example IDs (file): [ebi, ...] Example IDs (tx2gene): [ENST00000456328.2, ENST00000450305.2, ENST00000473358.1, ...] This can sometimes (not always) be fixed using 'ignoreTxVersion' or 'ignoreAfterBar'."
I've looked at previously answered questions, and the most common suggestion was to put ignoreTxVersion=TRUE or ignoreAfterBar = TRUE. I have tried both, but I am still getting the same error. Not too sure what to do. Any suggestions?
You need to show the results from
head(tx2gene)
as well asreadr::read_tsv(files[1])
. Which may already point out to you what the problem is.Sorry about that!
head(tx2gene) shows:
and readr::read_tsv(files[1]) shows
Your transcript names in the file are not ENST...
sorry I'm a bit confused. How would I go about this?
Is there someone you can collaborate with, who is familiar with RNA-seq pipelines? Of course there are many tutorials online but it seems you are stuck at an early stage and you’d benefit from having someone looking over your shoulder.
everyone in my lab is currently out, so I'm trying to work around that, but thank you for taking your time out to respond! I appreciate it!