Hi, I am trying to import salmon quant data to isotypeswitchanalyzeR for transcript variant use analysis. But I am seeing this error-
aSwitchList <- importRdata(
isoformCountMatrix = txi.tx$counts,
isoformRepExpression = txi.tx$abundance,
designMatrix = mycols,
isoformExonAnnoation = "hg19_knowngene.gtf",
showProgress = FALSE
)
Step 1 of 6: Checking data... Using row.names as 'isoformid' for 'isoformCountMatrix'. If not suitable you must add them manually. Using row.names as 'isoformid' for 'isoformRepExpression'. If not suitable you must add them manually. Step 2 of 6: Obtaining annotation... importing GTF (this may take a while) converting annotated CDSs
Error in importRdata(isoformCountMatrix = txi.tx$counts, isoformRepExpression = txi.tx$abundance, :
The annotation and quantification (count/abundance matrix and isoform annotation) seems to be different (jacard similarity < 0.95).
Either isforoms found in the annotation are not quantifed or vise versa.
Specifically:
78631 isoforms were quantified.
82960 isoforms are annotated.
Only 78631 overlap.
This combination cannot be analyzed since it will cause discrepencies between quantification and annotation thereby skewing all analysis.
Please make sure they belong together and try again. For more info see the FAQ in the vignette.
The GTF file I used during salmon quantification contains 82860 transcripts whereas my quant files contain 78631 rows.
Any help would be appreciated.
Which version of IsoformSwitchAnalyzeR are you using?
Cheers Kristoffer
I have loaded 1.4.0. on rstudio.
Could you try updating to 1.5.6 (from the devel branch) and see if the problem persists? You can find the installation instructions here. Remember to restart your R session after the update.
Cheers Kristoffer
yeah, sure. I will try it.
I tried with 1.5.6 and got the same error.