Error reading BAM file with duplicated header in easyRNAseq and GenomicAlignments
1
0
Entering edit mode
@michael-dondrup-8328
Last seen 3.7 years ago
Bergen, Norway

I am trying to re-count a bunch of bam files that I have previously opened without problems in easyRNAseq and GenomicAlignments. But now I get. The bam files are seemingly valid and can be accessed by samtools, but the header contains few duplicated entries.

reads = readGAlignments('test.fastq.gz.clipped_M.bam.sorted.bam')
Error in GAlignments(seqnames = bamcols$rname, pos = bamcols$pos, cigar = bamcols$cigar,  :
  'seqlengths' must be an integer vector with unique names
In addition: Warning message:
In `levels<-`(`*tmp*`, value = if (nl == nL) as.character(labels) else paste0(labels,  :
  duplicated levels in factors are deprecated

 

This is a related thread:

Error when reading bam file with duplicated header

software error easyrnaseq genomicalignments bam • 1.6k views
ADD COMMENT
2
Entering edit mode
@martin-morgan-1513
Last seen 4 months ago
United States

Sorry for the delay, but I believe this is now fixed in GenomicAlignments / Rsamtools 1.4.2 / 1.20.5 (release) and 1.5.18 / 1.21.19 (devel).

ADD COMMENT
0
Entering edit mode

Thank you for fixing it. I can confirm that I can read and count the bam files again using GenomicAlignments and easyRNAseq, feels good :D

ADD REPLY

Login before adding your answer.

Traffic: 745 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6