Hi,
I´m reading a fastqc file. I want to generate stats based on this file. I have checked the documentation and seen the report that you can generate for a list of files.
In this particular case i would like to generate my own reports for each fastqc file individually rather than a global quality report such as the one provided by shortread.
The first question is i´m not sure id the phred scores are correct. My input file is a illumina pair-end file (this is case an R1 file).
fq <- readFastq(paste(path,"R1:fastq.gz",sep=""))
# get quality score list as PhredQuality
qual <- PhredQuality(quality(quality(fq)))
myqual_mat <- matrix(charToRaw(as.character(unlist(qual))), nrow=length(qual), byrow=TRUE)
myqual_mat[1580:1590,170:175]
> myqual_mat[1580:1590,170:175]
[,1] [,2] [,3] [,4] [,5] [,6]
[1,] 47 47 47 47 47 47
[2,] 46 43 46 43 2a 3d
[3,] 47 47 47 47 37 3d
[4,] 44 47 47 47 46 43
[5,] 47 47 47 47 35 45
[6,] 47 43 35 46 35 43
[7,] 46 45 3f 45 2a 2a
[8,] 3c 2b 3a 43 35 43
[9,] 47 46 47 47 46 47
[10,] 43 2c 39 2c 32 31
[11,] 40 23 23 23 23 23
The first question is that i have strange scores such as 2a 3d ... in the table. I´m probaly doing something wrong here but not sure what ?
My second question is that i want to generate the quartile for each base position so i need to transform this matrix into numeric. The current format is raw. How can i do that ??
thanks ,
david