about the quality score
wang peter ★ 2.0k
@wang-peter-4647
Last seen 10.2 years ago
I have data like this:

@HWI-ST397_0000:5:1101:1110:2156#NTNTCG/1
CAAGGAGCAGAGTGAGGTCTCTCCGCAGAGCGCCAAGGCAAAGCAGATGGTGGAGCTGCTCTACGAGACGGCGCACATTGCGTCTGGATTCCCGCCTG
+HWI-ST397_0000:5:1101:1110:2156#NTNTCG/1
gdgggegc_gffdeeggdgfggggdgbfadaeeeeefaefgcebebdffebbbbcg`gdgfgeeebgbddcbccbfdffcdcadbW_]]`]dd_bb\^

reads <- readFastq(fastqfile)
seqs <- sread(reads)

The qualities seem to use the Phred+64 system. Can I use this function?

qual <- PhredQuality(quality(quality(reads)))

I think it can only read the Phred+33 system.

--
shan gao
Room 231 (Dr. Fei lab)
Boyce Thompson Institute
Cornell University
Tower Road, Ithaca, NY 14853-1801
Office phone: 1-607-254-1267 (day)
Official email: sg839 at cornell.edu
Facebook: http://www.facebook.com/profile.php?id=100001986532253
@martin-morgan-1513
Last seen 4 months ago
United States
On 01/11/2012 09:11 AM, wang peter wrote:
> I have data like this:
> @HWI-ST397_0000:5:1101:1110:2156#NTNTCG/1
> CAAGGAGCAGAGTGAGGTCTCTCCGCAGAGCGCCAAGGCAAAGCAGATGGTGGAGCTGCTCTACGAGACGGCGCACATTGCGTCTGGATTCCCGCCTG
> +HWI-ST397_0000:5:1101:1110:2156#NTNTCG/1
> gdgggegc_gffdeeggdgfggggdgbfadaeeeeefaefgcebebdffebbbbcg`gdgfgeeebgbddcbccbfdffcdcadbW_]]`]dd_bb\^
> reads <- readFastq(fastqfile)
> seqs <- sread(reads)
> The qualities seem to use the Phred+64 system.
>
> Can I use this function?
> qual <- PhredQuality(quality(quality(reads)))
> I think it can only read the Phred+33 system.

From ?readFastq, documenting the ... argument:

...: Additional arguments. In particular, 'qualityType' and 'filter':

  qualityType: Representation to be used for quality scores, must be one of 'Auto' (choose Phred-like if any character is ASCII-encoded as less than 59), 'FastqQuality' (Phred-like encoding), 'SFastqQuality' (Illumina encoding).

See also ?SolexaQuality.

--
Computational Biology
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109
Location: M1-B861
Telephone: 206 667-2793
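The 'Auto' rule quoted above can be sketched in base R. This is only an illustration of the documented rule, not ShortRead's own implementation; guess_quality_type() is a hypothetical helper name, and the Sanger-style string is made up for contrast.

```r
# Sketch of the 'Auto' rule quoted from ?readFastq (not ShortRead's code):
# choose the Phred-like encoding if any quality character has an ASCII
# code below 59, otherwise the Illumina-style encoding.
# guess_quality_type() is a hypothetical helper, not a ShortRead function.
guess_quality_type <- function(qual) {
  codes <- unlist(lapply(qual, utf8ToInt))  # ASCII codes of all characters
  if (any(codes < 59)) "FastqQuality" else "SFastqQuality"
}

# First 20 quality characters of the read in the question
illumina_qual <- "gdgggegc_gffdeeggdgf"
# A made-up Sanger-style quality string, for contrast
sanger_qual <- "IIII5555##"

guess_quality_type(illumina_qual)
guess_quality_type(sanger_qual)
```

With real data the same choice can be made explicitly by passing qualityType to readFastq(), as the help page describes.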
Dear Martin,

The Illumina 1.3+ (Phred+64) score is not the Solexa score, as you can see here:

Score           Offset   Phred range   ASCII range
Sanger          33       0 to 93       33 to 126
Solexa          64       -5 to 62      59 to 126
Illumina 1.3+   64       0 to 62       64 to 126

If I use the Solexa function to deal with Illumina 1.3+ data, is it compatible?

--
shan gao
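As a minimal base-R sketch of what the offsets in the table mean: the score is the ASCII code of the quality character minus the offset. decode_quality() is a hypothetical helper name; the string is the first 12 quality characters of the read in the question.

```r
# Score = ASCII code of the quality character minus the encoding offset.
# decode_quality() is a hypothetical helper used only for illustration.
decode_quality <- function(qual, offset) utf8ToInt(qual) - offset

# First 12 quality characters of the read in the question
qual <- "gdgggegc_gff"

q64 <- decode_quality(qual, 64)  # read as offset 64 (Solexa / Illumina 1.3+)
q33 <- decode_quality(qual, 33)  # read as offset 33 (Sanger)

q64
q33
```

Both offsets share the same characters for Solexa and Illumina 1.3+; what differs between those two is how the resulting integer is turned into an error probability.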
On 01/11/2012 01:05 PM, wang peter wrote:
> Dear Martin,
> The Illumina 1.3+ (Phred+64) score is not the Solexa score, as you can see here:
>
> Score           Offset   Phred range   ASCII range
> Sanger          33       0 to 93       33 to 126
> Solexa          64       -5 to 62      59 to 126
> Illumina 1.3+   64       0 to 62       64 to 126
>
> If I use the Solexa function to deal with Illumina 1.3+ data, is it compatible?

In ShortRead, FastqQuality and SFastqQuality determine the _encoding_; SFastqQuality is appropriate for Solexa and Illumina 1.3+. Functions in ShortRead, e.g., alphabetScore() or as(quality(), "matrix"), operate on the integer value of the corresponding letter. ShortRead does not (unless I am missing some code) translate the encoding into probabilities.

Biostrings PhredQuality and SolexaQuality also represent encoding, but allow coercion to numeric, as(<...>, "numeric"). These coercions use -10 log10(p) for PhredQuality and -10 log10(p / (1 - p)) for SolexaQuality. The latter is not appropriate for Illumina 1.3+ (although the differences are most pronounced when p is large, i.e., when reads have low quality anyway).

I will add an additional class, IlluminaQuality, to Biostrings in the 'devel' branch.

Martin
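To see how far apart the two definitions are, the formulas above can be inverted in base R: Q = -10 log10(p) gives p = 10^(-Q/10), and Q = -10 log10(p / (1 - p)) gives p = 1 / (1 + 10^(Q/10)). The function names below are hypothetical and this is not the Biostrings coercion code, only a sketch of the arithmetic.

```r
# Invert the two score definitions to recover the error probability p.
# phred_to_prob() and solexa_to_prob() are hypothetical helper names.
phred_to_prob  <- function(q) 10^(-q / 10)           # Q = -10 log10(p)
solexa_to_prob <- function(q) 1 / (1 + 10^(q / 10))  # Q = -10 log10(p / (1 - p))

q <- c(0, 5, 10, 20, 30, 40)
tab <- data.frame(
  score      = q,
  phred      = phred_to_prob(q),
  solexa     = solexa_to_prob(q),
  difference = phred_to_prob(q) - solexa_to_prob(q)
)
print(tab)
```

The difference column shrinks as the score rises, which matches the remark that the two interpretations diverge mainly for low-quality bases.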
Dear Martin,

Thank you very much for your contribution. I think it would be better to keep three functions, PhredQuality, SolexaQuality, and IlluminaQuality, and to remove FastqQuality. Having so many functions confuses people; please try to make them simpler and more uniform.

Thank you.

--
shan gao