CRISPRSeek Results files column interpretation
1
1
Entering edit mode
xenon ▴ 40
@xenon-21520
Last seen 3.9 years ago

I am trying to find out how effective my guide RNAs are at targeting coding regions for a particular genome. I have the following questions from the output files that are generated post offTargetAnalysis() :-

Summary.xls

1- For a guide RNA what is the difference between top10OfftargetTotalScore and top5OfftargetTotalScore ? How shall I interpret them ?

  • Do I need to look at how off target score is calculated ?
  • Where does the N in topNOffTargetScore is used to calculate this score ?

2- What information do the columns like top1Hit.onTarget.MMdistance2PAM provide , for eg if it has value of NMM ?

  • What are the different values that can be expected here ?

3- Does the columns uniqREin200 and uniqREin100 tell about Name of unique restriction enzymes with cut sites 200bp and 100bp upstream or downstream around the guide RNA ?

OffTargetAnalysis.xls

4- What does the column score mean ? Does it mean that the gRNA with high score has more offTargets ? How is it related to gRNAefficacy column ?

  • What is range of values for the columns score and gRNAefficacy?
  • Is the gRNAefficacy in OffTargetAnalysis.xls same as in Summary.xls ?

5- What is isCanonicalPAM ? Is it telling whether the corresponding OffTargetSequence has a canonical PAM ?

CRISPRseek • 958 views
ADD COMMENT
3
Entering edit mode
Julie Zhu ★ 4.3k
@julie-zhu-3596
Last seen 14 months ago
United States

Thanks for the great questions!

Please see my response below your questions.

Best regards,

Julie

Question 1- For a guide RNA what is the difference between top10OfftargetTotalScore and top5OfftargetTotalScore ? How shall I interpret them ?

Do I need to look at how off target score is calculated ? Where does the N in topNOffTargetScore is used to calculate this score ?

Answer: top10OfftargetTotalScore means the sum of the scores from the top 5 offtargets and top10OfftargetTotalScore means the sum of the scores from top 10 offtargets. Please look at the OffTargetAnalysis.xls file for detailed information on individual offtargets

Please see the reference section in the help menu of offTargetAnalysis by typing help(offTargetAnalysis) in a R session.

Please also type help(offTargetAnalysis) in a R session for detailed parameter settings and interpretation. You can set the parameter topN.OfftargetTotalScore for calculating the topNOffTargetScore. It is the sum of the scores from the top N offtargets where N is set by topN.OfftargetTotalScore which specifies the top N offtargets used to calculate the topN.OfftargetTotalScore, default 10

In general, you would want to use gRNAs with high efficacy, low topNOfftargetTotalScore where N can be 5,10, or a number you feel comfortable, and a lower number of offtargets in offtargetAnalysis.xls.

Question 2- What information do the columns like top1Hit.onTarget.MMdistance2PAM provide , for eg if it has value of NMM ?

What are the different values that can be expected here ?

Answer: When there is no mismatch found in the top 1 hit, then it outputs NMM. If there is at least one perfect match in the genome, then it will output NMM. Otherwise, it outputs the distance from the mismatch base to the PAM sequence.

Question 3- Does the columns uniqREin200 and uniqREin100 tell about Name of unique restriction enzymes with cut sites 200bp and 100bp upstream or downstream around the guide RNA ?

Answer: Yes, correct. uniqREin200 and uniqREin100 columns contain REs unique to the cleavage site within 100 or 200 bases surrounding the gRNA sequence. You can turn off this by setting outputUniqueREs = FALSE.

Please type help(offTargetAnalysis) in a R session for detailed parameter settings and interpretation.

Question 4- What does the column score mean ? Does it mean that the gRNA with high score has more offTargets ? How is it related to gRNAefficacy column ?

Answer: It depends on which method you use to calculate the score.

You can set scoring.method to "Hsu-Zhang" or "CFDscore".

If you set scoring.method to "CFDscore", the value is from 0 to 1.

If you set scoring.method to "Hsu-Zhang", the value ranges from 0 to 100.

The higher the value, the higher the cutting frequency.

For on-targets, it should be 1 for CFDscore and 100 for Hsu-Zhang score, and it is close to 0 for off-targets with minimum cut.

Question: Does it mean that the gRNA with high score has more offTargets ?

Answer: No. The score is a relative cutting frequency of an offtarget site relative to the target site.

Question: How is it related to gRNAefficacy column ?

Answer: gRNAefficacy specifies how effective the gRNA is while score is the relative cutting frequency to the ontarget.

gRNAefficacy ranges from 0 to 1. 1 means 100% cutting efficacy while 0 means 0% cutting frequency.

gRNAefficacy refers the efficacy of a gRNA while score indicates the relative cutting frequency on an offtarget relative to that of the ontarget.

Question: Is the gRNAefficacy in OffTargetAnalysis.xls same as in Summary.xls ?

Answer: The gRNAefficacy in OffTargetAnalysis.xls for the target site is the same as that in the Summary.xls.

Question 5- What is isCanonicalPAM ? Is it telling whether the corresponding OffTargetSequence has a canonical PAM ?

Answer: Yes, you were right. Canonical PAM is the PAM sequence you specified in offTargetAnalysis. By default, PAM = "NGG". Please type help(offTargetAnalysis) in a R session for detailed parameter settings and interpretation.

ADD COMMENT

Login before adding your answer.

Traffic: 439 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6