Hi Bioconductor List and GUIDEseq maintainers,
Thanks for publishing and maintaining the GUIDEseq Bioconductor package. I’ve run into a few issues that I’ve put in bullet-point form below. Any guidance that you can offer would be greatly appreciated:
- The "on-target" sequence in the data I’m analyzing is not appearing as a hit
- This site has 100,000+ reads in the region and is the highest peak in the dataset but not appearing in the output data, even though it has no mismatches.
- When inspecting the BED outputs, I found that there were two flanking peaks around where the major peak was called but not the major peak.
- The IGV screenshot below shows what I mean with the two
GUIDEseq
called peaks in blue on the bottom track but no peak was called in between where there is a large pileup of reads (on-target site) - I've set
plus.strand.start.gt.minus.strand.end = FALSE
andkeepPeaksInBothStrandsOnly = FALSE
in attempts to recover this peak but was not successful - What would be next parameters to try tweaking to recover the expected on-target peak?
- While I'm testing a Cas9 dataset, I may be getting data from alternative nucleases. Is there support for alternative guide lengths, different PAM sequences and position weights?
- I noticed that within GUIDESeq there is a vector of weights coming from a different paper. If we're trying a nuclease that does not have these weights determined, what would be the recommendation as input?
Thanks in advance,
David