####edit
For the Stack exchange post, that eleborates more
https://bioinformatics.stackexchange.com/questions/5269/rsvsim-insertions-from-chr1-to-chr2
####edit
I have a question and I tried a lot of options already, but not with the desired result.
So I want to simulate some insertions. Because I don't want my insertions to be a random sequence, I made a DNAseq that is named chr1 (which has 100.000bp of similar DNA seq) and chr2 (which is the seq I want my insertions in).
So I want to take a DNA seq from chr1 and put them in chr2 as an insert.
If I simulate SVs, they either copy/cut from chr2 to chr2 (which essentially makes a duplication or DEL+INS) or if you don't specify anything, it is random. Sometimes from chr1 -> chr2 and sometimes chr2 -> chr1. This is really annoying. Trying to set chrA ="chr1" and chrB="chr2" doesn't work. Also the examples pasted below do not give the desired result.
Using translocations also wont give me the effect that I want, because you cannot generate more translocations than genomes/chromosomes that you add.
**
However, when processing INS, I found out that it just copies a DNA seq from one part, and puts it somewhere else in the same DNA seq (Bascially a DUP then), or if you specify another an extra DNAseq, to cut from, It randomly cuts from either chr1 or chr2 and puts it in the other. This creates deletions in the DNA seq I want to investigate, which is not what I want.
Does anyone know how to get this kind of result?
width seq names [1] 40 AAAAAAAAAAAAAAAAAAAATTTTTTTTTTTTTTTTTTTT chr1 [2] 40 GGGGGGGGGGGGGGGGGGGGCCCCCCCCCCCCCCCCCCCC chr2
to this:
width seq names [1] 32 AAAAAAAAAAAAAAAATTTTTTTTTTTTTTTT chr1 [2] 48 GGGGGGGGGGGGGGAAAATTTTGGGGGGCCCCCCCCCCCCCCCCCCCC chr2
**
I have tried several things already over the past hours, I dont even have all the examples anymore. Could someone help with this? I need 30 INS on random places of chr2 of varying sizes.
#some things I tried, see stackexchange post for more examples seq_random <- readDNAStringSet("/path/random.fasta", "fasta") seq_ref <- readDNAStringSet("/path/reference1MB.fasta", "fasta") genome2 = DNAStringSet(c(seq_random, seq_ref)) names(genome2) = c("chr1","chr2") length_seq = width(genome2[2]) knownInsertion = GRanges(IRanges(0,length_seq), seqnames="chr2") knownInsertion = GRanges(IRanges(0,length_seq), seqnames="chr1", chrB="chr2", chrA="chr1") knownInsertion = GRanges(seqnames="chr1", chrB="chr2") sim = simulateSV(output='/folder/of/output/', genome=genome2, ins = 30, sizeIns=y, regionsIns=knownInsertion)
Can someone elaborate if there is something missing or not clear in my question? I really need the answer, because if it is not possible I have to rewrite a lot of stuff, including code.
Please, how am I able to contact BioConductor? The only way possible seems to be this forum and other posts seem to be answered within the hour.
You've tagged your question with 'RSVSim' to indicate that you have a problem with the RSVSim package. If the maintainer of the RSVSim package does not monitor this forum (all maintainers are required to subscribe, but...) or if they are simply as busy as you and have not had a chance to respond in the last 2 days, or if other users of RSVSim are not able to help you, or if there are not that many users of RSVSim monitoring this forum, then you will not receive a response quickly...
You could try to reach out directly to
maintainer("RSVSim").