Question

process bed/bedGraph files with the same coordinates for all the samples

0

Entering edit mode

Kath • 0

@kath-18711

Last seen 5.1 years ago

I have large number of ChIPSeq samples and would like to generate bed/bedGraph files all with the same genomic coordinates. would any one help to provide any advice on this? Thank you in advance!

ChIPSeq • 1.4k views

ADD COMMENT • link 5.5 years ago Kath • 0

0

Entering edit mode

You need to more clearly explain what you want. If I were to interpret your question, I would think you want like a gazillion bed files, all with the same thing in each, which obviously makes no sense.

ADD REPLY • link 5.5 years ago James W. MacDonald 67k

0

Entering edit mode

James, thank you and this is the first time I get into this question and was wondering as well. I am using ChIPSeqSpike package and inputsubtraction step gave the errors, I was told to provide all the bed/bedGraph files in the same coordinates. Not sure how to proceed with it so I posted here for more advice.

Meantime, do you have any advice on normalization of ChIPSeq data with exogenous genome DNA spike in? Thank you in advance!

ADD REPLY • link 5.5 years ago Kath • 0

0

Entering edit mode

OK. It looks like you have contacted the maintainer of that package with a question, and he told you to do something? If so, you might get clarification from him, rather than here, as we have no idea what he is asking for, but he (one would assume) does. I would also assume that he could provide you with some code to help you get what he wants?

I don't have any advice for you concerning normalization. This site is primarily intended as a place that people can ask technical questions about how to get the software to do what they want it to do, with the implicit assumption that the person asking has the required statistical knowledge, but may not have the required technical knowledge for a given package.

What you are asking is a statistical question, which is almost always more difficult than a technical question, and would require me (or anybody else) to know more about your experiment, the goals, etc. This isn't a straightforward question, and there is usually not one 'right' answer. As an example, the csaw package has a workflow that goes through all the things you could do to normalize your data, but leaves it up to you to decide what's best.

As the analyst you have taken on the responsibility of knowing all the ways you could normalize your data, choosing one, and being able to defend your choice. If you can't do all three of those things, you should probably get some local help. Analyzing ChIP-Seq data is a non-trivial task

ADD REPLY • link 5.5 years ago James W. MacDonald 67k

0

Entering edit mode

Thanks for your comments

ADD REPLY • link 5.5 years ago Kath • 0

0

Entering edit mode

In addition to James' comments, the csaw user's guide also has some comments about spike-in normalization (Section 4.5 in https://bioconductor.org/packages/devel/workflows/html/csawUsersGuide.html ).

ADD REPLY • link updated 5.5 years ago by James W. MacDonald 67k • written 5.5 years ago by Aaron Lun ★ 28k