I'm working on getting MEDIPS running for some collaborators who previously had been using MACS to analyze their methyl cap seq data.
I've run MEDIPS on a data set that they had previously run through MACS. I only got one solid hit (p<0.05 after correction for multiple comparisons) using MEDIPS, while MACS yielded ~200. The single positive hit from MEDIPS was replicated in MACS, and the MACS data were corrected for multiple comparisons using the same method (BH).
After looking at the data, it appears that the issue is that MEDIPS is wiping out all of the p-values when adjusting for multiple comparisons. I spot-checked some of the hits from MACS, and for every hit in MACS, I see an interval or two that have a significant (<0.05) p-value in MEDIPS that gets adjusted to 1 or so. I've tried a few methods to "fix" this: I've tried increasing the window size (idea: larger window = less intervals = less comparisons) and changing the correction method (tried fdr, as it isn't overly conservative in my experience). None of this changed the outcome (i.e. single hit), although the exact corrected p-value did fluctuate.
Can anyone with a better understanding of the MEDIPS and MACS algorithms explain these differences?
The minRowSum parameter appears to be working. I still haven't gotten it to the level where I'm getting the number of hits MACS was returning (I need to see what kind of parameters they were using for MACS as well), but adjusting it yielded immediate results.
I had built my scripts using a MEDIPS tutorial here (http://www.bioconductor.org/packages/2.12/bioc/vignettes/MEDIPS/inst/doc/MEDIPS.pdf). In this tutorial, the example they use involves setting the minRowSum variable to 1. This tutorial claims that this is the default value. Setting the variable to 10 (the default according to your post) resulted in several new hits. Is that tutorial wrong/out of date? Based on what I know now, advising a minRowSum value of 1 seems like bad advice.
I'll continue tweaking minRowSum and window size to values more appropriate to the data. Thanks for your help.