Hi,
I used DiffBind for a collaborator who noticed that, contrary to his expectations, a particular region (which overlaps highly significant (pvalue << 0.001) peaks from MACS2 in each replicate of condition Y) got a very low FDR when compared to condition X (in which there are no peaks from any of the 4 replicates). Could you please help me to figure out if there's anything wrong in my code? Unfortunately, I cannot share the real data (it's still unpublished), but here's the anonymised experimental setup and the commands I used to compare conditions Y vs X, as well as Z vs X:
> samples SampleID Factor Condition Replicate bamReads bamControl Peaks PeakCaller 1 condX_repA TF condX 1 A-TF.bam A-I.bam A-condX.bed bed 2 condX_repB TF condX 2 B-TF.bam B-I.bam B-condX.bed bed 3 condX_repC TF condX 3 C-TF.bam C-I.bam C-condX.bed bed 4 condX_repD TF condX 4 D-TF.bam D-I.bam D-condX.bed bed 5 condY_repE TF condY 1 E-TF.bam E-I.bam E-condY.bed bed 6 condY_repF TF condY 2 F-TF.bam F-I.bam F-condY.bed bed 7 condY_repG TF condY 3 G-TF.bam G-I.bam G-condY.bed bed 8 condY_repH TF condY 4 H-TF.bam H-I.bam H-condY.bed bed 9 condZ_repJ TF condZ 1 J-TF.bam J-I.bam J-condZ.bed bed 10 condZ_repK TF condZ 2 K-TF.bam K-I.bam J-condZ.bed bed 11 condZ_repL TF condZ 3 L-TF.bam L-I.bam L-condZ.bed bed 12 condZ_repM TF condZ 4 M-TF.bam M-I.bam M-condZ.bed bed > db_analysis = dba(sampleSheet="sampleInfo.csv", minOverlap=4, bCorPlot=FALSE) > db_analysis = dba.count(db_analysis, minOverlap=4, bCorPlot=FALSE) > db_analysis = dba.contrast(db_analysis, minMembers=4, categories=DBA_CONDITION) > db_analysis = dba.analyze(db_analysis, bCorPlot=FALSE) > dba.report(db_analysis, contrast=1, th=1, file="sites_Y_vs_X", ext="txt") > dba.report(db_analysis, contrast=2, th=1, file="sites_Z_vs_X", ext="txt")
Thanks a lot for your help!
Silvia