Hi,
I have 2 replicates each of 4 different samples, so 8 samples total (P300 chip-seq).
All are from the same cell type, but with different transcription factors knocked out, and also WT.
I want to get "modules" of peaks gained and/or lost in each sample (enhancers created or repressed by each TF).
How might I set this up in DiffBind?
Right now I have made my samples object like this:
8 Samples, 30757 sites in matrix:
ID Tissue Factor Condition Treatment Replicate Caller Intervals FRiP
1 WT.1 Th2 P300 WT 5d 1 counts 30757 0.18
2 WT.2 Th2 P300 WT 5d 2 counts 30757 0.14
3 Gata3_cKO.1 Th2 P300 Gata3_KO 5d 1 counts 30757 0.19
4 Gata3_cKO.2 Th2 P300 Gata3_KO 5d 2 counts 30757 0.13
5 Stat6_KO.1 Th2 P300 Stat6_KO 5d 1 counts 30757 0.17
6 Stat6_KO.2 Th2 P300 Stat6_KO 5d 2 counts 30757 0.15
7 Gata3_Stat6_dKO.1 Th2 P300 Gata3_Stat6_KO 5d 1 counts 30757 0.14
8 Gata3_Stat6_dKO.2 Th2 P300 Gata3_Stat6_KO 5d 2 counts 30757 0.12
Many thanks for any help
Dear Rory,
I am a beginner to bioinformatics and DiffBind. Firstly, thank you for this very useful tool and the handy guides! I have some additional questions regarding using DiffBind on more than 2 conditions.
Currently I have 4 different conditions (representing different "subtypes" of a disease condition) and would like to identify differential peaks among all 4 different conditions (ideally identify peaks that are specific to each subtype).
I have set up all possible contrasts (i.e. 6 different contrasts). However from what I know, DiffBind only allows identification of differential regions between 2 conditions. As a result, for heatmap visualization of differential peak regions , at any one point I can only visualize one contrast that clusters the 2 conditions in that contrast.
I am not sure if it is possible to use DiffBind to obtain a heatmap plot that can ideally cluster the 4 different conditions according to differential peak regions across all 4 conditions?
Please correct me if I have any misunderstanding.
Thank you!
Best regards,
Shamaine
This is a new question, not a comment on the original quesiton, and should be logged as such.
Given a set of DB sites from a contrast, you can see how all the samples cluster, whether or not they are in the contrast, by setting the
mask
parameter to egdba.plotHeatmap()
anddba.plotPCA()
. So if you have 4 conditions and 3 replicates of each, you can setmask=1:12
and all the samples will be plotted using the DB sites from the specified contrast.