I am having a query related to VOOM function in the limma package. I am trying to analyze proteomics data using limma to identify the differential expression between two groups which has control(n=3and tumor, n=6). There is only biological replicates and not any technical ones. I have a doubt that the matrix design is correct or not . Also limma can be used for the analysis proteomics data obtained from masspectrometry platform .
design <- model.matrix(~ -1 + > factor(c(1,1,1,1,1,1,2,2,2)))
design <- model.matrix(~-1 + factor(rep(1:2, 1=6,2=3)))
colnames(design) <- c("tumor","control") contrast <- makeContrasts(tumor - control, levels = design)
voom(counts, design = NULL, lib.size = NULL, normalize.method = "none", span = 0.5, plot = FALSE, save.plot = FALSE)
I agree Gordon's comment: use limma but not voom because spectrometer are not counting values but measuring the are of a peak. Imputation of missing is a major point. Missing values are not at random but mainly occur because the intensity of the peak is too low to be captured by the spectrometer. Moreover, intensities are usually reported with high multiplicative coefficient: an intensity of 1e5 is frequently one of the lowest values. I prefer dividing intensities by 1e5 or 1e6 before imputing values using some random values around zero.
The MS data was generated in the orbitrap platform of Thermo Fisher Scientific and using proteome discoverer 2.2 the initial data processing was done. In, PD(2.2), there is an option for missing value imputation. The details are listed below. I have a doubt that whether we should consider software-based missing value imputation or the way you have suggested
•PD software missing value imputation- Low Abundance Resampling: Replaces missing values with random values sampled between the minimum and the lower 5 percent of all detected values.