Hi, I was trying to do a differential expression analysis using TCGAbiolinks. The tumour data was queried and downloaded using GDCquery and GDCdownload, but the GDCprepare command gave the error shown after the code.


query.colon.cancer <- GDCquery(project = "TCGA-COAD", legacy = TRUE, data.category = "Gene expression", data.type = "Gene expression quantification", experimental.strategy = "RNA-Seq", sample.type = "Primary solid Tumor", file.type = "normalized_results")

GDCdownload(query.colon.cancer, files.per.chunk = 200)

prep.colon.cancer <- GDCprepare(query = query.colon.cancer, save = TRUE, summarizedExperiment = TRUE, save.filename = "COLON_CANCER.rda")

Error in GDCprepare(query = query.colon.cancer, save = TRUE, summarizedExperiment = TRUE, : There are samples duplicated. We will not be able to prepare it

I have not been able to remove the rows corresponding to the duplicated files by specifying their row numbers. Can anyone help, please? Many thanks in advance.
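For reference, this is roughly the kind of row filtering I am after, sketched on a toy table rather than the real query. It assumes the results table inside the query object has a `cases` column holding the sample barcodes; the barcodes and file names here are made up, and the last commented line is an untested guess at how it would apply to my actual object.

```r
# Toy stand-in for the results table stored in the query object.
# Barcodes and file names are invented purely for illustration.
results <- data.frame(
  cases     = c("TCGA-AA-0001-01A", "TCGA-AA-0002-01A", "TCGA-AA-0001-01A"),
  file_name = c("a.txt", "b.txt", "c.txt"),
  stringsAsFactors = FALSE
)

# Row numbers of every entry whose barcode occurs more than once
dup.rows <- which(duplicated(results$cases) |
                  duplicated(results$cases, fromLast = TRUE))

# Keep only the first occurrence of each barcode
results.unique <- results[!duplicated(results$cases), ]

# Assumption (untested): with the real query the same idea would be
# query.colon.cancer$results[[1]] <-
#   query.colon.cancer$results[[1]][!duplicated(query.colon.cancer$results[[1]]$cases), ]
```

Keeping the first occurrence is an arbitrary choice on my part; if there is a better rule for deciding which duplicated file to keep, I would be glad to hear it.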

