Hello everyone!
I'm trying to do an analysis with DEXSeq, but for that I need to connect to biomart first. I need to use the makeTxDbFromBiomart function to generate the TxDb object, but I always get the same error. I have already solved the firewall problem and if I should be able to connect to the internet, I don't know what is not working.
Maybe someone can help me.
I attach the executed command and the error that it returns:
Ensembl site unresponsive, trying useast mirror
Download and preprocess the 'transcripts' data frame ... OK
Download and preprocess the 'chrominfo' data frame ... FAILED! (=> skipped)
Download and preprocess the 'splicings' data frame ... OK
Download and preprocess the 'genes' data frame ... OK
Prepare the 'metadata' data frame ... Error in function (type, msg, asError = TRUE) :
Failed to connect to ftp.ensembl.org port 21: Operation timed out
The Ensembl Mart is notoriously unreliable, but it looks like it's working today:
txdb <- makeTxDbFromBiomart(biomart="ENSEMBL_MART_ENSEMBL", dataset="mmusculus_gene_ensembl")
# Download and preprocess the 'transcripts' data frame ... OK
# Download and preprocess the 'chrominfo' data frame ... OK
# Download and preprocess the 'splicings' data frame ... OK
# Download and preprocess the 'genes' data frame ... OK
# Prepare the 'metadata' data frame ... OK
# Make the TxDb object ... OK
txdb
# TxDb object:
# Db type: TxDb
# Supporting package: GenomicFeatures
# Data source: BioMart
# Organism: Mus musculus
# Taxonomy ID: 10090
# Resource URL: www.ensembl.org:443
# BioMart database: ENSEMBL_MART_ENSEMBL
# BioMart database version: Ensembl Genes 110
# BioMart dataset: mmusculus_gene_ensembl
# BioMart dataset description: Mouse genes (GRCm39)
# BioMart dataset version: GRCm39
# Full dataset: yes
# miRBase build ID: NA
# Nb of transcripts: 149547
# Db created by: GenomicFeatures package from Bioconductor
# Creation time: 2023-08-11 18:57:14 -0700 (Fri, 11 Aug 2023)
# GenomicFeatures version at creation time: 1.52.1
# RSQLite version at creation time: 2.3.1
# DBSCHEMAVERSION: 1.2
Note that an alternative to makeTxDbFromBiomart() is to use makeTxDbFromEnsembl(). The latter will query the Ensembl MySQL server _directly_ (and thus bypass the Mart service completely):
txdb2 <- makeTxDbFromEnsembl("Mus musculus")
# Fetch transcripts and genes from Ensembl ... OK
# (fetched 149547 transcripts from 56941 genes)
# Fetch exons and CDS from Ensembl ... OK
# Fetch chromosome names and lengths from Ensembl ...OK
# Gather the metadata ... OK
# Make the TxDb object ... OK
txdb2
# TxDb object:
# Db type: TxDb
# Supporting package: GenomicFeatures
# Data source: Ensembl
# Organism: Mus musculus
# Ensembl release: 110
# Ensembl database: mus_musculus_core_110_39
# MySQL server: ensembldb.ensembl.org
# Full dataset: yes
# Nb of transcripts: 149547
# Db created by: GenomicFeatures package from Bioconductor
# Creation time: 2023-08-11 18:50:42 -0700 (Fri, 11 Aug 2023)
# GenomicFeatures version at creation time: 1.52.1
# RSQLite version at creation time: 2.3.1
# DBSCHEMAVERSION: 1.2
See ?makeTxDbFromEnsembl for more information, including how to specify a given Ensembl release.
Thanks James, I will try this.