Submitting R packages with external data
1
0
Entering edit mode
@vittoriofortino-7287
Last seen 9.9 years ago
Finland

Dear all,

I am Vittorio Fortino and I am writing you to kindly ask some information about submitting R packages.

Basically, I would like to publish an R package about a method for operon prediction. It uses some external data in order to carry out a complete working example. The external data are stored in a ".RData" file (it contains dataframes and lists) and the size is about 50Mb. 

Should I add it to the fold "data"??

or

Should I define a new "experimental-data" package??

Thanks in advance for your precious help.

Br,

VF  

package • 1.7k views
ADD COMMENT
2
Entering edit mode
Andrzej Oleś ▴ 750
@andrzej-oles-5540
Last seen 4.1 years ago
Heidelberg, Germany

Dear Vittorio,

your .Rdata files should go to the 'data' folder. The objects can be then loaded using 

data(yourObject)

For details see http://cran.r-project.org/doc/manuals/r-release/R-exts.html#Data-in-packages

I would suggest creating a dedicated experimental data package for storing your sample data set. This will allow for a lightweight methods package.

Kind regards,

Andrzej Oles

ADD COMMENT
0
Entering edit mode

Dear Andrzej,

thank you for quick reply. I have further questions. 

My .RData file contains coverage-depth data compiled from RNA-seq data deposited in the GEO database. 

"Could I make/publish an experimental-data package containing this coverage-depth data?" 

"Where should I insert the geo-reference? "

"Should I submit the two packages separately?"

Thanks in advance for your answer.

VF

ADD REPLY
0
Entering edit mode

Dear Vittorio,

for accessing data from the GEO database you might wish to use the GEOquery Bioconductor package. If you already have prepossessed data files you might want to include a function/script in your experiment data package demonstrating how these objects were obtained. The reference to GEO could be mentioned on the package's man pages and/or the vignette.

Regarding the submission process: you probably need to submit both your packages together, otherwise there might be problems in resolving the dependencies correctly.

Hope this helps.

Best,

Andrzej

ADD REPLY
0
Entering edit mode

Ok, I understand. However, the RNA-seq data available in GEO database were not aligned and I used BOWTIE to make it. Therefore, should I provide the instructions to run BOWTIE ??

ADD REPLY
0
Entering edit mode

The RNAseqData.HNRNPC.bam.chr14 package might provide a good example; check out the manual and scripts/ directories.

ADD REPLY

Login before adding your answer.

Traffic: 945 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6