Rqc for RNA-Seq data
6
0
Entering edit mode
@kritikamish99-9648
Last seen 6.3 years ago
India

Hello members,

I am new in rnaseq analysis. I am running Rqc for my files. can any one explain me what is "extdata/E-MTAB-1147" and how to locate my file by giving path command. I am usnig linux machine for path argument how to locate the directory.

Thank You

Rqc • 3.1k views
ADD COMMENT
2
Entering edit mode
@steve-lianoglou-2771
Last seen 21 months ago
United States

You are presumably talking about this line in the Rqc vignette:

folder <- system.file(package="ShortRead", "extdata/E-MTAB-1147")

The system.file function loads a file that can be found in the specified path from the root of the package that is specified. This allows for a mechanism for R to access files that are provided by a package which abstracts away the actual location that the system is configured to install R packages to. See the help page in ?system.file for more information. If you go to the directory that the ShortRead package is installed in your system, you will find an extdata folder, and you can take it from there ...

To answer your question about what that file is, it says so right in the Rqc vignette:

The next step is to determine the location of the FASTQ files that should be analyzed. The example below, uses sample files provided by the ShortRead package, but the user must modify this location accordingly, in order to reflect the actual location of the files that need QA.

So, they are sample (small) FASTQ files that are distributed with the ShortRead package for the purposes of running example code (and, likely, unit tests)

 

ADD COMMENT
2
Entering edit mode
@wdesouza
Last seen 4.1 years ago
Brazil

The rqc function uses list.files function to search a pattern inside a folder defined by the path. You can run list.files first with the same parameters to see which files the rqc funtion will process. An exemple (using your path):

folder <- "/home/Downloads/Work_Kriti/NGS_Data"
list.files(path = folder, pattern = ".fastq.gz")

Printed files are the files that you want the Rqc to process? If yes, just replace the function. An example:

folder <- "/home/Downloads/Work_Kriti/NGS_Data"
rqc(path = folder, pattern = ".fastq.gz")

In this case, you don't need to use system.file function. The examples in Rqc's vignette use this function to get folder path located inside ShortRead package (as mentioned by Steve Lianoglou).

ADD COMMENT
0
Entering edit mode

Thanks Welliton Souza ..

This was really help full. And i got output also.

Thanks ..

ADD REPLY
1
Entering edit mode
@wdesouza
Last seen 4.1 years ago
Brazil

Rqc automatically process multiple files in parallel. The multicoreWorkers function returns the number of available processing units.

The rqc function generates only one report file for all FASTQ files. The output directory of report file is defined by outdir parameter and the name of the report file is defined by file parameter.

If you want to have more control on report file, such as generating more than one report, use rqcQA and rqcReport functions instead of rqc function. An exemple using your code:

folder <- "/home/Downloads/Work_Kriti/NGS_Data"
files <- list.files(path = folder, pattern = ".fastq.gz")
# process all FASTQ files and store result data
result <- rqcQA(files)
# generate report for the first 10 FASTQ files
rqcReport(result[1:10])

As mentioned by Steve Lianoglou, please read the manual of the ?rqc?rqcQA and ?rqcReport functions.

ADD COMMENT
0
Entering edit mode
@kritikamish99-9648
Last seen 6.3 years ago
India

Thanks Steve Lianoglou for help. 

Also i need help  in this line  "rqc(path = folder, pattern = ".fastq.gz")". I understood the path is the location of my file.  But my question is  , in linux to go to specific directory we write /directory/filename but in R (path= )  how i will specifying   "path =  " 

ADD COMMENT
0
Entering edit mode
I am getting such error:-

Error in rqcQA(files, sample, n, groupFactor, BPPARAM = BPPARAM) : 
  Input files were not provided.
ADD REPLY
0
Entering edit mode

You're going to need to provide the code that you have run which leads to the error so that we can have a reproducible example to work with. Secondly you will also need to provide output from sessionInfo()

ADD REPLY
0
Entering edit mode

This is the code i am running

> folder <- system.file(package="ShortRead", "home/Downloads/Work_Kriti/NGS_Data")
> rqc(path = folder , pattern = "File1_R1.fastq.gz")
Error in rqcQA(files, sample, n, groupFactor, BPPARAM = BPPARAM) :
  Input files were not provided
ADD REPLY
0
Entering edit mode
@kritikamish99-9648
Last seen 6.3 years ago
India

Hello member In Rqc how to define output file name for each sample. I am using code

rqc(path=folder , pattern = ".fastq", openBrowser = FALSE, outdir = "/home/Downloads/Work_Kriti/NGS_Data/" , file="Rqc_file")

I have 66 paired samples so if i give pattern as .fastq it will run the quality check per sample. But what will the file for each sample. Thank You.

ADD COMMENT
0
Entering edit mode

And what is the fucntion of

workers = multicoreWorkers()

ADD REPLY
0
Entering edit mode

Please read the help page for the multicoreWorkers function. You can do that by typing ?multicoreWorkers into the R console. That function is part of the BiocParallel library.

ADD REPLY
0
Entering edit mode

ok. I need help for my first question.

In Rqc how to define output file name for each sample. I am using code 

rqc(path=folder , pattern = ".fastq", openBrowser = FALSE, outdir = "/
/home/Downloads/Work_Kriti/NGS_Data/" ,  file="Rqc_file")

I have 66 paired samples so if i give pattern as .fastq it will run the quality check per sample. But what will the file for each sample. 

Thank You.

 

ADD REPLY
0
Entering edit mode
@kritikamish99-9648
Last seen 6.3 years ago
India

ok

Thanks for the help Welliton Souza !!!!!!!! 

ADD COMMENT

Login before adding your answer.

Traffic: 779 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6