information retrieval from pubmed
1
0
Entering edit mode
Ed ▴ 230
@ed-4683
Last seen 10.2 years ago
Hi there, I am wondering if there is a tool (in bioconductor or not, off-topic I am sorry) that can help me retrieve, for example, all gene symbols that appears in cancer biology in the journals of the last 5 years? Many thanks! Nick [[alternative HTML version deleted]]
Cancer Cancer • 1.3k views
ADD COMMENT
0
Entering edit mode
@steve-lianoglou-2771
Last seen 21 months ago
United States
Hi, On Fri, May 9, 2014 at 3:38 PM, Nick <edforum at="" gmail.com=""> wrote: > Hi there, > > I am wondering if there is a tool (in bioconductor or not, off-topic I am > sorry) that can help me retrieve, for example, all gene symbols that > appears in cancer biology in the journals of the last 5 years? There's the cancer census gene list: http://cancer.sanger.ac.uk/cosmic/census Which isn't exactly what you want, but it's perhaps in the same ballpark. -steve -- Steve Lianoglou Computational Biologist Genentech
ADD COMMENT
0
Entering edit mode
Hi Nick, COSMIC is generally a well curated source for your purpose. Going with Steve's suggestion, you can use the 'COSMIC.67' bioconductor package to get the cancer gene census list: data(cgc_67, package = "COSMIC.67") The Cancer Gene Census (CGC) is a list of genes that are causal to cancer, currently including ~600 genes. If you want to go a step further, you could parse the mutation calls from a number of large scale cancer sequencing studies, mainly the ICGC and TCGA. You can find the somatic mutation calls of 8 TCGA studies in the 'SomaticCancerAlterations' package, and could find the genes overlapping the mutations. Best wishes Julian
ADD REPLY
0
Entering edit mode
Hi Nick, COSMIC is generally a well curated source for your purpose. Going with Steve's suggestion, you can use the 'COSMIC.67' bioconductor package to get the cancer gene census list: data(cgc_67, package = "COSMIC.67") The Cancer Gene Census (CGC) is a list of genes that are causal to cancer, currently including ~600 genes. If you want to go a step further, you could parse the mutation calls from a number of large scale cancer sequencing studies, mainly the ICGC and TCGA. You can find the somatic mutation calls of 8 TCGA studies in the 'SomaticCancerAlterations' package, and could find the genes overlapping the mutations. Best wishes Julian
ADD REPLY
0
Entering edit mode
Hi Nick, COSMIC is generally a well curated source for your purpose. Going with Steve's suggestion, you can use the 'COSMIC.67' bioconductor package to get the cancer gene census list: data(cgc_67, package = "COSMIC.67") The Cancer Gene Census (CGC) is a list of genes that are causal to cancer, currently including ~600 genes. If you want to go a step further, you could parse the mutation calls from a number of large scale cancer sequencing studies, mainly the ICGC and TCGA. You can find the somatic mutation calls of 8 TCGA studies in the 'SomaticCancerAlterations' package, and could find the genes overlapping the mutations. Best wishes Julian
ADD REPLY
0
Entering edit mode
Thanks Julian and Steve. But I think I am more concerned with some results like after information retrieval, kind of a one step before text mining. So those genes after retrieval might just be "associated" with cancer. Is there some online tools to extract those info from like pubmed? Best, Nick On Sat, May 10, 2014 at 3:20 AM, Julian Gehring <julian.gehring@embl.de>wrote: > Hi Nick, > > COSMIC is generally a well curated source for your purpose. > > Going with Steve's suggestion, you can use the 'COSMIC.67' bioconductor > package to get the cancer gene census list: > > data(cgc_67, package = "COSMIC.67") > > The Cancer Gene Census (CGC) is a list of genes that are causal to cancer, > currently including ~600 genes. > > If you want to go a step further, you could parse the mutation calls from > a number of large scale cancer sequencing studies, mainly the ICGC and > TCGA. You can find the somatic mutation calls of 8 TCGA studies in the > 'SomaticCancerAlterations' package, and could find the genes overlapping > the mutations. > > Best wishes > Julian > > _______________________________________________ > Bioconductor mailing list > Bioconductor@r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: http://news.gmane.org/gmane. > science.biology.informatics.conductor > [[alternative HTML version deleted]]
ADD REPLY

Login before adding your answer.

Traffic: 475 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6