Why dbConnect GEOmetadb_demo.sqlite shows some information while dbConnect GEOmetadb.sqlite shows no information
1
1
Entering edit mode
Lee ▴ 10
@f7d2e8fe
Last seen 14 months ago
Russia

When I used the demo sqlite file, there are some outputs


>sqlfile_demo <- getSQLiteFile(destdir = "~/bin/Rpack", destfile = "GEOmetadb.sqlite_demo.gz",type="demo")
>con <- dbConnect(SQLite(),sqlfile_demo)
>getBiocPlatformMap(con)[1:5,]

                                                                       title      gpl                 bioc_package         manufacturer
1                                      [Maize] Affymetrix Maize Genome Array  GPL4032                                        Affymetrix
2 Agilent-026652 Whole Human Genome Microarray 4x44K v2 (Probe Name version) GPL13497        HsAgilentDesign026652 Agilent Technologies
3     Illumina HumanMethylation27 BeadChip (HumanMethylation27_270596_v.1.2)  GPL8490  IlluminaHumanMethylation27k       Illumina, Inc.
4       Illumina HumanMethylation450 BeadChip (HumanMethylation450_15017482) GPL13534 IlluminaHumanMethylation450k       Illumina, Inc.
5                                   [AG] Affymetrix Arabidopsis Genome Array    GPL71                           ag           Affymetrix
              organism data_row_count
1             Zea mays          17734
2         Homo sapiens          34184
3         Homo sapiens          27578
4         Homo sapiens         485577
5 Arabidopsis thaliana           8297

> object.size(con)
2112 bytes

But when I used the complete sqlite file, there is no data.

>sqlfile <- getSQLiteFile(destdir = "~/bin/Rpack", destfile = "GEOmetadb.sqlite.gz",type="normal")
>con<-dbConnect(SQLite(),sqlfile)
>getBiocPlatformMap(con)[1:5,]
     title  gpl bioc_package manufacturer organism data_row_count
NA    <NA> <NA>         <NA>         <NA>     <NA>             NA
NA.1  <NA> <NA>         <NA>         <NA>     <NA>             NA
NA.2  <NA> <NA>         <NA>         <NA>     <NA>             NA
NA.3  <NA> <NA>         <NA>         <NA>     <NA>             NA
NA.4  <NA> <NA>         <NA>         <NA>     <NA>             NA

> object.size(con)
2096 bytes

enter image description here

GEOmetadb GEOmetadb.sqlite dbConnect • 669 views
ADD COMMENT
0
Entering edit mode
@james-w-macdonald-5106
Last seen 40 minutes ago
United States

The current GEOmetadb.sqlite file apparently has no entries for the bioc_package field. The table isn't empty however.

## 
> z <- dbGetQuery(con, "select title, gpl, manufacturer, organism, data_row_count from gpl where manufacturer is not null order by title;")
> head(z)
                                                                                           title      gpl
1 "Dehalococcoides ethenogenes" strain 195 with Methanogens and Fosmids, Base Composition Method GPL11218
2                        "Dehalococcoides ethenogenes" strain 195 with Mixed Community Sequences GPL10023
3   "Dehalococcoides ethenogenes" strain 195 with Mixed Community Sequences [Probe Name version] GPL16121
4                                                       (OID28929)100603_Sus_scrofa_CB_expr_HX12 GPL18875
5                                        017617 IMUL-Candida glabrata 8x15k [Probe name version] GPL22200
6                                                         018538_Agilent Agaricus bisporus array GPL20649
          manufacturer                       organism data_row_count
1              Agilent   Dehalococcoides mccartyi 195          15744
2              Agilent   Dehalococcoides mccartyi 195          15744
3              Agilent   Dehalococcoides mccartyi 195           1687
4            NimbleGen                     Sus scrofa          45022
5 Agilent Technologies Nakaseomyces glabratus CBS 138          10488
6              Agilent              Agaricus bisporus          11521

## just Affymetrix
> z <- dbGetQuery(con, "select title, gpl, manufacturer, organism, data_row_count from gpl where manufacturer='Affymetrix' order by title;")
> head(z)
                         title     gpl manufacturer     organism data_row_count
1 AFFY Human Phase3 v1.0 - C01 GPL5234   Affymetrix Homo sapiens        3161260
2 AFFY Human Phase3 v1.0 - C02 GPL5235   Affymetrix Homo sapiens        4053015
3 AFFY Human Phase3 v1.0 - C03 GPL5236   Affymetrix Homo sapiens        3223204
4 AFFY Human Phase3 v1.0 - C04 GPL5237   Affymetrix Homo sapiens        2951193
5 AFFY Human Phase3 v1.0 - C05 GPL5238   Affymetrix Homo sapiens        3469626
6 AFFY Human Phase3 v1.0 - C06 GPL5239   Affymetrix Homo sapiens        3052792

## try to get hgu133plus2 data
> z <- dbGetQuery(con, "select title, gpl, manufacturer, organism, data_row_count from gpl where manufacturer='Affymetrix' and title like '%U133_Plus%' order by title;")
> head(z)
                                                                                                          title
1    Affymetrix GeneChip Human Genome HG-U133 Plus 2 Array [CDF: Brainarray Version 11, HGU133Plus2_Hs_ENTREZG]
2                               Affymetrix GeneChip Human Genome HG-U133 Plus 2 Array [Hs133P_Hs_ENTREZG_8.cdf]
3 Affymetrix GeneChip Human Genome U133 Array Set HG-U133 Plus2 based on a custom CDF (GeneAnnot version 2.1.0)
4 Affymetrix GeneChip Human Genome U133 Array Set HG-U133 Plus2 based on a custom CDF (GeneAnnot version 2.2.0)
5                    Affymetrix GeneChip Human Genome U133 Plus 2.0 Array (Brainarray CustomCDF [Hs133P_Hs_UG])
6 Affymetrix GeneChip Human Genome U133 Plus 2.0 Array (Brainarray CustomCDF, HGU133Plus2_Hs_UG_Version 11.cdf)
       gpl manufacturer     organism data_row_count
1  GPL8712   Affymetrix Homo sapiens          17788
2 GPL13303   Affymetrix Homo sapiens          17663
3 GPL19633   Affymetrix Homo sapiens          18862
4 GPL19171   Affymetrix Homo sapiens          19204
5 GPL10925   Affymetrix Homo sapiens          22244
6  GPL9099   Affymetrix Homo sapiens          22244
ADD COMMENT

Login before adding your answer.

Traffic: 891 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6