I'm trying to use GeneGA as a component in codon optimisation for expression of polypeptide sequences in mammalian cells. It appears to work well, but I have an annoying issue. The package is said to include a database to optimise for 200 organisms. However, the abbreviation used to specify which organism appears to be non-standard and without documentation. I have tried this list: http://www.genome.jp/kegg/catalog/org_list.html as well as tried many real name alternatives. The one given as example int the documentation is "ec" (I assume ar e.coli, but that is not specified either).
Can anyone please help me finding the right abbreviation for human, rat and mouse for use in GeneGA?
Thanks'
the names used for available organisms seem to be extractable as follows
Thank you Vincent. This is clearly one step forward and two steps back as the list contains no mammalian species. I have the equivalent data for the species I need however. Is there a way to inject this data into the wSet data table before execution or make it always include this data as it has done with the three species for the seqinr caitab data?
I apologise if this is an obvious question, but I'm still rather new to bioconductor and R.