
Cenicafe has implemented a web-based bioinformatics platform that serves as a genomics information resource for coffee and other organisms studied at the Center. The platform includes a Laboratory Integrated Management System (LIMS), wEMBOSS, in-house Perl tools for data analysis, InterProScan for the annotation of sequence domains, and wBLAST, among other tools. The backbone of the system is an adaptation of the SOL Genomics Network (SGN) databases, developed at Cornell University for the storage and analysis of ESTs, molecular markers and BAC sequences. The system is built on the PostgreSQL relational database, Perl scripts for data manipulation, and the Apache web server with the integrated mod_perl interpreter; the servers run the Debian distribution of the GNU/Linux operating system. Although SGN was developed mainly as a plant genomics resource, the Cenicafe platform has added several new tools and databases for the analysis of sequence data from other organisms, such as fungi and insects. To date, the Cenicafe databases contain over 120,000 coffee EST sequences from 22 libraries, organized into 31,473 C. arabica, 11,657 C. liberica and 2,047 C. kapakata unigenes; 6,000 Beauveria bassiana EST sequences organized into 2,404 unigenes; and over 200,000 Hypothenemus hampei (coffee berry borer) and H. obscurus EST sequences organized into 28,483 and 12,420 unigenes, respectively, in addition to the more than 100,000 Solanaceae unigene sequences annotated at SGN. We are currently annotating the Hemileia vastatrix genome, which is being sequenced using Illumina and 454 sequencing technologies in collaboration with scientists from Universidad de los Andes.
Sequences are annotated by comparison against Solanaceae, Arabidopsis, Swiss-Prot and GenBank sequences using BLAST homology searches; amino acid sequences are predicted using ESTScan, domains are annotated using InterProScan, and gene families are annotated using a Perl script developed at SGN. In the near future the system will add a database of coffee genetic resources developed at Cenicafe, a proteomics platform, and a microarray database. We will also incorporate other components into the platform, especially for the visualization of genetic maps, from the GMOD project (GBrowse), the SGN system, TIGR, and other open-source projects.
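
As an illustration of the BLAST-based annotation step described above, the following minimal Python sketch selects one best hit per unigene from BLAST tabular output. It is not the platform's actual code, which is written in Perl. The function name `best_hits`, the E-value cutoff of 1e-10, the choice of bit score as the ranking criterion, and the sequence identifiers in the example are all assumptions made for illustration; only the 12-column tabular layout is the standard BLAST one.

```python
import csv
import io

# Standard column order of BLAST tabular output
# (legacy blastall `-m 8`, BLAST+ `-outfmt 6`).
FIELDS = ["qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
          "qstart", "qend", "sstart", "send", "evalue", "bitscore"]


def best_hits(handle, max_evalue=1e-10):
    """Return {query_id: hit} keeping the highest-bit-score hit per query.

    The cutoff and the ranking rule are illustrative assumptions,
    not values taken from the Cenicafe or SGN pipelines.
    """
    best = {}
    for row in csv.reader(handle, delimiter="\t"):
        # Skip blank lines, comment lines, and malformed rows.
        if len(row) < len(FIELDS) or row[0].startswith("#"):
            continue
        hit = dict(zip(FIELDS, row))
        hit["evalue"] = float(hit["evalue"])
        hit["bitscore"] = float(hit["bitscore"])
        if hit["evalue"] > max_evalue:
            continue
        current = best.get(hit["qseqid"])
        if current is None or hit["bitscore"] > current["bitscore"]:
            best[hit["qseqid"]] = hit
    return best


# Made-up rows with hypothetical identifiers, for demonstration only.
example = (
    "unigene_1\tsubject_A\t92.5\t200\t15\t0\t1\t200\t10\t209\t1e-50\t180.0\n"
    "unigene_1\tsubject_B\t98.0\t210\t4\t0\t1\t210\t5\t214\t1e-80\t250.0\n"
    "unigene_2\tsubject_C\t40.0\t50\t30\t0\t1\t50\t1\t50\t0.5\t30.0\n"
)

annotations = best_hits(io.StringIO(example))
for query, hit in annotations.items():
    print(query, hit["sseqid"], hit["evalue"], hit["bitscore"])
```

In this example the weak hit for `unigene_2` falls above the cutoff and is dropped, so that unigene would remain unannotated by this step. A real pipeline would run the same selection separately against each reference set (Solanaceae, Arabidopsis, Swiss-Prot, GenBank).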

