Computational genome and proteome annotation
Scientific question
Large-scale automatic annotation and analysis of plant genomes is challenging due to their size, astonishing diversity, high proportion of repeats and pseudogenes, and the presence of very extensive gene families.
Approach
•Collection and organization of genome sequencing data
•Integration and benchmarking of gene prediction tools.
•Development of a computational pipeline for genome annotation using proteomics cues.
•Comparative genome analysis
•Integration of OMICs data
Expected results
•High-quality annotation of coding and regulatory regions
•Insights into the evolution of plant genomes
•Comprehensive publicly available resource for the plant genomics community