CONTACT  |  SITE MAP  |  ABOUT US   
Ask an account
You are here : Home / Home URGI / About us / Publications / 2012 / An iterative process for TEs annotation in large genomes

2012

International,  COM (posters)

3rd International Conference GIETE 2012, February 24-28 2012, Asilomar Pacific Grove, California, USA

24 Feb 2012   An iterative process for TEs annotation in large genomes
Poster_GIETE2012_VJ

V Jamilloux, S Arnoux, O Inizan, H Quesneville

The recent successes of new sequencing technologies allow today the sequencing of very large genomes at reasonable costs. Transposable elements (TEs) constitute the most structurally dynamic components and the largest portion of their nuclear genomes, e.g. 85% of the maize genome (Schnable et al. 2009), and 88% of the wheat genome (Choulet et al. 2010). Therefore, TE annotation should be considered as a major task in genome projects, but currently not be obtained automatiquely. This crucial step is now a bottleneck for many genome analyses.

In this context, we improve, in the v2.0 release - see poster “REPET v2” (Arnoux et al. 2012) in this conference -, the REPET package (Flutre et al. 2011). REPET, gathers two pipelines: TEdenovo build a TEs library and TEannot annotate TE copies in the genome. Thus, we test a new strategie dedicated to annotate very large genomes, the iterative approach:

  1. Detection of young TEs with stringent parameters able to find quickly only the less degenerate ones to build a first TE library.
  2. TE annotation and extraction of the corresponding sequences from the initial contigs. We obtain a reduced genome sequence.
  3. Detection of the other TEs with sensitive parameters on the reduced genome sequence to build a second TE library.
  4. Annotation of the original contigs with the concatenation of the two TE libraries.

The rational is that these large genomes are made of mostly few TE families that recently invade. They will be detected in the first step and this will allow reducing the genome by an important factor. We tested this approach on A.thaliana and we will present the benchmarks obtained. We will also present preliminary results on the 3B chromosome of wheat (293890 contigs, 975Mbp), the first fully sequenced chromosome of an ~17Gbp allohexaploid genome.


Keywords: detection annotation element transposable REPET wheat genome
Update: 24 Feb 2013
Creation date: 14 Mar 2012