Study: Cupressus gigantea Raw sequence reads

Identification

Name
Cupressus gigantea Raw sequence reads
Identifier
dXJuOkVWQS9zdHVkeS9QUkpOQTM5NzEwMA==
Source
Description
Cupressus gigantea, a rare and endangered conifer species of remarkable ecological value, is endemic to the Tibetan Plateau. However, little is known about the genetic basis of its unique ecological adaptation. Here, we present the first de novo transcriptome assembly and characterization of the species, based on Illumina paired-end sequencing. In total, 144,175,052 high-quality reads were assembled into 101,092 unigenes with an average sequence length of 1,029 bp. Similarity searches against known proteins showed that 33,302 (32.94% of all unigenes), 24,078 (23.81%), 28,087 (27.78%), and 16,600 (16.42%) unigenes had homologs in the Nr, Swiss-Prot, GO, and KOG databases, respectively. In addition, 6,848 (6.77%) unigenes were mapped to 292 pathways by BLASTX comparison against the KEGG database. We also identified 45 putative universal stress proteins (USPs), 43 leucine-rich repeat receptor-like protein kinases (LRR-RLKs), and 121 putative terpene synthases (TPSs). Additionally, a core set of 41,373 genes belonging to 2,412 orthologous gene families shared by the ten plants analyzed was examined, and we identified 2,515 gene families, containing 9,223 genes, that are specific to C. gigantea. These gene families were enriched for several gene ontology (GO) terms, including terms involved in interaction with other organisms. Our study represents the first comprehensive transcriptome analysis of C. gigantea. These findings will pave the way for future functional genomic studies and for breeding and conservation programs in this endangered species.
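The annotation percentages in the description can be recomputed from the reported counts. The following is a minimal sketch, not part of the original study: the names `TOTAL_UNIGENES`, `annotated`, and `percent_of_unigenes` are hypothetical, and it assumes every percentage is taken relative to the 101,092 assembled unigenes. Recomputed values may differ from the reported ones in the last digit depending on how the originals were rounded.

```python
# Counts copied from the description; all names here are illustrative.
TOTAL_UNIGENES = 101_092

# Number of unigenes with a hit in each database.
annotated = {
    "Nr": 33_302,
    "Swiss-Prot": 24_078,
    "GO": 28_087,
    "KOG": 16_600,
    "KEGG": 6_848,
}


def percent_of_unigenes(count, total=TOTAL_UNIGENES):
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100 * count / total, 2)


percentages = {db: percent_of_unigenes(n) for db, n in annotated.items()}

for db, pct in percentages.items():
    print(f"{db}: {pct}%")
```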
Data files

Genotype

Accession number Name Taxon