Study : The Oxford Nanopore, Hi-C and Illumina HiSeq data used for Morus alba L. genome assembly.
The Oxford Nanopore, Hi-C and Illumina HiSeq data used for Morus alba L. genome assembly.
Mulberry (Morus alba L) is the sole feed crop of silkworm (Bombyx mori), i.e. the essential material of the traditional silk industry. However, the genome of mulberry with 28 chromosomes has not been sequenced. To perform de novo assembly of Morus alba L. genome, we combined three different technologies: Nanopore, Hi-C and Illumina HiSeq-platform. Using DNA from the leaves of Heyebai(a variety of Morus alba L.), we generated 154-fold coverage of Nanopore data (51.5 Gb), 121-fold coverage high- throughput chromatin conformation capture (Hi-C) data, and 82-fold coverage of Illumina paired-end reads (27.9 Gb). The Nanopore reads were first corrected with Canu and later assembled by smartdenovo (version 1.06) to an assembly that consisted of 398 contigs with an N50 of 2.71 Mb. Then the contigs were corrected and polished with Illumina paired-end sequences, in which 98.38% could be scaffolded on 14 chromosomes by Hi-C data with Juicer and 3D-dna. A final genome of 346.39Mb was assembled with scaffold N50 of 22.87 Mb. Analysis of the Plantae conserved core eukaryotic genes derived from the benchmarking universal single-copy ortholog (BUSCO v3.0.2) plant lineages showed 94.3% of genes in the dataset were identified.