Genome Assembly and Evaluation
The 148.75 Gb subreads data were obtained from Pacbio sequel sequencing
technology to apply in genome assembly.
The initial genome is 1,217,613,942
bp and then polished genome is 1,223,597,352 bp, which were larger than
the 17-mer estimating size (949,878,192 bp). As for the
autotetraploid,
there were some redundant sequences
product during assembly. So 278,014,088 bp redundant sequences were
removed from the polished genome. Finally, ultimate genome is the
non-redundant haploid genome and the size is
945,583,264 bp and N50 is 1,645,408
bp, including 967 contigs (Table 1).
For the quality of ultimate genome, some information can indicate that
the genome is very precise. As for the precision of genomeļ¼the coverage
of TGS and NGS is nearly 100% and the accuracy of genome is more than
99.99%. As for the completeness of genome, BUSCO database searched the
genome and 96.06% complete genes were detected in the genome (Table 1).