Genome Assembly and Evaluation
The 148.75 Gb subreads data were obtained from Pacbio sequel sequencing technology to apply in genome assembly. The initial genome is 1,217,613,942 bp and then polished genome is 1,223,597,352 bp, which were larger than the 17-mer estimating size (949,878,192 bp). As for the autotetraploid, there were some redundant sequences product during assembly. So 278,014,088 bp redundant sequences were removed from the polished genome. Finally, ultimate genome is the non-redundant haploid genome and the size is 945,583,264 bp and N50 is 1,645,408 bp, including 967 contigs (Table 1).
For the quality of ultimate genome, some information can indicate that the genome is very precise. As for the precision of genome,the coverage of TGS and NGS is nearly 100% and the accuracy of genome is more than 99.99%. As for the completeness of genome, BUSCO database searched the genome and 96.06% complete genes were detected in the genome (Table 1).