Library construction and genome sequencing
The Illumina NovaSeq-6000 and PacBio Sequel II platforms were applied
for genomic sequencing to generate short and long genomic reads,
respectively. Illumina sequencing libraries were prepared to estimate
the genome size, correct the genome assembly, and evaluate assemblies. A
paired-end library was constructed with an insert size of 300 bp
according to the Illumina standard protocol. After discarding reads with
low-quality bases (reads with more than 10% N bases or low-quality
bases≤5), adapter sequences, and duplicated sequences, the clean reads
were used for subsequent analysis.
For long-read sequencing, we constructed an SMRTbell library with a
fragment size of 20 Kb by using the SMRTBell template preparation kit
1.0 (PacBio, USA) by following the manufacturer’s protocol. The library
was sequenced with the PacBio Sequel II system, and data from one SMRT
cell were generated.