Metagenomic next-generation sequencing
In total, 76.38 gigabytes (GB) of raw data, averaging 5.46 GB per
sample, and 509 209 214 sequences (Anhui: 214 833 000; Guizhou:
179 543 666; Hunan: 114 832 548) were obtained from the 14 samples
of V. stejnegeri across three sampling localities. After QC
filtering, 75.16 GB of clean data, averaging 5.38 GB per sample, and
500 924 546 sequences (Anhui: 212 806 018; Guizhou: 178 779 316;
Hunan: 109 399 212) were obtained. The percentages of bases reaching Q20
and Q30 both exceeded 90%, and the GC content was above 40%, indicating high
sequencing accuracy. The effective data rate exceeded 95% in all samples,
indicating that most sequences could be annotated and that the samples
were sufficient and representative for subsequent analyses (Table 1).
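
As a quick consistency check on the figures above, the fraction of data retained after QC can be recomputed from the reported totals. The sketch below is illustrative only: the names `retention_pct`, `READS`, `gb_retention`, and `read_retention` are hypothetical, and using per-locality read totals as a stand-in for the per-sample effective rate in Table 1 is an assumption.

```python
# Totals as reported in the text; all names here are hypothetical.
RAW_GB, CLEAN_GB = 76.38, 75.16

# (raw reads, clean reads) per sampling locality, as reported in the text.
READS = {
    "Anhui": (214_833_000, 212_806_018),
    "Guizhou": (179_543_666, 178_779_316),
    "Hunan": (114_832_548, 109_399_212),
}


def retention_pct(raw: float, clean: float) -> float:
    """Return the percentage of raw data that survives QC filtering."""
    if raw <= 0:
        raise ValueError("raw must be positive")
    return 100.0 * clean / raw


# Retention by data volume (GB) across all 14 samples.
gb_retention = retention_pct(RAW_GB, CLEAN_GB)

# Retention by read count for each locality (assumed proxy for per-sample rates).
read_retention = {
    loc: retention_pct(raw, clean) for loc, (raw, clean) in READS.items()
}

if __name__ == "__main__":
    print(f"data volume retained: {gb_retention:.2f}%")
    for loc, pct in read_retention.items():
        print(f"{loc}: {pct:.2f}% of reads retained")
```

With these figures, every locality retains more than 95% of its reads, which is consistent with the effective data rate stated above.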
Sequence length, TPM (transcripts per million) values, and read counts
are detailed in Table 2.
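
For reference, TPM is conventionally obtained by dividing each read count by the sequence length and then rescaling so that the values within a sample sum to one million. The notation below is introduced here for illustration: $r_i$ is the number of reads assigned to sequence $i$, $l_i$ is its length, and the sum runs over all sequences in the sample. Whether the values in Table 2 were computed in exactly this way is an assumption.

```latex
\mathrm{TPM}_i \;=\; \frac{r_i / l_i}{\sum_{j} r_j / l_j} \times 10^{6}
```

Because of the normalization in the denominator, TPM values are comparable across samples with different sequencing depths.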