Metagenomic next-generation sequencing
In total, 76.38 gigabytes (GB) of raw data, averaging 5.46 GB per sample, and 509 209 214 sequences (Anhui: 214 833 000; Guizhou: 179 543 666; Hunan: 114 832 548) were obtained from the 14 samples of V. stejnegeri across three sampling localities. After QC filtering, 75.16 GB of clean data, averaging 5.38 GB per sample, and 500 924 546 sequences (Anhui: 212 806 018; Guizhou: 178 779 316; Hunan: 109 399 212) were obtained. The base percentages of Q20 and Q30 exceeded 90%, and the GC content was above 40%, indicating high sequencing accuracy. The effective data from all samples exceeded 95%, indicating that most sequences could be annotated and sampled samples were sufficient and representative for subsequent analyses (Table 1). Sequence length, TPM (transcripts per kilobase of exon model per million mapped reads) values, and number of reads are detailed in Table 2.