Essential Site Maintenance: Authorea-powered sites will be updated circa 15:00-17:00 Eastern on Tuesday 5 November.
There should be no interruption to normal services, but please contact us at [email protected] in case you face any issues.

loading page

Beyond DNA barcoding: The unrealised potential of genome skim data in sample identification
  • +1
  • Kristine Bohmann,
  • Siavash Mirarab,
  • Vineet Bafna,
  • Tom Gilbert
Kristine Bohmann
University of Copenhagen

Corresponding Author:[email protected]

Author Profile
Siavash Mirarab
UC San Diego
Author Profile
Vineet Bafna
UCSD
Author Profile
Tom Gilbert
University of Copenhagen Globe Institute
Author Profile

Abstract

Genetic tools are increasingly used to identify and discriminate between species. One key transition in this process was the recognition of the potential of the ca 658bp fragment of the organelle cytochrome c oxidase I (COI) as a barcode region, which revolutionised animal bioidentification and lead, among others, to the instigation of the Barcode of Life database (BOLD), containing currently barcodes from >7.9 million specimens. Following this discovery, suggestions for other organellar regions and markers, and the primers with which to amplify them, have been continuously proposed. Most recently, the field has taken the leap from PCR based generation of DNA references into shotgun sequencing-based ‘genome skimming’ alternatives, which the ultimate goal of assembling organellar reference genomes. Unfortunately, in genome skimming approaches, much of the nuclear genome (as much as 99% of the sequence data) is discarded, which is not only wasteful but can also limit the power of discrimination at or below the species level. Here, we advocate that the full shotgun sequence data can be used to assign an identity (that we term for convenience its ‘DNA-mark’) for both voucher and query samples, without requiring any computationally intensive pretreatment (e.g., assembly) of reads. We argue that if reference databases are populated with such ‘DNA-marks’, it will enable future DNA-based taxonomic identification to complement, or even replace PCR of barcodes with genome skimming, and we discuss how such methodology ultimately could enable identification to population, or even individual, level.
26 Feb 2020Submitted to Molecular Ecology
26 Feb 2020Submission Checks Completed
26 Feb 2020Assigned to Editor
03 Mar 2020Reviewer(s) Assigned
18 Apr 2020Review(s) Completed, Editorial Evaluation Pending
29 Apr 2020Editorial Decision: Revise Minor
03 Jun 2020Review(s) Completed, Editorial Evaluation Pending
03 Jun 20201st Revision Received
05 Jun 2020Editorial Decision: Accept