BAGS: an automated Barcode, Audit & Grade System for DNA barcode reference libraries.
  • João Tadeu Fontes,
  • Pedro Vieira,
  • Torbjørn Ekrem,
  • Pedro Soares,
  • Filipe Costa
João Tadeu Fontes
University of Minho

Corresponding Author: [email protected]

Pedro Vieira
University of Minho
Torbjørn Ekrem
Norwegian University of Science and Technology
Pedro Soares
University of Minho
Filipe Costa
University of Minho

Abstract

Biodiversity studies greatly benefit from molecular tools, such as DNA metabarcoding, which provides an effective identification tool in biomonitoring and conservation programmes. The accuracy of species-level assignment, and consequent taxonomic coverage, relies on comprehensive DNA barcode reference libraries. The role of these libraries is to support species identification, but accidental errors in the generation of the barcodes may compromise their accuracy. Here we present an R-based application, BAGS (Barcode, Audit & Grade System), that performs automated auditing and annotation of cytochrome c oxidase subunit I (COI) sequence libraries available in the Barcode of Life Data System (BOLD) for a given taxonomic group of animals. It then applies a qualitative ranking system that assigns one of five grades (A to E) to each species in the reference library, according to the attributes of the data and the congruency of species names with sequences clustered in Barcode Index Numbers (BINs). Our ultimate goal is to allow researchers to obtain the most useful and reliable data by highlighting and segregating records according to their congruency. We performed several tests to assess its usefulness and limitations. BAGS fills a significant gap in the current landscape of DNA barcoding research tools by quickly screening reference libraries to gauge the congruence status of data and facilitate the triage of ambiguous data for subsequent review. BAGS thereby has the potential to become a valuable addition to forthcoming DNA metabarcoding studies, contributing in the long term to a global improvement in the quality and reliability of public reference libraries.
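
The grading idea can be illustrated with a minimal sketch. The abstract does not state the rules behind each grade, and BAGS itself is written in R, so everything below is an assumption made for illustration only: the function name `grade_species`, the input shape (pairs of species name and BIN identifier), the order in which the rules are applied, and the specimen-count thresholds `many` and `few` are all hypothetical and should not be read as the published criteria.

```python
from collections import defaultdict


def grade_species(records, many=10, few=3):
    """Assign an illustrative grade (A to E) to each species.

    records: iterable of (species_name, bin_id) pairs, one per barcode record.
    many, few: hypothetical specimen-count thresholds (assumptions, not
    values taken from the abstract).
    """
    bins_of = defaultdict(set)      # species -> BINs its records fall into
    species_in = defaultdict(set)   # BIN -> species names found in it
    count = defaultdict(int)        # species -> number of records

    for species, bin_id in records:
        bins_of[species].add(bin_id)
        species_in[bin_id].add(species)
        count[species] += 1

    grades = {}
    for species, bins in bins_of.items():
        if any(len(species_in[b]) > 1 for b in bins):
            grades[species] = "E"   # shares a BIN with another species name
        elif len(bins) > 1:
            grades[species] = "C"   # split across several BINs, none shared
        elif count[species] < few:
            grades[species] = "D"   # congruent, but very few records
        elif count[species] <= many:
            grades[species] = "B"   # congruent, moderate number of records
        else:
            grades[species] = "A"   # congruent, many records


    return grades
```

Calling `grade_species` on a list of (species, BIN) pairs returns a dictionary from species name to grade, which could then be used to separate congruent records from those needing review.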
20 May 2020 Submitted to Molecular Ecology Resources
05 Jun 2020 Submission Checks Completed
05 Jun 2020 Assigned to Editor
08 Jun 2020 Reviewer(s) Assigned
05 Aug 2020 Review(s) Completed, Editorial Evaluation Pending
12 Aug 2020 Editorial Decision: Revise Minor
01 Sep 2020 Review(s) Completed, Editorial Evaluation Pending
01 Sep 2020 1st Revision Received
07 Sep 2020 Editorial Decision: Accept
30 Sep 2020 Published in Molecular Ecology Resources. 10.1111/1755-0998.13262