loading page

CRABS -- A software program to generate curated reference databases for metabarcoding sequencing data
  • +3
  • Gert-Jan Jeunen,
  • Eddy Dowle,
  • Jonika Edgecombe,
  • Ulla von Ammon,
  • Neil Gemmell,
  • Hugh Cross
Gert-Jan Jeunen
University of Otago

Corresponding Author:[email protected]

Author Profile
Eddy Dowle
University of Colorado
Author Profile
Jonika Edgecombe
University of Otago
Author Profile
Ulla von Ammon
Cawthron Institute
Author Profile
Neil Gemmell
University of Otago
Author Profile
Hugh Cross
University of Otago
Author Profile

Abstract

The measurement of biodiversity is an integral aspect of life science research. With the establishment of second- and third-generation sequencing technologies, an increasing amount of metabarcoding data is being generated as we seek to describe the extent and patterns of biodiversity in multiple contexts. The reliability and accuracy of taxonomically assigning metabarcoding sequencing data has been shown to be critically influenced by the quality and completeness of reference databases. Custom, curated, eukaryotic reference databases, however, are scarce, as are the software programs for generating them. Here, we present CRABS (Creating Reference databases for Amplicon-Based Sequencing), a software package to create custom reference databases for metabarcoding studies. CRABS includes tools to download sequences from multiple online repositories (i.e., NCBI, BOLD, EMBL, MitoFish), retrieve amplicon regions through in silico PCR analysis and pairwise global alignments, curate the database through multiple filtering parameters (e.g., dereplication, sequence length, sequence quality, unresolved taxonomy), export the reference database in multiple formats for the immediate use in taxonomy assignment software, and investigate the reference database through implemented visualizations for diversity, primer efficiency, reference sequence length, and taxonomic resolution. CRABS is a versatile tool for generating curated reference databases of user-specified genetic markers to aid taxonomy assignment from metabarcoding sequencing data. CRABS is available for download as a conda package and via GitHub (https://github.com/gjeunen/reference_database_creator).
24 May 2022Submitted to Molecular Ecology Resources
01 Jun 2022Submission Checks Completed
01 Jun 2022Assigned to Editor
09 Jun 2022Reviewer(s) Assigned
21 Jul 2022Review(s) Completed, Editorial Evaluation Pending
23 Sep 2022Editorial Decision: Revise Minor
31 Oct 2022Review(s) Completed, Editorial Evaluation Pending
31 Oct 20221st Revision Received
15 Nov 2022Editorial Decision: Accept
Apr 2023Published in Molecular Ecology Resources volume 23 issue 3 on pages 725-738. 10.1111/1755-0998.13741