Abstract
Over the last two decades, there has been a huge increase in our
understanding of microbial diversity, structure and composition enabled
by high throughput sequencing (HTS) technologies. Yet, it is unclear how
the number of sequences translates to the number of cells or species
within the community. Additional observational data may be required to
ensure relative abundance patterns from sequence reads are biologically
meaningful or presence absence data may be used instead of abundance.
The goal is to obtain robust community abundance data, simultaneously,
from environmental samples. In this issue of Molecular Ecology
Resources, Karlusich et al., (2022) describe a new method for
quantifying phytoplankton cell abundance. Using Tara Oceans datasets,
the authors propose the photosynthetic gene psbO for reporting accurate
relative abundance of the entire phytoplankton community from
metagenomic data. The authors demonstrate improved correlations with
traditional optical methods including microscopy and flow cytometry,
improving upon current molecular identification typically using rRNA
markers genes. Furthermore, to facilitate application of their approach,
the authors curated a psbO gene database for accessible taxonomic
queries. This is an important step towards improving species abundance
estimates from molecular data and eventually reporting of absolute
species abundance, enhancing our understanding of community dynamics.