References
Bian, X., B. Zhu, M. Wang, Y. Hu, Q. Chen, C. Nguyen, B. Hicks, and D.
Meerzaman 2018 Comparing the performance of selected variant callers
using synthetic data and genome segmentation. BMC bioinformatics,
19:429.
Chen, J., X. Li, H. Zhong, Y. Meng, and H. Du 2019 Systematic comparison
of germline variant calling pipelines cross multiple next-generation
sequencers. Scientific reports, 9:9345.
Chen, S., Y. Zhou, Y. Chen, and J. Gu, 2018 fastp: an ultra-fast
all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890.
Cornish, A., and C. Guda 2015 A comparison of variant calling pipelines
using genome in a bottle as a reference. BioMed Research International,
2015:456479.
Danecek, P., A. Auton, G. Abecasis, C.A. Albers, E. Banks, M.A.
DePristo, R.E. Handsaker, G. Lunter, G.T. Marth, S.T. Sherry, G. McVean,
R. Durbin, and 1000 Genomes Project Analysis Group, 2011 The variant
call format and VCFtools. Bioinformatics 27:2156–2158.
De La Torre, R.A., I. Birol, J. Bousquet, P.K. Ingvarsson, S. Jansson,
S.J.M. Jones, C.I. Keeling, J. MacKay, O. Nilsson, K. Ritland, N.
Street, A. Yanchuk, P. Zerbe, and J. Bohlmann 2014 Insights into conifer
giga-genomes. Plant Physiology 166:1724–1732.
Garrison, E., and G. Marth, 2012 Haplotype-based variant detection from
short-read sequencing. arXiv arXiv:1207.3907.
Hwang, S., E. Kim, I. Lee, and E.M. Marcotte 2015 Systematic comparison
of variant calling pipelines using gold standard personal exome
variants. Scientific reports, 5:17875.
[dataset] Jasper R.J., T.K. McDonald, P. Singh, M. Lu, C. Rougeux,
B.M. Lind, and S. Yeaman 2021 Data: Evaluating the accuracy of variant
calling methods using the frequency of parent- offspring genotype
mismatch. Sequence Read Archive of the National Center for Biotechnology
Information, BioProject ID: PRJNA764196.
Koboldt, D., Q. Zhang, D. Larson, D. Shen, M. McLellan, L. Lin, C.
Miller, E. Mardis, L. Ding, and R. Wilson 2012 VarScan 2: Somatic
mutation and copy number alteration discovery in cancer by exome
sequencing. Genome Research 22:568–576.
Li, H., and R. Durbin 2009 Fast and accurate short read alignment with
Burrows-Wheeler Transform. Bioinformatics 25:1754–60.
Li, H., B. Handsaker, A. Wysoker, T. Fennell, J. Ruan, N. Homer, G.
Marth, G. Abecasis, R. Durbin, and 1000 Genome Project Data Processing
Subgroup 2009 The Sequence alignment/map (SAM) format and SAMtools.
Bioinformatics 25:2078–2079.
Lind, B.M., M. Lu, D. Obreht Vidakovic, P. Singh, T. Booker, S. Aitken
and S. Yeaman 2021 Haploid, diploid, and pooled exome capture
recapitulate features of biology and paralogy in two non-model tree
species. Molecular Ecology Resources 00:1–14.
Neale, D.B., P.J. Martínez-García, A.R. De La Torre, S. Montanari and
X.-X. Wei 2017 Novel Insights into Tree Biology and Genome Evolution as
Revealed Through Genomics. Annual Review of Plant Biology 68:457–483.
Poland, J.A., and T.W. Rife 2012 Genotyping-by-sequencing for plant
breeding and genetics. The Plant Genome 5:92–102.
R Core Team 2021 R: A language and environment for statistical
computing. R Foundation for Statistical Computing, Vienna, Austria.
Sandmann, S., A.O. de Graaf, M. Karimi, B.A. van der Reijedn, E.
Hellström-Lindberg, J.H. Jansen and M. Dugas 2019 Evaluating variant
calling tools for non-matched next-generation sequencing data.
Scientific reports, 7:43169.
Scott, A.D., A.V. Zimin, D. Puiu, R. Workman, M. Britton, S. Zaman, M.
Caballero, A.C. Read, A.J. Bogdanove, E. Burns, J. Wegrzyn, W. Timp,
S.L. Salzberg and D.B. Neale 2020 A reference genome sequence for giant
sequoia. G3: Genes, Genomes, Genetics 10:3907–3919.
Shu, M., and E.V. Moran 2020 Testing pipelines for genome-wide SNP
calling from genotyping- by-sequencing (GBS) data for Pinus ponderosa.
Research Square 1–21.
Van der Auwera, G.A., and B.D. O’Connor 2020 Genomics in the cloud.
O’Reilly Media, USA.
Wang, J., N. Lu, F. Yi, and Y. Xiao 2020 Identification of transposable
elements in conifer and their potential application in breeding.
Evolutionary Bioinformatics 16:1–4.
Wegrzyn, J.L., J.D. Liechty, K.A. Stevens, L.S. Wu, C.A. Loopstra, H.A.
Vasquez-Gross, W.M. Dougherty, B.Y. Lin, J.J. Zieve, P.J.
Martínez-García, and C. Holt 2014 Unique features of the loblolly pine
(Pinus taeda L.) megagenome revealed through sequence annotation.
Genetics 196:891–909.
Yi F., J. Ling, Y. Xiao, H. Zhang, F. Ouyang, and J. Wang 2018 ConTEdb:
a comprehensive database of transposable elements in conifers. Database
2018.
Zheng, L., A.E. Baniaga, E.B. Sessa, M. Scascitelli, S.W. Graham, L.H.
Rieseberg, and M.S. Barker 2015 Early genome duplications in conifers
and other seed plants. Science advances 1:e1501084.
Zimin, A., K.A. Stevens, M.W. Crepeau, D. Puiu, J.L. Wegrzyn, J.A.
Yorke, C.H. Langley, D.B. Neale, and S.L. Salzberg 2017 An improved
assembly of the loblolly pine mega-genome using long-read
single-molecule sequencing. Gigascience 6:1–4.