The Promise and Pitfalls of Machine Learning in Ocean Remote Sensing
- Patrick Clifton Gray,
- Emmanuel Boss,
- J Xavier Prochaska,
- Hannah Kerner,
- Charlotte Begouen Demeaux,
- Yoav Lehahn
Patrick Clifton Gray
School of Marine Sciences, University of Maine, Department of Marine Geosciences, Charney School of Marine Sciences, University of Haifa
Corresponding Author:[email protected]
Author ProfileEmmanuel Boss
School of Marine Sciences, University of Maine
J Xavier Prochaska
Department of Ocean Sciences, University of California at Santa Cruz
Hannah Kerner
School of Computing and Augmented Intelligence, Arizona State University
Charlotte Begouen Demeaux
School of Marine Sciences, University of Maine
Yoav Lehahn
Department of Marine Geosciences, Charney School of Marine Sciences, University of Haifa
Abstract
The proliferation of easily accessible machine learning algorithms and their apparent successes at inference and classification in computer vision and the sciences has motivated their increased adoption in ocean remote sensing. Our field, however, runs the risk of developing these models on limited training datasets-with sparse geographical and temporal sampling or ignoring the real data dimensionality-and thereby constructing over-fitted or non-generalized algorithms. These models may perform poorly in new regimes or on new, anomalous phenomena that emerge in a changing climate. We highlight these issues and strategies for mitigating them, share a few heuristics to help users develop intuition for machine learning methods, and provide a vision for areas we believe are underexplored at the intersection of machine learning and ocean remote sensing. The ocean is a complex physical-biogeochemical system that we cannot mechanistically model well despite our best efforts. ML has the potential to play an important role in improved process understanding, but we must always ask what we are learning after the model has learned.09 Jun 2024Submitted to ESS Open Archive 10 Jun 2024Published in ESS Open Archive