loading page

A Little Data goes a Long Way: Automating Seismic Phase Arrival Picking at Nabro Volcano with Transfer Learning
  • +3
  • Sacha Lapins,
  • Berhe Goitom,
  • J.-Michael Kendall,
  • Maximilian J. Werner,
  • Katharine V. Cashman,
  • James Hammond
Sacha Lapins
University of Bristol, University of Bristol

Corresponding Author:[email protected]

Author Profile
Berhe Goitom
University of Bristol, University of Bristol
Author Profile
J.-Michael Kendall
University of Oxford, University of Oxford
Author Profile
Maximilian J. Werner
University of Bristol, University of Bristol
Author Profile
Katharine V. Cashman
University of Bristol, University of Bristol
Author Profile
James Hammond
Birkbeck, University of London, Birkbeck, University of London
Author Profile

Abstract

Supervised deep learning models have become a popular choice for seismic phase arrival detection. However, they don’t always perform well on out-of-distribution data and require large training sets to aid generalization and prevent overfitting. This can present issues when using these models in new monitoring settings. In this work, we develop a deep learning model for automating phase arrival detection at Nabro volcano using a limited amount of training data (2498 event waveforms recorded over 35 days) through a process known as transfer learning. We use the feature extraction layers of an existing, extensively-trained seismic phase picking model to form the base of a new all-convolutional model, which we call U-GPD. We demonstrate that transfer learning reduces overfitting and model error relative to training the same model from scratch, particularly for small training sets (e.g., 500 waveforms). The new U-GPD model achieves greater classification accuracy and smaller arrival time residuals than off-the-shelf applications of two existing, extensively-trained baseline models for a test set of 800 event and noise waveforms from Nabro volcano. When applied to 14 months of continuous Nabro data, the new U-GPD model detects 31,387 events with at least four P-wave arrivals and one S-wave arrival, which is more than the original base model (26,808 events) and our existing manual catalogue (2,926 events), with smaller location errors. The new model is also more efficient when applied as a sliding window, processing 14 months of data from 7 stations in less than 4 hours on a single GPU.
Jul 2021Published in Journal of Geophysical Research: Solid Earth volume 126 issue 7. 10.1029/2021JB021910