E. Natasha Stavros

and 15 more

Imaging spectroscopy data is becoming more readily available from different satellite and airborne platforms. As this data becomes more prolific, there is a need for shared data tools and code for wrangling, cleaning, and analyzing it. The geospatial Imaging Spectroscopy Processing Environment on the Cloud (ImgSPEC) pioneers an on-demand science data processing platform with scalable back-end compute. It considers user experience and facilitates open science. ImgSPEC enables users to create data products in areas of interest using default workflows from registered algorithms, while also enabling users to customize scripts and workflows. ImgSPEC seamlessly interfaces with NASA Earthdata Search and tracks appropriate metadata for reproducibility when generating data products to share with others. Users can work in their preferred workspace (e.g., Rstudio, Jupyterlab, or command line) thereby facilitating use of open science software packages and collaborative coding through Git. ImgSPEC leverages existing NASA-funded information technologies such as the hybrid on-premise/cloud science data system (HySDS) and the Multi-mission Algorithm and Analysis Platform (MAAP). It also creates seamless interfaces with NASA-funded ECOSIS – a crowd-sourced spectral database, and ECOSML – a crowd-sourced model database. We demonstrate ImgSPEC on the Terrestrial Ecosystem use case processing through to foliar traits and fractional cover, thus aligning with driving thrusts for the NASA Surface Biology and Geology (SBG) Science and Applications Communities. As this technology is more widely adopted the interface with Amazon Web Services and NASA Earthdata search will enable broader use of more data (publicly available or loaded by the user) across more domains.

E. Natasha Stavros

and 9 more

The geospatial Imaging Spectroscopy Processing Environment on the Cloud (ImgSPEC; formerly GeoSPEC) pioneers an on-demand science data processing system (SDPS) producing user-customized Level 1 calibrated radiance to Level 3+ data products in anticipation for the 2017-2027 Earth Decadal Survey prioritized spaceborne global imaging spectrometer to advance the study of Surface Biology and Geology (SBG). SBG data volumes (~20 TB/day) of high dimensionality (>224 bands) would be infeasible to download and the breadth of applications of the data across dozens of disciplines presents a need to evolve the traditional NASA SDPS. ImgSPEC streamlines processing data into key SBG observables that have demonstrated algorithms at local-to-regional scales and may vary locally. As such, a traditional, monolithic SDPS could not fully exploit the information in SBG measurements. To remove this barrier to use, ImgSPEC demonstrates an on-demand SDPS prototype that improves imaging spectroscopy data discovery, access, and utility enabling shared knowledge transfer from advanced imaging spectroscopy users to less experienced users such as decision makers and the general public. We test three use cases: 1) standard data processing workflows, 2) customized variants of standard workflows, and 3) algorithm development of new workflows. We create collaborative algorithm development environments that offer services typically restricted to NASA SDPSs such as data product provenance and bulk processing. We leverage existing NASA-funded information technologies such as the hybrid on-premise/ cloud science data system (HySDS), the Multi-mission Algorithm and Analysis Platform (MAAP), ECOSIS – a crowd-sourced spectral database, and ECOSML – a crowd-sourced model database. We demonstrate ImgSPEC on the Terrestrial Ecosystem use case processing through to foliar traits and fractional cover, thus aligning with driving thrusts for the SBG Science and Applications Communities.