James Stagge

and 5 more

There have been numerous calls to promote reproducible research. This growing awareness coincides with major advances in data/code sharing technologies. Yet authors, journals, institutions, and funders still need to act to advance more reproducible research. Here, we suggest to view reproducibility as a continuum that includes the 1) availability of data, models, code, and directions to use the digital artifacts, 2) replication of results, and 3) reproducibility of findings. We present a simple survey tool to assess where a peer-reviewed journal article lies on the continuum. We use the tool to assess 360 random sampled articles of the 1,989 articles published in 2017 in six well-regarded hydrology and water resources journals. 49% of sampled articles had some materials available online, but just 5.6% made available all the data, models, code, and directions. For 1.6% of articles, we generated results that replicated some or all of the published results. Assessments took 5 to 14 minutes per article to determine the availability of digital artifacts and 25 to 86 minutes to replicate results (25-75% range). The availability of data, models, code, and directions differed by journal and journal policy towards data availability. From the 360 article sample, we estimate that 0.6% to 6.8% of all articles published in the six journals in 2017 can be replicated using their published artifacts (95% confidence interval). These results suggest several practices to improve the reproducibility of published research. First, authors should provide directions to use their data, models, and code in addition to the digital artifacts. Second, on author submission, journals should use a tool like ours to assess the submission’s position on the reproducibility continuum. Third, journals should formulate policies that require authors to state the intended reproducibility of their work and place relevant information in an easy-to-find article location. Fourth, journals, institutions, and funders should highlight work whose digital artifacts, results, and findings are available, replicable, and reproducible.