Abstract
In July 2021, Germany experienced its costliest riverine floods in
history, with over 189 fatalities and a staggering \euro33 billion in
damages. Following this event, news outlets widely disseminated
information on the flood’s aftermath. Here, we demonstrate how newspaper
data can be instrumental in the assessment of flood socioeconomic
impacts often overlooked by conventional methods. Using natural language
processing tools on 14,888 unique newspaper articles, we estimate the
impacts of the 2021 flood on various sectors and critical
infrastructure, including water contamination, mental health, and
tourism. Our results revealed severe and lasting impacts in the Ahr
Valley, even months after the event. At the same time, we identified
smaller-scale yet widespread impacts across Germany, which are typically
overlooked by existing impact databases. Our approach advances current
research by systematically examining indirect and intangible flood
impacts over large areas. This underscores the value of leveraging
complementary text data to provide a more comprehensive picture of flood
impacts.