Research report 2021 - Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen (GWDG)
Leveraging data lakes for managing and processing of large data volumes
Nolte, Hendrik; Kasprzak, Piotr; Kunkel, Julian; Wieder, Philipp
Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen (GWDG) – eScience Working Group and Computing Working Group
Summary
Data lakes have grown steadily in popularity for years, and more and more institutions use them as central storage for all accumulating data, especially unstructured data. A unique selling point of a data lake is that data is stored in its raw format. This is intended to prevent information loss and thus increase the reusability of the data. Furthermore, a data lake at the GWDG provides means for data management and can thus also support researchers in complying with good scientific practice.