Research report 2021 - Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen (GWDG)

Leveraging data lakes for managing and processing of large data volumes 

Authors
Nolte, Hendrik; Kasprzak, Piotr; Kunkel, Julian; Wieder, Philipp
 
Departments
Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen (GWDG) – Arbeitsgruppe eScience und Arbeitsgruppe Computing
Summary
Data lakes have experienced increasing popularity for years and are being used in more and more institutions as central storage for all accumulating data, especially unstructured data. An unique selling point of a data lake is that data is stored in the respective raw format. This is intended to prevent information loss and thus increase the reusability of the data. Furthermore, a data lake at the GWDG provides means for data management and, thus, can also support researchers in complying with good scientific practice.
 

For the full text, see the German version.

Go to Editor View