Data Lakes

Bibliographic Details
Main Authors: Laurent, Anne (Author), Laurent, Dominique (Author), Madera, Cédrine (Author)
Corporate Author: Safari, an O'Reilly Media Company
Format: eBook
Language: English
Published: Wiley-ISTE, 2020.
Edition: 1st edition.
Description
Summary: The concept of a data lake is less than 10 years old, but data lakes are already widely implemented within large companies. Their goal is to deal efficiently with ever-growing volumes of heterogeneous data while also meeting various sophisticated user needs. However, defining and building a data lake is still a challenge, as no consensus has been reached so far. Data Lakes presents recent outcomes and trends in the field of data repositories. The main topics discussed are the data-driven architecture of a data lake; the management of metadata, which supplies key information about the stored data, master data and reference data; the roles of linked data and fog computing in a data lake ecosystem; and how gravity principles apply in the context of data lakes. A variety of case studies are also presented, providing the reader with practical examples of data lake management.
Item Description: Electronic resource.
Physical Description: 1 online resource (244 pages)
Format: Mode of access: World Wide Web.