Wednesday, 5 February 2020

What is a Lakehouse?

A new management paradigm has emerged that combines data lakes and data warehouses. Lakehouses are similar to data warehouses with structures and data management features. This is backed with the low cost storage that is used for data lakes.

A lakehouse has some key attributes:

  • Transaction support
  • Schema enforcement and governance
  • BI support
  • Storage is decoupled from compute
  • Openness
  • Support for diverse data types ranging from unstructured to structured data
  • Support for diverse workloads
  • End-to-end streamingThere is a great article to read which covers this in more depth The Data Lakehouse – Dismantling the Hype

No comments:

Post a Comment

Note: only a member of this blog may post a comment.