Lakehouse is the brand name for the underlying architecture of Databricks' Delta Lake: A data lake that is as performant as a data warehouse.
This is also a video-heavy edition, I keep chugging along my watch list with Glancer
Hive is arguably old. It is also undoubtedly useful, even now: 10 years after it was introduced.
The video edition
As promised, the numbering of these posts is now year-indexed.
Managing logging in Spark ain’t easy, and is even harder in managed clouds like Databricks or EMR.