Delta Lake builds on standard data formats. A Delta Lake table is stored as one or more data files in Apache Parquet format, together with a transaction log made up of JSON files.
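To make the layout concrete, here is a minimal Python sketch of how the JSON transaction log determines which Parquet files currently belong to a table. It assumes the open-source layout, in which commits live in a `_delta_log` directory as zero-padded, version-numbered `.json` files and each line of a commit is one JSON action such as `add` or `remove`. The function name `active_data_files` is hypothetical, and the sketch ignores checkpoints, metadata, and protocol actions, so treat it as an illustration rather than a replacement for a real Delta Lake reader.

```python
import json
from pathlib import Path


def active_data_files(table_path):
    """Replay the JSON commits under _delta_log and return the live Parquet files.

    Hypothetical helper for illustration only: it handles just the `add` and
    `remove` actions and skips checkpoint files entirely.
    """
    log_dir = Path(table_path) / "_delta_log"
    live = set()
    # Commit files are named by zero-padded version number, so a lexical
    # sort of the file names is also the commit order.
    for commit in sorted(log_dir.glob("*.json")):
        for line in commit.read_text().splitlines():
            if not line.strip():
                continue
            action = json.loads(line)  # one JSON action per line
            if "add" in action:
                live.add(action["add"]["path"])
            elif "remove" in action:
                live.discard(action["remove"]["path"])
    return sorted(live)
```

Calling `active_data_files` on a table directory returns the relative paths of the Parquet files that are still part of the latest table version. Files that a later commit removed may still be present on storage, but the log no longer references them.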
References
- Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics
- Michael Armbrust (@databricks)
- Ali Ghodsi (@databricks, @uc berkeley)
- Reynold Xin (@databricks)
- Matei Zaharia (@databricks, @stanford)
- Michael Armbrust: Boston Spark Meetup @ Wayfair / Delta Lake: Open Source Reliability and Quality for Data Lakes
- Delta Lake Inside
- YouTube: Understanding Delta File Logs - The Heart of the Delta Lake