Tips and tricks of Jupyter Notebook  [draft]

Reference 28 Jupyter Notebook tips, tricks, and shortcuts Nazif Berat: Boosting Your Jupyter Notebook Productivity

September 29, 2018 · 1 min · 15 words · Eric

All about Apache Parquet  [draft]

Converting to Parquet https://stackoverflow.com/questions/45043554/how-to-read-a-list-of-parquet-files-from-s3-as-a-pandas-dataframe-using-pyarrow Reference Twitter: Dremel Made Simple with Parquet Databricks: The Parquet Format and Performance Optimization Opportunities Boudewijn Braams

July 16, 2018 · 1 min · 21 words · Eric

Airflow in Practice - Interactive with Airflow Internal Storage

Problem Definition One typical Airflow usage scenario is to continuously execute some workflow with regular base, and the output data of last iteration will be the input data for the next iteration. One way we can do that is to keep your output data as a local file or store that into database table, and read and update those data in every iteration. However, with those solutions you need to manual handle database connections and that is not convenient sometime....

June 11, 2018 · 2 min · 332 words · Eric

Airflow Concept

What is Airflow? Airflow is a platform to programmatically author, schedule and monitor your workflows and pipelines. What are the benefits for using Airflow? Programmatically author workflow In Airflow, you can define your workflow programmatically with Python scripts and that would put you in a very good position by leveraging all the convenience and sweet that Python provide. This is a huge improvement if you experienced with Oozie or other GUI-typed (or even without a GUI) scheduling tools....

June 1, 2018 · 2 min · 286 words · Eric

Data Warehouse Design - Inmon vs Kimball  [draft]

Inmon Framework (Enterprise Data Warehouse) A centralized data repository that Kimball Framework (Dimensional Data Warehouse) Reference Data Warehouse Architecture Comparison: Kimball and Inmon dbt: Kimball in the Context of Morden Data Warehouse

March 10, 2016 · 1 min · 32 words · Eric