AWS Storage Comparison

Block Storage vs Object Storage Block Storage: block level operations are possible one block changed, e.g piece of the file, that contains the changed data can be updated in block level Object Storage: Entire file must be removed and new file needs to be put there S3 vs EBS vs EFS S3: S3 to be used for WORM operations, e.g. Write once Read many times Scalable, size not be planned not suitable for hosting OS or Database EBS:...

November 3, 2019 · 1 min · 147 words · Eric

First 90 Days

Why Do We Fail in First 90 Days? From Employee perspective, there are several aspects that lead to a unsuccessful start Job vs expectation. Relationship with boss. Lack of training. While from Company side, they have a different view on that: Poor performance. Poor punctuality. not able to be on time. Absenteeism. Not even be able to show up. First 90 days into a new role is not just the paperwork and basic induction, like training and meeting and greets....

October 26, 2019 · 6 min · 1201 words · Eric

Apache Kafka Knowledge Quick Reference

Kafka Theory Cluster Rack Broker Every broker in Kafka is a “bootstrap server” which knows about all brokers, topics and partitions (metadata) that means Kafka client (e.g. producer, consumer etc) only need to connect to one broker in order to connect to entire cluster. At all times, only one broker should be the controller, and one broker must always be the controller in the cluster Topic Kafka takes bytes as input without even loading them into memory (that’s called zero copy) Brokers have defaults for all the topic configuration parameters Partition Topic can have one or more partition....

August 17, 2019 · 10 min · 2099 words · Eric

Resolve VT-X not enabled on Windows 10

Using VirtualBox on Windows 10 is a little bit different with other os. The first time when loading a virtual box image into VirtualBox, you may meet the VT-X is not enabled error. Solution To solve this, there are two steps you need to do. Step 1: Turn off Hyper-V Platform in Windows Feature Step 2: Open a command prompt as Administrator and run the following command 1 dism.exe /Online /Disable-Feature:Microsoft-Hyper-V After that, you can try to restart your workstation and this may take several minutes to let Windows complete all underneath setting....

May 22, 2019 · 1 min · 117 words · Eric

GitHub Projects

Interview Preparation https://github.com/jwasham/coding-interview-university https://github.com/yangshun/tech-interview-handbook https://github.com/taizilongxu/interview_python Data Engineering https://github.com/high-performance-spark/high-performance-spark-examples https://github.com/kerbelp/redshift-utils https://github.com/mozilla/telemetry-airflow https://github.com/igorbarinov/awesome-data-engineering Data Science https://github.com/shirkeyaws/sagemaker-recommender-workshop https://github.com/JifuZhao/DS-Take-Home https://github.com/jupyter/notebook Tutorial https://github.com/dask/dask-tutorial https://github.com/lopusz/gists/ https://github.com/Birne94/rds-s3-database-backup https://github.com/traviscrawford/airflow-pex-example https://github.com/awslabs/lambda-streams-to-firehose https://github.com/GoogleCloudPlatform/python-docs-samples https://github.com/kumarawsbit/awsbit https://github.com/coursera/dataduct https://github.com/mumoshu/kube-airflow https://github.com/segmentio/stack Utility Download Netease CloudMusic

April 16, 2019 · 1 min · 31 words · Eric