Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profileg
Chengzhi Zhao

@ChengzhiZhao

Data Engineer | Data Content Writer | Contributor of Airflow, Flink | Personal Blog https://t.co/4D2l15P0FX | DIYer

ID:331435012

linkhttp://chengzhizhao.com calendar_today08-07-2011 05:03:13

138 Tweets

91 Followers

116 Following

Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profile Photo

When it comes to visualizing data with histogram with multiple groups, it can be pretty challenging. A useful ggplot2 extension called ggridges has been helpful for my exploratory tasks. Here is what I learned buff.ly/3rWyGcF

R Markdown

When it comes to visualizing data with histogram with multiple groups, it can be pretty challenging. A useful ggplot2 extension called ggridges has been helpful for my exploratory tasks. Here is what I learned buff.ly/3rWyGcF #RStats @rmarkdown #DataScience
account_circle
Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profile Photo

Union operation in Spark are often discussed widely. However, a hidden fact that has been less discussed is the performance caveat associated with the union operator.



buff.ly/3OlPHWE

account_circle
Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profile Photo

Tools are great, but many data engineering problems cannot be resolved by using the newest tool but by human — Data Engineers. Here are my thoughts on 'Why data engineering is about much more than just the tools.'
science engineering
buff.ly/44LvXRh

account_circle
Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profile Photo

Slowly Changing Dimension (SCD) is critical to dimensional modeling. I will discuss the eight types of SCDs. By the end, you will clearly understand each type and can differentiate SCD types in dimensional modeling
science warehouse
buff.ly/452gVaj

account_circle
Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profile Photo

This article revisits the topic of visualizing monthly expenses comprehensively, updating the original tooling to be more interactive and user-friendly by continuing development in R.buff.ly/3D16W8T engineering viz

account_circle
Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profile Photo

Sometimes writing SQL can be frustrating, especially when encountering NULL values. I hope this article can help you better understand these tricky NULL in SQL.

science base

buff.ly/43gNDmX

account_circle
Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profile Photo

The data engineering space is evolving. Here are the resources I collected for practical data engineering resources.

science engineering

buff.ly/44fQLAC

account_circle
Chengzhi Zhao(@ChengzhiZhao) 's Twitter Profile Photo

“Why my Spark job is running slow?” is an inevitable question while working with Apache Spark. One of the most common scenarios regarding Apache Spark performance tuning is data skew.

science engineering

buff.ly/3NJhcYm

account_circle