Chengzhi Zhao
@ChengzhiZhao
Data Engineer | Data Content Writer | Contributor of Airflow, Flink | Personal Blog https://t.co/4D2l15P0FX | DIYer
ID:331435012
http://chengzhizhao.com 08-07-2011 05:03:13
138 Tweets
91 Followers
116 Following
Visualizing data as histograms across multiple groups can be pretty challenging. A useful ggplot2 extension called ggridges has been helpful for my exploratory tasks. Here is what I learned: buff.ly/3rWyGcF
#RStats #RMarkdown #DataScience
Union operations in Spark are widely discussed. A less-discussed fact, however, is the performance caveat associated with the union operator.
#DataEngineering #ApacheSpark #ETL #DataPipeline #DataScience #DataAnalytics
buff.ly/3OlPHWE
Discover the Essential Reading List for Data Engineers: 10 Classic Books You Can’t Miss.
#DataEngineering #BigData #ETL #DataPipeline #DataWarehouse #DataArchitecture #Hadoop #DataScience #DataAnalytics #MachineLearning
buff.ly/3OkA95t
The data engineering space is evolving. Here are the practical data engineering resources I have collected.
#DataEngineering #BigData #ETL #DataPipeline #DataWarehouse #DataArchitecture #Hadoop #DataScience #DataAnalytics #MachineLearning
buff.ly/44fQLAC
Unlocking the Secrets of Slowly Changing Dimension (SCD): A Comprehensive View of 8 Types by Chengzhi Zhao buff.ly/3OjzmBN