
vutrinh
@_vutrinh
My mom read my articles to support her son. Now, she can design a data architecture and write ETL scripts.
ID: 1638587225146560512
https://vutr.substack.com 22-03-2023 17:03:50
48 Tweet
111 Followers
215 Following


🚀🚀 How does the Apache Iceberg reading process look like? ◉ The reader first visits the catalog to retrieve the table's current metadata file location. ◉ After fetching the metadata file, it collects the table’s schema and checks partition schemes to understand the data


🚀🚀 How does the Apache Spark plan the execution for us? (With the help of Catalyst Optimizer) When defining DataFrame transformation logic, it must first go through an optimized process before execution. This involves four key phases: ◉ Analysis: Spark SQL starts by






