Shubham Baldava (@cto_datazip) 's Twitter Profile
Shubham Baldava

@cto_datazip

ID: 1793585364231147520

calendar_today23-05-2024 10:11:01

6 Tweet

7 Followers

9 Following

OLake by Datazip (@_olake) 's Twitter Profile Photo

Last weekend, OLake got a stage at the Edge Forum Meetup! The whiteboarding session by Shubham-Shubham Baldava showed how OLake streamlines data ingestion into Apache Iceberg faster, more consistent, and easier for engineers and data scientists. Big thanks to the host Contentstack

Last weekend, OLake got a stage at the Edge Forum Meetup!
The whiteboarding session by Shubham-<a href="/cto_datazip/">Shubham Baldava</a>  showed how OLake streamlines data ingestion into Apache Iceberg faster, more consistent, and easier for engineers and data scientists.
Big thanks to the host <a href="/Contentstack/">Contentstack</a>
OLake by Datazip (@_olake) 's Twitter Profile Photo

Watch Shubham Baldava at the Apache Iceberg NYC Meetup answer a key question: 👉 How does OLake handle write conflicts when multiple workers are running in parallel? #OLake #ApacheIceberg #DataEngineering #ParallelProcessing

OLake by Datazip (@_olake) 's Twitter Profile Photo

Databricks Delta Lake Apache Hive Now that we’ve laid out the problems, let’s dive into the approach as presented by Shubham Baldava OLake processes Apache Iceberg as its exclusive offering because of these capabilities while solving these challenges end-to-end. How it works is described here 👇

Akshay Sharma (@cappybaradeploy) 's Twitter Profile Photo

yet another great week at bangalore Attended the Apache Kafka Meetup Bangalore and explored how companies are scaling with Kafka in 2025. One highlight was Nutanix Inc. showcasing a resequencer using Kafka Streams with RocksDB to handle out-of-order events a really smart

yet another great week at bangalore

Attended the <a href="/apachekafka/">Apache Kafka</a>  Meetup Bangalore and explored how companies are scaling with Kafka in 2025.

One highlight was <a href="/nutanix/">Nutanix Inc.</a>  showcasing a resequencer using Kafka Streams with RocksDB to handle out-of-order events  a really smart
OLake by Datazip (@_olake) 's Twitter Profile Photo

Optimizing Queries in Apache Iceberg As data grows, queries slow down. Sorting helps, but not always with multi-column filters. That’s where Z-ordering comes in it clusters data, reduces file scans, and speeds up queries. The key is knowing when to apply it. 🧵 A thread

Optimizing Queries in Apache Iceberg
As data grows, queries slow down. Sorting helps, but not always with multi-column filters.

That’s where Z-ordering comes in  it clusters data, reduces file scans, and speeds up queries.

The key is knowing when to apply it.

🧵 A thread
OLake by Datazip (@_olake) 's Twitter Profile Photo

Only 1 day to go! 🎉 Join us on Aug 28, 8:00 A.M PT for a hands-on deep dive into ClickHouse + Apache Iceberg ✅ Native writes & compaction ✅ Migration & syncing ✅ Time travel & schema evolution Don’t miss the demos, real-world challenges, and live Q&A. Link in comments!

Only 1 day to go! 🎉

Join us on Aug 28, 8:00 A.M PT for a hands-on deep dive into <a href="/ClickHouseDB/">ClickHouse</a>  + <a href="/ApacheIceberg/">Apache Iceberg</a>
✅ Native writes &amp; compaction
✅ Migration &amp; syncing
✅ Time travel &amp; schema evolution
Don’t miss the demos, real-world challenges, and live Q&amp;A.
Link in comments!
OLake by Datazip (@_olake) 's Twitter Profile Photo

In our recent session with Arsham (co-founder , greybeam), we dove deep into the evolving Apache Iceberg catalog ecosystem. One highlight: Apache Polaris Here’s what it is — and why it matters 👇 #DataEngineeringStudy

OLake by Datazip (@_olake) 's Twitter Profile Photo

We’re coming to Mumbai for AWS Community Day AWS User Group Mumbai 🇮🇳 on Oct 11! We’re proud to be the sponsor of the Community Day—great to be part of the global stage. We’re excited to connect and share why we’re the fastest data replication tool in the world for Apache Iceberg

OLake by Datazip (@_olake) 's Twitter Profile Photo

Wrapped FOSS United 2025, BLR. We showed how OLake makes ingestion to Iceberg 5–500× faster, walked through the stack, and met future contributors. Community > everything. #OpenSource #DataEngineering #ApacheIceberg

Wrapped <a href="/FOSSUnited/">FOSS United</a>  2025, BLR.
We showed how OLake makes ingestion to Iceberg 5–500× faster, walked through the stack, and met future contributors. 
Community &gt; everything.
#OpenSource #DataEngineering #ApacheIceberg
OLake by Datazip (@_olake) 's Twitter Profile Photo

Join us at Bengaluru Streams and Lakehouse Days on Sept 27, 2025, at Accel Launchpad! Our Co-founder & CTO, Shubham Baldava Shubham Baldava , will present on Reimagining Ingestion for Apache Iceberg sharing insights from building OLake, our open-source high-performance tool.

Join us at Bengaluru Streams and Lakehouse Days on Sept 27, 2025, at <a href="/Accel/">Accel</a>  Launchpad!

Our Co-founder &amp; CTO, Shubham Baldava <a href="/cto_datazip/">Shubham Baldava</a> , will present on Reimagining Ingestion for <a href="/ApacheIceberg/">Apache Iceberg</a>  sharing insights from building OLake, our open-source high-performance tool.
OLake by Datazip (@_olake) 's Twitter Profile Photo

We’ve contributed an important feature to Apache Iceberg Go, writing into partitioned tables! The new design uses a fan-out strategy with rolling writers per partition, flushes parquet files efficiently, and supports all partition transforms built on top of ApacheArrow .

We’ve contributed an important feature to <a href="/ApacheIceberg/">Apache Iceberg</a>  Go,  writing into partitioned tables! 

The new design uses a fan-out strategy with rolling writers per partition, flushes parquet files efficiently, and supports all partition transforms  built on top of <a href="/ApacheArrow/">ApacheArrow</a> .