Kyle Weller (@kylejweller) 's Twitter Profile
Kyle Weller

@kylejweller

0 to 1 Builder of data platforms and data products. Lately you can find me at the lake chilling with Apache Hudi, Apache Iceberg, and Delta Lake

ID: 331177052

linkhttps://www.linkedin.com/in/lakehouse/ calendar_today07-07-2011 19:25:31

538 Tweet

577 Followers

497 Following

Onehouse (@onehousehq) 's Twitter Profile Photo

One of the great things about the universal #datalakehouse compared with traditional data architectures is that it enables you to bring any compute engine, so you can always choose the right engine for the job. Join Kyle Weller and Andy Walner later this month to learn why

Onehouse (@onehousehq) 's Twitter Profile Photo

We bring the #datalakehouse. You bring your data and compute engine. It's that simple. ๐Ÿ‘‰onehouse.ai/webinar/your-dโ€ฆ #dataengineering #dataarchitecture

Onehouse (@onehousehq) 's Twitter Profile Photo

๐Ÿšจ Announcing Open Enginesโ„ข, a quick + reliable way to deploy Trino, ray, and Apache Flink making it easy to choose the right engine for analytics, streaming, or ML/DS. Read the details๐Ÿ‘‰ onehouse.ai/blog/announcinโ€ฆ

Kyle Weller (@kylejweller) 's Twitter Profile Photo

๐Ÿš€ Today we are introducing Open Enginesโ„ข, 1-click infrastructure to rapidly deploy open source engines like Trino, Ray, and Flink. It has never been easier to match your open data to the right engine for analytics, streaming, and ML/DS use cases. onehouse.ai/blog/announcinโ€ฆ

Vinoth Chandar (@byte_array) 's Twitter Profile Photo

Last week, we announced "OpenEngines" to flip the way data platforms are built, from being engine-centric to being data-centric. Other people have said that better. Check out this well-researched InfoWorld article. infoworld.com/article/396495โ€ฆ ๐Ÿคบ "breaks the Siamese connection

Onehouse (@onehousehq) 's Twitter Profile Photo

๐Ÿ”ฅ Announcing OpenXData - the free virtual conference on open data ๐Ÿ”ฅ OpenXData brings together 25+ sessions by data innovators and thought leaders from companies like Meta, Netflix, Salesforce, Peloton, and more, to share best practices and the latest trends in the world

๐Ÿ”ฅ Announcing OpenXData - the free virtual conference on open data ๐Ÿ”ฅ

OpenXData brings together 25+ sessions by data innovators and thought leaders from companies like <a href="/Meta/">Meta</a>, <a href="/netflix/">Netflix</a>, <a href="/salesforce/">Salesforce</a>, <a href="/onepeloton/">Peloton</a>, and more, to share best practices and the latest trends in the world
Shiyan Xu (@_xushiyan) 's Twitter Profile Photo

I had the privilege of presenting Hudi-rs at VeloxCon 2025 - the native Rust implementation of Apache Hudi that's opening new doors for cross-language data processing. ๐Ÿ”จ By rebuilding Hudi's core in Rust, we've created a foundation that supports multiple language bindings

I had the privilege of presenting Hudi-rs at VeloxCon 2025 - the native Rust implementation of <a href="/apachehudi/">Apache Hudi</a>  that's opening new doors for cross-language data processing.

๐Ÿ”จ By rebuilding Hudi's core in Rust, we've created a foundation that supports multiple language bindings
Onehouse (@onehousehq) 's Twitter Profile Photo

๐Ÿ•’ Ever wanted to spin up a #datalakehouse but couldn't find the time? โšก Let Chandra Krishnan, Solutions Engineer at Onehouse,ย show you how quickly it can be doneโ€”from spinning up a fresh data source, building pipelines, adding transformations, integrating catalogs, all the way

๐Ÿ•’ Ever wanted to spin up a #datalakehouse but couldn't find the time?

โšก Let Chandra Krishnan, Solutions Engineer at Onehouse,ย show you how quickly it can be doneโ€”from spinning up a fresh data source, building pipelines, adding transformations, integrating catalogs, all the way
Vinoth Chandar (@byte_array) 's Twitter Profile Photo

๐Ÿ’ฐ โ‰๏ธWhat does running ETL on your cloud data platform cost you? ๐Ÿ“ˆ ๐Ÿ’ฒ Short answer: very likely that itโ€™s more than you think. Across all the performance-critical data systems Iโ€™ve worked on, the one thing that bugged me the most is: how poorly we benchmark Warehouse/Lakehouse

๐Ÿ’ฐ โ‰๏ธWhat does running ETL on your cloud data platform cost you?

๐Ÿ“ˆ ๐Ÿ’ฒ Short answer: very likely that itโ€™s more than you think.

Across all the performance-critical data systems Iโ€™ve worked on, the one thing that bugged me the most is: how poorly we benchmark Warehouse/Lakehouse
Onehouse (@onehousehq) 's Twitter Profile Photo

Today we announce SQL and Spark jobs powered by our new Quanton execution engine ๐Ÿš€ Quanton delivers 2-3x price/performance vs Databricks w/ photon, AWS EMR, and Snowflake. See our benchmark in the blog ๐Ÿคบ onehouse.ai/blog/announcinโ€ฆ

Vinoth Chandar (@byte_array) 's Twitter Profile Photo

๐Ÿ”ฅ Meet Quanton โ€” the new query execution engine from Onehouse. ๐Ÿ‘ Same Spark & SQL. ๐Ÿ“‰ At least half the cost. ๐Ÿ“ˆ 1.6x-3.6x better ETL price-performance ๐Ÿ“Š 2.2x-6.5x better Ingest price-performance ๐Ÿ‘‰ย  Read the full blog here: onehouse.ai/blog/announcinโ€ฆ โฌ‡๏ธย  Download our free

Onehouse (@onehousehq) 's Twitter Profile Photo

What happens when ๐˜ฎ๐˜ข๐˜ด๐˜ด๐˜ช๐˜ท๐˜ฆ ๐˜ด๐˜ต๐˜ณ๐˜ฆ๐˜ข๐˜ฎ๐˜ช๐˜ฏ๐˜จ ๐˜ธ๐˜ฐ๐˜ณ๐˜ฌ๐˜ญ๐˜ฐ๐˜ข๐˜ฅ๐˜ด meet the reality of maintaining Iceberg metadata at scale? We just dropped a deep-dive blog that pulls back the curtain on our experience managing ๐—ป๐—ฒ๐—ฎ๐—ฟ-๐—ฟ๐—ฒ๐—ฎ๐—น-๐˜๐—ถ๐—บ๐—ฒ ๐—ถ๐—ป๐—ด๐—ฒ๐˜€๐˜๐—ถ๐—ผ๐—ป

What happens when ๐˜ฎ๐˜ข๐˜ด๐˜ด๐˜ช๐˜ท๐˜ฆ ๐˜ด๐˜ต๐˜ณ๐˜ฆ๐˜ข๐˜ฎ๐˜ช๐˜ฏ๐˜จ ๐˜ธ๐˜ฐ๐˜ณ๐˜ฌ๐˜ญ๐˜ฐ๐˜ข๐˜ฅ๐˜ด meet the reality of maintaining Iceberg metadata at scale?

We just dropped a deep-dive blog that pulls back the curtain on our experience managing ๐—ป๐—ฒ๐—ฎ๐—ฟ-๐—ฟ๐—ฒ๐—ฎ๐—น-๐˜๐—ถ๐—บ๐—ฒ ๐—ถ๐—ป๐—ด๐—ฒ๐˜€๐˜๐—ถ๐—ผ๐—ป
Onehouse (@onehousehq) 's Twitter Profile Photo

It may be the season for SNOW, but this talk was hot! It was standing room only for Kyle Weller's presentation on building an Apache Iceberg #datalakehouse for Snowflake! Have more questions? Swing by booth 1415 after the keynote. #SnowflakeSummit

It may be the season for SNOW, but this talk was hot! It was standing room only for <a href="/KyleJWeller/">Kyle Weller</a>'s  presentation on building an <a href="/ApacheIceberg/">Apache Iceberg</a> #datalakehouse for <a href="/Snowflake/">Snowflake</a>! Have more questions? Swing by booth 1415 after the keynote. #SnowflakeSummit
Onehouse (@onehousehq) 's Twitter Profile Photo

Whoโ€™s going to #databricks #DataAISummit next week? Join us on Tuesday for Onehouse VP of Product Kyle Weller sharing โ€œOpen By Default, Fast By Design: One Lakehouse That Scales From BI to AIโ€ in Expo Theater 3. The gang will be at booth E501 the rest of the time, so come by

Whoโ€™s going to #databricks #DataAISummit next week? Join us on Tuesday for <a href="/Onehousehq/">Onehouse</a> VP of Product <a href="/KyleJWeller/">Kyle Weller</a>  sharing โ€œOpen By Default, Fast By Design: One Lakehouse That Scales From BI to AIโ€ in Expo Theater 3. The gang will be at booth E501 the rest of the time, so come by
Kyle Weller (@kylejweller) 's Twitter Profile Photo

Great launches for Unity Catalog today, but what % are proprietary vs OSS? Last I checked >1/2 were locked away to proprietary Databricks version: onehouse.ai/blog/comprehenโ€ฆ IMO Databricks is making the same mistakes as Delta Lake, which seems they now are taking the knee on.

Great launches for Unity Catalog today, but what % are proprietary vs OSS?

Last I checked &gt;1/2 were locked away to proprietary Databricks version: onehouse.ai/blog/comprehenโ€ฆ

IMO Databricks is making the same mistakes as Delta Lake, which seems they now are taking the knee on.
Vinoth Chandar (@byte_array) 's Twitter Profile Photo

๐Ÿงช TPC-DI is better than TPC-DS for ETL workloads โ€” but is it good enough? Read Databricks' recent take (and Shannonโ€™s benchmark deep dive) with interest. Itโ€™s refreshing to see industry leaders debating real ETL performance, not just SQL query speeds. At Onehouse, we agree: ETL

๐Ÿงช TPC-DI is better than TPC-DS for ETL workloads โ€” but is it good enough?

Read Databricks' recent take (and Shannonโ€™s benchmark deep dive) with interest. Itโ€™s refreshing to see industry leaders debating real ETL performance, not just SQL query speeds. At Onehouse, we agree: ETL
Apache Hudi (@apachehudi) 's Twitter Profile Photo

๐Ÿฆ€ Hudi-rs 0.4.0 is released! Another step towards standardizing Apache Hudi APIs across broad ecosystem integrations using #rustlang #Python #cpp ! ๐Ÿšข github.com/apache/hudi-rsโ€ฆ

๐Ÿฆ€ Hudi-rs 0.4.0 is released! Another step towards standardizing <a href="/apachehudi/">Apache Hudi</a> APIs across broad ecosystem integrations using #rustlang #Python #cpp !
 
๐Ÿšข github.com/apache/hudi-rsโ€ฆ
Kyle Weller (@kylejweller) 's Twitter Profile Photo

S3 Tables: where โ€œfully managedโ€ means โ€“ no knobs โ€“ no metrics โ€“ JUST VIBES (and a 20x bigger bill) It compacts when it feels like it. It charges like it knows you wonโ€™t check. We ran the numbers so you donโ€™t have to: onehouse.ai/blog/s3-manageโ€ฆ

Vinoth Chandar (@byte_array) 's Twitter Profile Photo

๐Ÿšจ Think S3 Tables are a drop-in solution for Iceberg compaction at scale? Think again It promised simplicity. But in practice? Itโ€™s a broken abstraction trying to shove a database into your file system.

๐Ÿšจ Think S3 Tables are a drop-in solution for Iceberg compaction at scale?
Think again

It promised simplicity. But in practice? Itโ€™s a broken abstraction trying to shove a database into your file system.
Shiyan Xu (@_xushiyan) 's Twitter Profile Photo

๐Ÿš€ New Blog: Building a RAG-based AI Recommender (Part 1/2) ๐Ÿ‘‰ blog.datumagic.ai/p/building-a-rโ€ฆ ๐Ÿ“š What's inside: โœฆ How RAG works end-to-end (chunking โ†’ embedding โ†’ retrieval โ†’ generation) โœฆ Why 70% of AI success is actually data engineering ๐Ÿ“Š โœฆ How Apache Hudi 's incremental

๐Ÿš€ New Blog: Building a RAG-based AI Recommender (Part 1/2)

๐Ÿ‘‰ blog.datumagic.ai/p/building-a-rโ€ฆ

๐Ÿ“š What's inside: 
โœฆ How RAG works end-to-end (chunking โ†’ embedding โ†’ retrieval โ†’ generation) 
โœฆ Why 70% of AI success is actually data engineering ๐Ÿ“Š 
โœฆ How <a href="/apachehudi/">Apache Hudi</a> 's incremental