Sam Arch 🇦🇺 (@samarchdb) 's Twitter Profile
Sam Arch 🇦🇺

@samarchdb

PhD Student in Databases @CMUDB with @andy_pavlo, Previously a Compiler Engineer @apple

ID: 1685322781750161408

linkhttp://samarch.xyz calendar_today29-07-2023 16:14:24

16 Tweet

524 Followers

173 Following

Andy Pavlo (@andypavlo.bsky.social) (@andy_pavlo) 's Twitter Profile Photo

My #1 PhD student @butro successfully completed his PhD defense. Thanks to the committee (Jignesh Patel billions of packets Sam Madden). Matt's thesis is on accelerating databases with eBPF (telemetry, proxies, OLTP stores). You have 60 days to hire him. Expect fierce competition.

My #1 PhD student @butro successfully completed his PhD defense. Thanks to the committee (<a href="/pateljm/">Jignesh Patel</a> <a href="/justinesherry/">billions of packets</a> <a href="/samrmadden/">Sam Madden</a>). Matt's thesis is on accelerating databases with eBPF (telemetry, proxies, OLTP stores). 

You have 60 days to hire him. Expect fierce competition.
Abigale Kim (@abigalekim.bsky.social) (@abigale_kim) 's Twitter Profile Photo

I am excited to announce that I will start my PhD in database systems at Wisconsin DB Group (@wiscdb.bsky.social) and work with Xiangyao Yu beginning Fall 2024! I'm super grateful to everyone who has supported me along the way :)

I am excited to announce that I will start my PhD in database systems at <a href="/wiscdb/">Wisconsin DB Group (@wiscdb.bsky.social)</a> and work with <a href="/xiangyao_yu/">Xiangyao Yu</a> beginning Fall 2024! I'm super grateful to everyone who has supported me along the way :)
DuckDB (@duckdb) 's Twitter Profile Photo

We are proud to release the first major version of DuckDB, v1.0.0, codenamed "Snow Duck". This version is a culmination of almost six years of research and development. Today we are shipping an innovative database system with a backwards-compatible storage format. Check out our

We are proud to release the first major version of DuckDB, v1.0.0, codenamed "Snow Duck".

This version is a culmination of almost six years of research and development. Today we are shipping an innovative database system with a backwards-compatible storage format.

Check out our
Andy Pavlo (@andypavlo.bsky.social) (@andy_pavlo) 's Twitter Profile Photo

It took three years to finish, but our follow-up to the 2006 "What Goes Around Comes Around" is finally out! Stonebraker and I examine the last 20 years in databases and discuss why relational databases + SQL will continue to remain on top. 📄PDF: db.cs.cmu.edu/papers/2024/wh…

It took three years to finish, but our follow-up to the 2006 "What Goes Around Comes Around" is finally out! Stonebraker and I examine the last 20 years in databases and discuss why relational databases + SQL will continue to remain on top.

📄PDF: db.cs.cmu.edu/papers/2024/wh…
Phil Eaton (@eatonphil) 's Twitter Profile Photo

Video of Sam Arch 🇦🇺 's talk from NYC Systems August 2024 is now up! Dear UDFs, I Broke Up With You, But Now I'm Ready To Give You A Second Chance. Will You Take Me Back? Sincerely, SQL youtube.com/watch?v=XMIEkn…

Video of <a href="/SamArchDB/">Sam Arch 🇦🇺</a> 's talk from NYC Systems August 2024 is now up!

Dear UDFs, I Broke Up With You, But Now I'm Ready To Give You A Second Chance. Will You Take Me Back? Sincerely, SQL

youtube.com/watch?v=XMIEkn…
Andy Pavlo (@andypavlo.bsky.social) (@andy_pavlo) 's Twitter Profile Photo

VLDB'24 Paper #1: Collecting training data for ML models with DBs is $$$/slow. Wanshen Li's Boot framework uses PostgreSQL extensions to cutoff redundant queries. Offline training goes from weeks to hours! • Code: github.com/lmwnshn/boot • Paper: x.com/pvldb/status/1…

Chris Lattner (@clattner_llvm) 's Twitter Profile Photo

This speaks to me: it’s the essence of building hard things that take time to have impact, but then matter in a big way - you have to love the process, not just the outcome!

Sam Arch 🇦🇺 (@samarchdb) 's Twitter Profile Photo

I am flying to the Bay Area to give talks about my new VLDB 2025 paper on UDFs with Andy Pavlo (@andypavlo.bsky.social) and Jignesh Patel. With our new technique (UDF outlining), queries run up to 1000× faster than FROID. The paper will drop soon. 9/10 Databricks 9/11  UC Berkeley 9/12 #HTAPSummit2024

I am flying to the Bay Area to give talks about my new VLDB 2025 paper on UDFs with <a href="/andy_pavlo/">Andy Pavlo (@andypavlo.bsky.social)</a> and <a href="/pateljm/">Jignesh Patel</a>. With our new technique (UDF outlining), queries run up to 1000× faster than FROID.  The paper will drop soon.

9/10 <a href="/databricks/">Databricks</a> 
9/11  <a href="/UCBerkeley/">UC Berkeley</a>
9/12 #HTAPSummit2024
Sam Arch 🇦🇺 (@samarchdb) 's Twitter Profile Photo

The UDF world tour continues. This week, I'm stopping at UW Madison, Microsoft's Gray Systems Lab, and The University of Washington. See you all there. 10/24 Wisconsin DB Group (@wiscdb.bsky.social) 10/25 Microsoft Gray Systems Lab 10/25 uwdb

PVLDB (@pvldb) 's Twitter Profile Photo

Vol:18 No:1 → The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining vldb.org/pvldb/vol18/p1…

Vol:18 No:1 → The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining vldb.org/pvldb/vol18/p1…
Andy Pavlo (@andypavlo.bsky.social) (@andy_pavlo) 's Twitter Profile Photo

The latest paper from the #1 CMU-DB PhD student Sam Arch 🇦🇺's is wild compilation DB magic! He automatically makes UDFs run 300x faster on Microsoft SQL Server and 1.3x faster on DuckDB. Code: github.com/SamArch27/PRISM Paper: vldb.org/pvldb/vol18/p1…