Marcin Kuthan
@marcinkuthan
#Mastodon: fosstodon.org/@mkuthan, feel free to follow me there for updates and discussion
ID: 4859763490
http://mkuthan.github.io/ 29-01-2016 09:37:46
493 Tweets
286 Followers
224 Following
Blogged about the power of Unified Batch and Stream Processing! Seamlessly combine real-time processing for insights with efficient management of historical data mkuthan.github.io/blog/2023/09/2… Don't miss my code samples: github.com/mkuthan/stream… #DataProcessing #Streaming #BigData #ApacheBeam #Scala
🙏 The first edition of Checkpoint Chronicle includes links out to content from folk across the data and streaming space, including Responsive's Rohan Desai, @cloudflare's Matt Boyle, Andrea Medda, Robert Metzger, Yaroslav Tkachenko, Tabular (now part of Databricks)'s Ryan Blue, Vinoth Chandar, Ali Ghodsi,
🗣️ "We present the first detailed, empirical evaluation of three popular and increasingly-adopted formats and evaluate their suitability to be used as a native format in a DBMS" Apache Arrow, Apache Parquet, Apache ORC compared, by Chunwei Liu et al. vldb.org/pvldb/vol16/p3…
No work items left unturned: How Dataflow mitigates stragglers cloud.google.com/blog/products/… Google Cloud, why do you publish articles with links to internal resources that are not available to the public?
My new book, Building Resilient Distributed Systems, is now available in early access from O'Reilly Media. A few chapters are available at this stage, ahead of the planned publication in August next year. You can find out more about the book here: samnewman.io/books/building…
The new native Apache Kafka container image is dope for unit testing. Compiled into a native binary via #GraalVM, a single-node broker is starting in ~150 ms on my laptop; an entire test via Testcontainers completes in less than two seconds 🥳. 👉 cwiki.apache.org/confluence/dis…