Travis Downs (@trav_downs) 's Twitter Profile
Travis Downs

@trav_downs

Making stuff fast at @redpandadata.

ID: 1072520634259910660

linkhttps://travisdowns.github.io/ calendar_today11-12-2018 15:57:00

12,12K Tweet

4,4K Followers

175 Following

Pete Cawley (@corsix) 's Twitter Profile Photo

Given: 1. crc32 has throughput 1 on port 1 2. pclmulqdq has throughput 1 on port 5 3. pclmulqdq+pxor can emulate crc32 It seems that fastest crc32 code should divide input in half and issue a crc32 _and_ a pclmulqdq every cycle. Code and numbers at corsix.org/content/fast-c…

Alexander Yee (@mysticial) 's Twitter Profile Photo

y-cruncher v0.7.10 coming out soon with Zen4 AVX512 optimizations. This was fun since there are no optimization resources yet for this chip. Had to do all the RE myself.😅

y-cruncher v0.7.10 coming out soon with Zen4 AVX512 optimizations.

This was fun since there are no optimization resources yet for this chip. Had to do all the RE myself.😅
Travis Downs (@trav_downs) 's Twitter Profile Photo

Hey AMD release the AMD Ryzen Zen 4 optimization guide already. Keeping this under wraps is just a dead loss isn't it? A weird "we hate people optimizing for our chips too early" thing.

Alexander Yee (@mysticial) 's Twitter Profile Photo

While everyone is enjoying all the #AMD #Zen4 performance reviews, here is my teardown of Zen4's #AVX512 implementation and architecture. tl;dr - Intel has some serious competition now. mersenneforum.org/showthread.php…

Dougall (@dougallj) 's Twitter Profile Photo

New ARM instructions were just released! (With SVE 2.1, among many other things) (I always use the exploration tools, but you can view them online too: developer.arm.com/documentation/…) developer.arm.com/downloads/-/ex…

🕺💃🤟 Alexander Gallego (@emaxerrno) 's Twitter Profile Photo

Touisteur EmporteUneVache 10-20GB/s yes. We don't have anyone at 40GB/s yet. 400GbE not yet, but DM me if you want to test. This is the reason we have a dedicated Performance Engineering team x.com/trav_downs/sta…

🕺💃🤟 Alexander Gallego (@emaxerrno) 's Twitter Profile Photo

some pretty cool tests we are doing on the #redpanda cloud of aggregate throughputs of 40GB/s. nbd .... more deets to come on Nov 15th. hopin.com/events/redpand… thanks to Travis Downs for the goodie you see below.... maybe we should do a 200GB/s test next? 🤣 #kafka

some pretty cool tests we are doing on the #redpanda cloud of aggregate throughputs of 40GB/s. nbd .... more deets to come on Nov 15th. hopin.com/events/redpand…

thanks to <a href="/trav_downs/">Travis Downs</a> for the goodie you see below.... maybe we should do a 200GB/s test next? 🤣 

#kafka
P99CONF (@p99conf) 's Twitter Profile Photo

We're excited to hear Redpanda Data's Travis Downs describe the practical experience of building high performance systems with C++20 in an asynchronous runtime. He'll also discuss tradeoffs in adopting a thread-per-core architecture. bit.ly/43RHzlV #P99CONF #ScyllaDB

We're excited to hear <a href="/redpandadata/">Redpanda Data</a>'s Travis Downs describe the practical experience of building high performance systems with C++20 in an asynchronous runtime. He'll also discuss tradeoffs in adopting a thread-per-core architecture. bit.ly/43RHzlV

#P99CONF #ScyllaDB
Travis Downs (@trav_downs) 's Twitter Profile Photo

Despite the expansive title this is mostly a quick look at coroutine performance for those who are interested in that kind of thing.

Travis Downs (@trav_downs) 's Twitter Profile Photo

So we have LLMs that border on indsiguishable from real people (well, this depends in part on the company you keep) but autocorrect on my phone is still awful, making mistakes a toddler could correct. Is it a latency problem?

Daniel Lemire (@lemire) 's Twitter Profile Photo

The latest release of the simdutf C++ library (6.0.0) brings in more convenient for C++20 users. While you used to have to provide both a pointer and a size parameter... often you can now just pass your container... std::vector<char> data{1, 2, 3, 4, 5}; // C++11 API auto cpp11