Daniele De Sensi (@daniele_desensi) 's Twitter Profile
Daniele De Sensi

@daniele_desensi

Tenure-track Assistant Professor at Sapienza University of Rome

ID: 398887098

linkhttps://danieledesensi.github.io/ calendar_today26-10-2011 17:51:14

80 Tweet

52 Followers

111 Following

Underfox (@underfox3) 's Twitter Profile Photo

In this paper is presented Swing, a new algorithm to improve allreduce performance on torus networks which performs a logarithmic number of steps and transmits the minimal number of bytes while reducing the distance between communicating nodes. #HPC arxiv.org/pdf/2401.09356…

In this paper is presented Swing, a new algorithm to improve allreduce performance on torus networks which performs a logarithmic number of steps and transmits the minimal number of bytes while reducing the distance between communicating nodes. #HPC

arxiv.org/pdf/2401.09356…
SPCL@ETH (@spcl_eth) 's Twitter Profile Photo

Check out how to accelerate allreduce and other collectives on multidimensional torus networks and HammingMesh using Swing. Up to 3x performance improvement on different topologies! Daniele De Sensi will present it at USENIX Association #NSDI24. arxiv.org/abs/2401.09356

Check out how to accelerate allreduce and other collectives on multidimensional torus networks and HammingMesh using Swing. 
Up to 3x performance improvement on different topologies! <a href="/daniele_desensi/">Daniele De Sensi</a> will present it at <a href="/usenix/">USENIX Association</a> #NSDI24.
arxiv.org/abs/2401.09356
Tiziano De Matteis (@tizianodem) 's Twitter Profile Photo

Are you passionate about #HPC, #GPU programming, and #numerical algorithms? Do you want tohelp us develop a new generation of computational tools to democratize access to massively parallel computing systems? A #PhD #position is now open at VU Amsterdam! workingat.vu.nl/vacancies/phd-…

Daniele De Sensi (@daniele_desensi) 's Twitter Profile Photo

We have open PhD/PostDoc positions👩‍🎓on workload acceleration using SmartNICs. You can find more details at hipeac.net/jobs/14559/phd… Submission deadline on March 16th, 2024. Feel free to drop me a message✉️if you need more information.

UniTrento_DISI (@unitrento_disi) 's Twitter Profile Photo

Passionate about #HPC? 📍The 2nd CINI Summer School on High Performance Computing and Emerging Technologies #HPC2024 will be held on 17-21 June 2024 Università di Trento, Italy ℹ hpc-summer-school-24.disi.unitn.it/home @hicrestlab Consorzio CINI Cineca SuperComputing ICSC E4 Computer Engineering

SPCL@ETH (@spcl_eth) 's Twitter Profile Photo

Happening right now at #NSDI👉 Nils Blach presenting "A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network" and Daniele De Sensi presenting his paper "Swing: Short-cutting Rings for Higher Bandwidth Allreduce" Torsten Hoefler 🇨🇭 ETH CS Department #HPC

Happening right now at #NSDI👉 <a href="/nilsblach/">Nils Blach</a>  presenting "A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network" and Daniele De Sensi presenting his paper "Swing: Short-cutting Rings for Higher Bandwidth Allreduce"
<a href="/thoefler/">Torsten Hoefler 🇨🇭</a> <a href="/CSatETH/">ETH CS Department</a> #HPC
SPCL@ETH (@spcl_eth) 's Twitter Profile Photo

How to model performance and accelerate reduction collectives for the Cerebras Wafer-Scale Engine? Learn more from Piotr Luczynski's talk at HPDC 2025! Friday 7th June, at 2 PM. Paper: arxiv.org/abs/2404.15888

How to model performance and accelerate reduction collectives for the Cerebras Wafer-Scale Engine? Learn more from Piotr Luczynski's talk at <a href="/ACM_HPDC/">HPDC 2025</a>! Friday 7th June, at 2 PM.

Paper: arxiv.org/abs/2404.15888
Torsten Hoefler 🇨🇭 (@thoefler) 's Twitter Profile Photo

Yutong Lu gave a great talk about #HPC interconnect optimization in Cetraro. She shows optimized allreduce for Dragonfly, now onto Slim Fly 😉. Also consider topology-aware collectives like Daniele's cool Swing algorithm 😎. Closing with extended notifications for #RMA.

Yutong Lu gave a great talk about #HPC interconnect optimization in Cetraro. 

She shows optimized allreduce for Dragonfly, now onto Slim Fly 😉. Also consider topology-aware collectives like Daniele's cool Swing algorithm 😎. 

Closing with extended notifications for #RMA.
Torsten Hoefler 🇨🇭 (@thoefler) 's Twitter Profile Photo

HammingMesh - our optimized failure-tolerant network topology for #DeepLearning enables embedding multi-dimensional rings now Communications of the ACM research highlight! cacm.acm.org/research-highl… Low cost, high scheduling flexibility, and remarkable similarity with the later TPUv4 network. #HPC

HammingMesh - our optimized failure-tolerant network topology for #DeepLearning enables embedding multi-dimensional rings now <a href="/CACMmag/">Communications of the ACM</a> research highlight!

cacm.acm.org/research-highl…

Low cost, high scheduling flexibility, and remarkable similarity with the later TPUv4 network. #HPC