
Kevin Swersky
@kswersk
Research Scientist at Deepmind.
ID: 3344958413
25-06-2015 02:14:46
390 Tweet
8,8K Followers
523 Following

Have you ever wondered what are the challenges and opportunities of using neural networks for data prefetching? Title: A Hierarchical Neural Model of Data Prefetching Abstract: asplos-conference.org/abstracts/aspl… @ZhanShi44105240 Akanksha Jain Milad Hashemi Kevin Swersky #ASPLOS21 #NeuralNetwork



Leveraging #MachineLearning for accelerator design enables faster exploration of the architecture search space leading to more efficient hardware across a range of applications. Collaboration w/: Christof Angermüller, Berkin Akin, Yanqi Zhou, Milad Hashemi, Kevin Swersky.




This was a very fun project: an elegant algorithm that works well on the difficult task of sampling from discrete EBMs. Congratulations will grathwohl and team!

Big shout-out to will grathwohl Jackson (Kuan-Chieh) Wang @cvpr jörn jacobsen David Duvenaud Kevin Swersky Mohammad Norouzi for their amazing paper "Your classifier is secretly an energy-based model and you should treat it like one"!


The overall recipe is general, and the same method could be applied to many other design problems in principle! More in the paper: arxiv.org/abs/2110.11346 Awesome collaboration led by Aviral Kumar & Amir Yazdan! w/ Milad Hashemi & Kevin Swersky


📢Introducing Pix2Seq-D, a generalist framework casting panoptic segmentation as a discrete data generation task conditioned on pixels. Works for both images and videos, with minimal task engineering. arxiv.org/abs/2210.06366 work w/ Lala Li, Saurabh Saxena Geoffrey Hinton David Fleet


This is a really natural framework to improve Bayesian optimization when you have access to related optimization tasks arxiv.org/abs/2109.08215 Joint work with Zi Wang, Ph.D., George E. Dahl, Chansoo Lee, Zachary Nado, Justin Gilmer, Jasper, Zoubin Ghahramani


Check out Kevin Clark’s and my paper on fine-tuning diffusion models on differentiable rewards! We present DRaFT, which computes gradients through diffusion sampling. DRaFT is efficient & works across many reward functions. With Kevin Swersky, David Fleet arXiv: arxiv.org/abs/2309.17400

We have a student researcher opportunity in our team Google DeepMind in Toronto 🍁 If you’re excited about research on diffusion models, and generative video models, please fill the form : forms.gle/auNq61N35AvTZS… and apply here: deepmind.google/about/careers/…


🆕🔥We show that LLMs *can* plan if instructed well! 🔥Instructing the model using ICL leads to a significant boost in planning performance, + can be further improved by using long context. arxiv.org/abs/2406.13094 w/ Azade Nova Bernd Bohnet A.Parisi Kati Goshvadi Kevin Swersky Hanjun Dai +


