Cambridge MLG (@cambridgemlg) 's Twitter Profile
Cambridge MLG

@cambridgemlg

Machine Learning Group @Cambridge_Uni

ID: 255587198

linkhttp://mlg.eng.cam.ac.uk calendar_today21-02-2011 17:30:59

546 Tweet

5,5K Followers

136 Following

Dima Krasheninnikov (@dmkrash) 's Twitter Profile Photo

1/ Excited to finally tweet about our paper “Implicit meta-learning may lead LLMs to trust more reliable sources”, to appear at ICML 2024. Our results suggest that during training, LLMs better internalize text that appears useful for predicting other text (e.g. seems reliable).

1/ Excited to finally tweet about our paper “Implicit meta-learning may lead LLMs to trust more reliable sources”, to appear at ICML 2024. Our results suggest that during training, LLMs better internalize text that appears useful for predicting other text (e.g. seems reliable).
Katie Collins (@katie_m_collins) 's Twitter Profile Photo

It’s game night! Playing games, thinking about games, becoming an expert, and inventing entirely new games are key elements of our shared human experience. Yet how do we actually play new games in the first place? And determine what games we even want to play? 1/

It’s game night! Playing games, thinking about games, becoming an expert, and inventing entirely new games are key elements of our shared human experience. Yet how do we actually play new games in the first place? And determine what games we even want to play? 1/
Cambridge MLG (@cambridgemlg) 's Twitter Profile Photo

It's been great presenting at the #ICML2024 workshops. Thank you to everyone who came up to chat with us! Here are the works that we presented:

It's been great presenting at the #ICML2024 workshops. Thank you to everyone who came up to chat with us!

Here are the works that we presented:
Shoaib Ahmed Siddiqui (@shoaibasiddiqui) 's Twitter Profile Photo

Are all blocks in a pretrained LLM equally important? In our new preprint, “A deeper look at depth pruning of LLMs” (arxiv.org/abs/2407.16286), we attempt to better understand the impact of depth pruning, which is a specific case of structured pruning that directly translates to

Are all blocks in a pretrained LLM equally important? In our new preprint, “A deeper look at depth pruning of LLMs” (arxiv.org/abs/2407.16286), we attempt to better understand the impact of depth pruning, which is a specific case of structured pruning that directly translates to
Bruno Mlodozeniec (@kayembruno) 's Twitter Profile Photo

The biggest take-away for me from this work was that training neural nets with SGD does something akin to implicit model selection. The interesting thing is that, for this to work, the order of the data _really_ matters, unlike in exact Bayesian inference.

Alexander Terenin (@avt_im) 's Twitter Profile Photo

We’re extremely excited to announce the NeurIPS Workshop on Bayesian Decision-making and Uncertainty: from probabilistic and spatiotemporal modeling to sequential experiment design! This will take place at NeurIPS 2024, in Vancouver, BC, Canada, either on December 14th or 15th.

We’re extremely excited to announce the NeurIPS Workshop on Bayesian Decision-making and Uncertainty: from probabilistic and spatiotemporal modeling to sequential experiment design!

This will take place at NeurIPS 2024, in Vancouver, BC, Canada, either on December 14th or 15th.
Katie Collins (@katie_m_collins) 's Twitter Profile Photo

[New preprint!] What does it take to build machines that **meet our expectations** and **compliment our limitations**? In this Perspective, we chart out a vision, which engages deeply with computational cognitive science, to design truly human-centric AI “thought partners” 1/

[New preprint!] What does it take to build machines that **meet our expectations** and **compliment our limitations**? In this Perspective, we chart out a vision, which engages deeply with computational cognitive science, to design truly human-centric AI “thought partners” 1/
Usman Anwar (@usmananwar391) 's Twitter Profile Photo

Our agenda paper on alignment and safety of LLMs just got published at TMLR: openreview.net/forum?id=oVTkO… 🥳 The revised version is also now on arxiv arxiv.org/abs/2404.09932.

Ferenc Huszár (@fhuszar) 's Twitter Profile Photo

I made my first ever investment in a startup, Ångström AI They build on ML surrogate models to enable cost-effective simulation of molecular properties (e.g. solubility or binding) in pharmacologically relevant settings. A very hard technical problem that this team might crack.