Jacob Bamberger (@jacobbamberger) 's Twitter Profile
Jacob Bamberger

@jacobbamberger

Looking for topology where it shouldnโ€™t be. PhD student @CompSciOxford. Interested in Geometric Deep Learning and Applied Topology

ID: 1274046778665508865

calendar_today19-06-2020 18:30:11

52 Tweet

203 Followers

611 Following

Alvaro Arroyo (@arroyo_alvr) 's Twitter Profile Photo

๐Ÿšจ How do attention sinks relate to information flow in LLMs? We show how massive activations create attention sinks and compression valleys, revealing a three-stage theory of information flow in LLMs. ๐Ÿงต w/ Enrique* Federico Barbero xiaowen dong Michael Bronstein Yann LeCun Ravid Shwartz Ziv

๐Ÿšจ How do attention sinks relate to information flow in LLMs?

We show how massive activations create attention sinks and compression valleys, revealing a three-stage theory of information flow in LLMs. ๐Ÿงต

w/ Enrique* <a href="/fedzbar/">Federico Barbero</a> <a href="/epomqo/">xiaowen dong</a> <a href="/mmbronstein/">Michael Bronstein</a> <a href="/ylecun/">Yann LeCun</a> <a href="/ziv_ravid/">Ravid Shwartz Ziv</a>
Alex Tong (@alexandertong7) 's Twitter Profile Photo

#AITHYRA, Vienna's new Biomedical AI institute, is hiring Postdocs! Come work with us. Openings in: ๐Ÿ”น Generative AI ๐Ÿ”น Multimodal ML ๐Ÿ”น Virology ๐Ÿ”น Enzyme Function Apply by Nov 20: oeaw.ac.at/aithyra/postdoโ€ฆ #PostDoc #AI #ML #Vienna #ScienceJobs

#AITHYRA, Vienna's new Biomedical AI institute, is hiring Postdocs!

Come work with us. Openings in: ๐Ÿ”น Generative AI ๐Ÿ”น Multimodal ML ๐Ÿ”น Virology ๐Ÿ”น Enzyme Function

Apply by Nov 20: oeaw.ac.at/aithyra/postdoโ€ฆ #PostDoc #AI #ML #Vienna #ScienceJobs
Oscar Davis (@osclsd) 's Twitter Profile Photo

Introducing Generalised Flow Maps ๐ŸŽ‰ A stable, few-step generative model on Riemannian manifolds ๐Ÿชฉ ๐Ÿ“š Read it at: arxiv.org/abs/2510.21608 ๐Ÿ’พ Code: github.com/olsdavis/gfm Michael Albergo Nicholas Boffi Michael Bronstein Joey Bose

Olga Zaghen @ ICLR ๐Ÿ‡ธ๐Ÿ‡ฌ (@olgazaghen) 's Twitter Profile Photo

Cool news: our extended Riemannian Gaussian VFM paper is out! ๐Ÿ”ฎ We define and study a variational objective for probability flows ๐ŸŒ€ on manifolds with closed-form geodesics. Floor Eijkelboom Alison Cong Liu Max Welling Jan-Willem van de Meent Erik Bekkers ๐Ÿ”ฅ ๐Ÿ“œ arxiv.org/abs/2502.12981

Ben Murrell (@benjmurrell) 's Twitter Profile Photo

We figured out flow matching over states that change dimension. With "Branching Flows", the model decides how big things must be! This works wherever flow matching works, with discrete, continuous, and manifold states. We think this will unlock some genuinely new capabilities.

Jacob Bamberger (@jacobbamberger) 's Twitter Profile Photo

Flow Matching models often struggle to balance memorization and generalization. ๐Ÿ˜ฑ We set out to fix this โ€” by using the geometry of the data manifold. Introducing Carrรฉ du Champ Flow Matching (CDCFM)๐Ÿง‘โ€๐ŸŽจ๐Ÿฅ– โ€” improving generalization without sacrificing sample quality.

Flow Matching models often struggle to balance memorization and generalization. ๐Ÿ˜ฑ
We set out to fix this โ€” by using the geometry of the data manifold. 

Introducing Carrรฉ du Champ Flow Matching (CDCFM)๐Ÿง‘โ€๐ŸŽจ๐Ÿฅ– โ€” improving generalization without sacrificing sample quality.
Jacob Bamberger (@jacobbamberger) 's Twitter Profile Photo

Learning diffusion geometry from Iolo Jones was a highlight of this project for me. Highly recommend checking out his work: General diffusion geometry: arxiv.org/abs/2405.10858 Application to manifolds/differential geometry: arxiv.org/abs/2411.04100

Francesco Capuano (@_fracapuano) 's Twitter Profile Photo

TIL Flow matching faces a quality-generalisation trade-off that can be mitigated working with the (empirical) data manifold directly! ๐Ÿ‡ช๐Ÿ‡บ

Jacob Bamberger (@jacobbamberger) 's Twitter Profile Photo

We also found that flow matching (FM) can overfit unevenly across the data manifold. ๐Ÿง In this heterogeneous toy example (large + small circle), the large circle is memorized early while the other is learned much later. CDCFM mitigates this spatial imbalance. ๐Ÿ”ฅ