Lars Quaedvlieg (@lars_quaedvlieg) 's Twitter Profile
Lars Quaedvlieg

@lars_quaedvlieg

Research Assistant @ CLAIRE | MSc Data Science @ EPFL πŸ‡¨πŸ‡­ | Interested in reasoning with foundation models and sequential decision-making 🧠

ID: 1694031255262617600

linkhttps://lars-quaedvlieg.github.io/ calendar_today22-08-2023 16:58:25

19 Tweet

43 Followers

163 Following

Grigoris Chrysos (@grigoris_c) 's Twitter Profile Photo

πŸ—’οΈ#NeurIPS paper on tackling NP-hard problem(s) is out! How can self-training meet dynamic programming? Paper: arxiv.org/abs/2310.18672 1/2

Skander Moalla (@skandermoalla) 's Twitter Profile Photo

Have you ever been left puzzled by your PPO agent collapsing out of nowhere? πŸ“ˆπŸ€―πŸ“‰ We’ve all been there... We can help you with a hint: monitor your representations!πŸ’‘ πŸš€ We show that PPO suffers from degrading representations and that this breaks its trust region πŸ’”

Have you ever been left puzzled by your PPO agent collapsing out of nowhere? πŸ“ˆπŸ€―πŸ“‰
We’ve all been there... We can help you with a hint: monitor your representations!πŸ’‘
πŸš€ We show that PPO suffers from degrading representations and that this breaks its trust region πŸ’”
Caglar Gulcehre (@caglarml) 's Twitter Profile Photo

Submissions to our workshop at ICML on the next "Next Generation of Sequence Modelling" are open now πŸŽ‰πŸŽ‡. If you have interesting ideas on improving research related to sequence models, consider submitting them to our workshop: sites.google.com/view/ngsmworks…

Justin Deschenaux (@jdeschena) 's Twitter Profile Photo

How is Stable Diffusion or Dall.E able to synthesize novel images? Does this arise from the architecture? Is it the training data? πŸ€” Our new paper in Β #ICML2024Β πŸš€ ICML Conference focuses on the interpolation aspect of generalization🧡 1/8

How is Stable Diffusion or Dall.E able to synthesize novel images? Does this arise from the architecture? Is it the training data? πŸ€” Our new paper in Β #ICML2024Β πŸš€ <a href="/icmlconf/">ICML Conference</a> focuses on the interpolation aspect of generalization🧡 1/8
Justin Deschenaux (@jdeschena) 's Twitter Profile Photo

🌟 Excited to share our latest work on making diffusion language models (DLMs) faster than autoregressive (AR) models! ⚑ It’s been great to work on this with Caglar Gulcehre 😎 Lately, DLMs are gaining traction as a promising alternative to autoregressive sequence modeling πŸ‘€ 1/14 🧡

Xiuying Wei@Neurips (Wed11am East #2010) (@xiuyingwei966) 's Twitter Profile Photo

Want to make the big FFN more efficient? Check out our work tomorrow! πŸ“… Wednesday, 11 am – 2 pm PST πŸ“ East Exhibit Hall A–C #2010 Stop by and explore how structured matrices can make LLM training more efficient. πŸš€ Caglar Gulcehre Skander Moalla

Want to make the big FFN more efficient?
Check out our work tomorrow!
πŸ“… Wednesday, 11 am – 2 pm PST
πŸ“ East Exhibit Hall A–C #2010
Stop by and explore how structured matrices can make LLM training more efficient. πŸš€
<a href="/caglarml/">Caglar Gulcehre</a> <a href="/SkanderMoalla/">Skander Moalla</a>
Lars Quaedvlieg (@lars_quaedvlieg) 's Twitter Profile Photo

Proud to have contributed to EvoTune β€” a new method that combines evolutionary search and RL fine-tuning to accelerate algorithm discovery with LLMs! πŸš€ Check out claire-labo.github.io/EvoTune for more info and cool visualizations :)