David Zhang (@davwzha) 's Twitter Profile
David Zhang

@davwzha

ML research at Qualcomm AI

ID: 1154181653184561153

linkhttp://davzha.netlify.app calendar_today25-07-2019 00:08:44

42 Tweet

286 Takipçi

319 Takip Edilen

David Zhang (@davwzha) 's Twitter Profile Photo

Join us at tomorrows #ICLR2024 oral session in the afternoon to learn about how we can turn neural networks into data! Afterwards Miltos Kofinas 🦋 @miltoskofinas.bsky.social Yan and I will be at the poster: #77 Graph Neural Networks for Learning Equivariant Representations of Neural Networks

Join us at tomorrows #ICLR2024 oral session in the afternoon to learn about how we can turn neural networks into data!

Afterwards <a href="/MiltosKofinas/">Miltos Kofinas 🦋 @miltoskofinas.bsky.social</a> <a href="/Cyanogenoid/">Yan</a> and I will be at the poster: #77 Graph Neural Networks for Learning Equivariant Representations of Neural Networks
Hazel Doughty (@doughty_hazel) 's Twitter Profile Photo

I'm hiring for a fully funded #PhD position in #ComputerVision on 'Detailed Video Understanding' at Leiden Computer Science Universiteit Leiden. Apply before 22nd June. More info👇 universiteitleiden.nl/en/vacancies/2…

Auke Wiggers (@aukejw) 's Twitter Profile Photo

ARC is a tough reasoning benchmark where modern LLMs far underperform humans still. Great to see that there's serious additional backing! Coincidentally, we just open-sourced CodeIt, our LLM-improvement approach for ARC: github.com/Qualcomm-AI-re…

Haggai Maron (@haggaimaron) 's Twitter Profile Photo

Thanks, Kostas Daniilidis, Congyue Deng, and the team for inviting me. It's an honor to speak alongside this esteemed group of researchers! My talk will focus on our recent work on *Equivariant Weight Space Learning*: designing neural networks that can process other neural networks.

Lewis Tunstall (@_lewtun) 's Twitter Profile Photo

Our solution write-up for the 1st AIMO Progress Prize is now out ✍️! huggingface.co/blog/winning-a… In it, we share technical details on: ♾️💻 The 2-stage MuMathCode recipe we used to train NuminaMath 7B TIR with iterative SFT ⚖️ Evals on MATH - 56.3% for Stage 1 & 68.2% for Stage

Our solution write-up for the 1st AIMO Progress Prize is now out ✍️!

huggingface.co/blog/winning-a…

In it, we share technical details on:

♾️💻 The 2-stage MuMathCode recipe we used to train NuminaMath 7B TIR with iterative SFT

⚖️ Evals on MATH - 56.3% for Stage 1 &amp; 68.2% for Stage
Damian Borth (@damianborth) 's Twitter Profile Photo

Pleasant surprise of ICML Conference: There is a growing Weight Space Learning Community out there - it was great to meet you all: Haggai Maron (Haggai Maron), Gal Chechik (Gal Chechik), Konstantin Schürholt (Konstantin Schürholt), Eliahu Horwitz (Eliahu Horwitz), Derek Lim (Derek Lim),

Pleasant surprise of <a href="/icmlconf/">ICML Conference</a>: There is a growing Weight Space Learning Community out there - it was great to meet you all:

Haggai Maron (<a href="/HaggaiMaron/">Haggai Maron</a>), Gal Chechik (<a href="/GalChechik/">Gal Chechik</a>), Konstantin Schürholt (<a href="/k_schuerholt/">Konstantin Schürholt</a>), Eliahu Horwitz (<a href="/EliahuHorwitz/">Eliahu Horwitz</a>), Derek Lim (<a href="/dereklim_lzh/">Derek Lim</a>),
Damian Borth (@damianborth) 's Twitter Profile Photo

🚀 Exciting News! 🚀 I am more than happy to share that I have officially begun my research sabbatical this week! Over the coming months, I am fortunate to be working as TU/e Artificial Intelligence Systems Institute Visiting Professor at @TUeindhoven collaborating with Joaquin Vanschoren on some exciting ideas

🚀 Exciting News! 🚀

I am more than happy to share that I have officially begun my research sabbatical this week!

Over the coming months, I am fortunate to be working as <a href="/TUeEAISI/">TU/e Artificial Intelligence Systems Institute</a> Visiting Professor at @TUeindhoven  collaborating with <a href="/joavanschoren/">Joaquin Vanschoren</a> on some exciting ideas
Markus Nagel (@mnagel87) 's Twitter Profile Photo

Are you pursuing a PhD and are you interested in working on efficiency of LLMs/LVMs? Then join our model efficiency team in #QualcommAIResearch for an internship! Apply below, we have openings for 2025 as well as autumn/winter 2024. careers.qualcomm.com/careers/job/44…

@levelsio (@levelsio) 's Twitter Profile Photo

🇪🇺 eu/acc A few weeks ago Mario Draghi asked my recommendations for his report that came out today about European competitiveness I had a call with him and summarized my problems with doing business in the EU I wrote this which is included in the report presented to the

🇪🇺 eu/acc

A few weeks ago Mario Draghi asked my recommendations for his report that came out today about European competitiveness

I had a call with him and summarized my problems with doing business in the EU

I wrote this which is included in the report presented to the
Boris Knyazev (@borisaknyazev) 's Twitter Profile Photo

Optimization can be sped up by 50% using our NiNo model! It takes a history of parameter values and predicts future parameters leveraging "neural graphs". Accelerating Training with Neuron Interaction and Nowcasting Networks: arxiv.org/abs/2409.04434 code: github.com/SamsungSAILMon…

Optimization can be sped up by 50% using our NiNo model! It takes a history of parameter values and predicts future parameters leveraging "neural graphs".
Accelerating Training with Neuron Interaction and Nowcasting Networks: arxiv.org/abs/2409.04434
code: github.com/SamsungSAILMon…
Taco Cohen (@tacocohen) 's Twitter Profile Photo

🚨 Attention aspiring PhD students: Meta / FAIR is looking for candidates for a joint academic/industry PhD! 🚨 Among others, the CodeGen team is looking for candidates to work on world models for code, discrete search & continuous optimization methods for long-term planning,

🚨 Attention aspiring PhD students: Meta / FAIR is looking for candidates for a joint academic/industry PhD! 🚨

Among others, the CodeGen team is looking for candidates to work on world models for code, discrete search &amp; continuous optimization methods for long-term planning,
Guan-Horng Liu (@guanhorng_liu) 's Twitter Profile Photo

📢Interested in #interning at #FAIR NY? Excited to share that I have one internship position available for #Summer2025 🙂! Looking for PhD interested in flow/diffusion models, optimal transport/control for structural problems. 🙌Send me your CV, website & GScholar by #Oct16th!

Gabriel Synnaeve (@syhw) 's Twitter Profile Photo

Want to do research in code generation with LLMs and wonky deep learning from the 90s? We're recruiting one Master student (M2) intern for 2025 at FAIR Paris in my team metacareers.com/jobs/106871446…

Eliahu Horwitz | @ ICLR2025 (@eliahuhorwitz) 's Twitter Profile Photo

📢Thrilled to announce our ICLR 2025 ICLR 2026 workshop on Weight Space Learning, exploring model weights as a new data modality! 📢 Stay tuned for submission instructions and deadlines. openreview.net/pdf?id=Bz6wEdo… #weightspace #weightspacelearning #ICLR2025 #iclr

Derek Lim (@dereklim_lzh) 's Twitter Profile Photo

Our new workshop at ICLR 2025: Weight Space Learning: weight-space-learning.github.io Weights are data. We can learn from weights. Learning can outperform human-designed methods for optimization, interpretability, model merging, and more.