Yangsibo Huang (@yangsibohuang) 's Twitter Profile
Yangsibo Huang

@yangsibohuang

Research scientist @GoogleAI. Prev: PhD from @Princeton @PrincetonPLI. ML security & privacy. Opinions are my own.

ID: 2878031881

Link: http://hazelsuko07.github.io/yangsibo/ · Joined: 26-10-2014 08:09:08

276 Tweets

3.3K Followers

709 Following

Mengdi Wang (@mengdiwang10) 's Twitter Profile Photo

Princeton University #AI is recruiting Postdoc Fellows in AI for Accelerating Invention! 

Join us if you want to advance generative AI, RL and AI applications in engineering and science! Apply here today:
puwebp.princeton.edu/AcadHire/apply…

<a href="/ryan_p_adams/">Ryan Adams</a> <a href="/jrexnet/">Jennifer Rexford</a> <a href="/EPrinceton/">Princeton Engineering</a> <a href="/Princeton/">Princeton University</a>
Wenting Zhao (@wzhao_nlp) 's Twitter Profile Photo

Can you really train a smart LLM without copyrighted material? There has been hope that a small LM + retrieval might circumvent data requirements. We think this approach is a bit of a mirage: it only improves performance on simple tasks but hurts reasoning capabilities.

Xindi Wu (@cindy_x_wu) 's Twitter Profile Photo

How good is the compositional generation capability of current Text-to-Image models? arxiv.org/abs/2408.14339

Introducing ConceptMix, our new benchmark that evaluates how well models can generate images that accurately combine multiple visual concepts, pushing beyond simple,
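As a rough illustration of the kind of compositional prompt such a benchmark evaluates, here is a minimal Python sketch; the concept pools, prompt template, and function names are hypothetical stand-ins, not the actual ConceptMix data or grading pipeline:

```python
import random

# Hypothetical concept pools; the real ConceptMix categories, prompts,
# and automated grading are described in arxiv.org/abs/2408.14339.
MODIFIERS = {
    "color":   ["red", "blue", "green"],
    "texture": ["fluffy", "metallic", "wooden"],
    "shape":   ["square", "round", "star-shaped"],
}
OBJECTS = ["smartphone", "teapot", "backpack"]

def compose_prompt(k: int, seed: int = 0) -> tuple[str, list[str]]:
    """Build one prompt combining k visual concepts (k-1 modifiers plus 1 object)."""
    rng = random.Random(seed)
    categories = rng.sample(list(MODIFIERS), k - 1)
    modifiers = [rng.choice(MODIFIERS[c]) for c in categories]
    obj = rng.choice(OBJECTS)
    prompt = "a " + ", ".join(modifiers) + " " + obj
    return prompt, modifiers + [obj]

# Example: compose_prompt(4) might yield "a red, fluffy, square smartphone";
# the generated image is then checked for each of the 4 concepts separately.
```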
Ahmad Beirami (@abeirami) 's Twitter Profile Photo

Excellent tips!
1. Always have a 1yr research vision/plan which'll guide what to work on.
2. Go after big/important problems rather than incremental research.
3. If a problem is such that you know someone else is going to crack it in the next 3mos, that's not worth your while.

Leshem Choshen 🤖🤗 (@lchoshen) 's Twitter Profile Photo

Human feedback is critical for aligning LLMs, so why don’t we collect it in the open ecosystem?🧐
We (15 orgs) gathered the key issues and next steps.
Envisioning a community-driven feedback platform, like Wikipedia

alphaxiv.org/abs/2408.16961
🧵
Yangsibo Huang (@yangsibohuang) 's Twitter Profile Photo

Collecting, using, and sharing human feedback on models brings up new privacy and copyright concerns. We discuss these issues and key considerations in Sections 3.6 and 3.7.

Yangsibo Huang (@yangsibohuang) 's Twitter Profile Photo

I recently began exploring how memorization affects model capabilities. E.g., we found that image generation models struggle with prompts that combine more than 3 visual concepts (e.g., "red," "fluffy," "squared," "smartphone") & we attribute this to their training data.

Dawn Song (@dawnsongtweets) 's Twitter Profile Photo

Large Language Model Agents is the next frontier. Really excited to announce our Berkeley course on LLM Agents, also available for anyone to join as a MOOC, starting Sep 9 (Mon) 3pm PT! 📢
Sign up &amp; join us: llmagents-learning.org
Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We’re releasing DataGemma: open models that enhance LLM factuality by grounding them with real-world data from Google's Data Commons. 💡

It tackles hallucinations in AI models to generate more accurate and useful responses.

Here’s how they work 🧵 dpmd.ai/47nWbvK
Xinyun Chen (@xinyun_chen_) 's Twitter Profile Photo

Super glad to see a lot of excitement about our course! Again, huge thanks to Denny Zhou for coming to Berkeley and sharing insights on LLM reasoning!! Please join us for the 2nd lecture, where Shunyu Yao will give an overview of LLM agents and share his thoughts on important directions.

Sadhika Malladi (@sadhikamalladi) 's Twitter Profile Photo

Submit to the Math of Modern Machine Learning (M3L) workshop at NeurIPS 2024! Deadline is Sep 29. sites.google.com/view/m3l-2024/

Bill Yuchen Lin 🤖 (@billyuchenlin) 's Twitter Profile Photo

Both 🍓 o1-mini and o1-preview by OpenAI are on our ZeroEval reasoning leaderboard (Ai2) now! Note that there is a significant improvement on 🦓 ZebraLogic and MATH-L5!

🔗 Link on Hugging Face: hf.co/spaces/allenai…
A. Feder Cooper (@afedercooper) 's Twitter Profile Photo

Exciting announcement! The submission portal for ACM CS+Law '25 is now open! Please send your papers in to this amazing venue at the intersection of computer science and law. CFP: computersciencelaw.org/2025 Submit: cslaw25.hotcrp.com cc The GenLaw Center

FAR.AI (@farairesearch) 's Twitter Profile Photo

"Please learn from our mistakes. Don't do exactly the same things that we did, or you'll end up in ten years with having nothing to show for it." — Nicholas Carlini urging AI researchers to avoid the pitfalls of past adversarial ML research at the Vienna Alignment Workshop 2024.

Yuntian Deng (@yuntiandeng) 's Twitter Profile Photo

Is OpenAI's o1 a good calculator? We tested it on up to 20x20 multiplication—o1 solves up to 9x9 multiplication with decent accuracy, while gpt-4o struggles beyond 4x4. For context, this task is solvable by a small LM using implicit CoT with stepwise internalization. 1/4
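For a sense of what such an evaluation could look like, here is a minimal Python sketch (hypothetical; not the authors' test harness, and the prompt wording and helper names are illustrative) that builds random n-digit by n-digit multiplication problems and checks a model's reply against exact arithmetic:

```python
import random
import re

def make_problem(n_digits: int, seed: int = 0) -> tuple[str, int]:
    """Return a prompt asking for an n-digit x n-digit product, plus the exact answer."""
    rng = random.Random(seed)
    lo, hi = 10 ** (n_digits - 1), 10 ** n_digits - 1
    a, b = rng.randint(lo, hi), rng.randint(lo, hi)
    return f"What is {a} * {b}? Answer with the number only.", a * b

def is_correct(model_output: str, answer: int) -> bool:
    """Take the last integer in the model's reply and compare it to the exact product."""
    numbers = re.findall(r"\d[\d,]*", model_output)
    return bool(numbers) and int(numbers[-1].replace(",", "")) == answer

# Usage sketch: sweep n_digits from 1 to 20, query the model on many problems
# per size, and report accuracy for each (n_digits x n_digits) cell.
prompt, answer = make_problem(9, seed=1)
print(prompt, answer)
```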

Robin Jia (@robinomial) 's Twitter Profile Photo

Really excited about this new workshop we’re proposing for *CL! Memorization of training data is both fascinating to analyze and has a wide range of legal/privacy/benchmarking/social implications. Please vote if you’re interested!

Yangsibo Huang (@yangsibohuang) 's Twitter Profile Photo

I'm super excited about the *CL workshop we're planning to organize on LLM memorization, & its implications for compliance (privacy/copyright) and capabilities (evaluation/generalization). Plz help by RT and voting in the thread!