Minjoon Seo (@seo_minjoon)'s Twitter Profile
Minjoon Seo

@seo_minjoon

Assistant Professor @kaist_ai

ID: 715563582

Link: https://seominjoon.github.io · Joined: 25-07-2012 05:56:10

173 Tweets

2.2K Followers

571 Following

Hyeonbin Hwang (@ronalhwang)

🚨 New LLM Reasoning Paper 🚨

Q. How can LLMs self-improve their reasoning ability?

⇒ Introducing Self-Explore⛰️🧭, a training method specifically designed to help LLMs avoid reasoning pits by learning from their own outputs! [1/N]
Ai2 (@allen_ai)

Announcing our latest addition to the OLMo family, OLMo 1.7!🎉Our team's efforts to improve data quality, training procedures and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:
TwelveLabs (twelvelabs.io) (@twelve_labs)

🚀 We're excited to share the technical report of Pegasus-1, our 17B-parameter VLM, setting new benchmarks in video understanding.

It surpasses larger models like Gemini Pro and Ultra in video conversation, QA, summarization, and temporal understanding.

bit.ly/pegasus-1-tech…
Seungone Kim @ NAACL2025 (@seungonekim)

#NLProc
Introducing 🔥Prometheus 2, an open-source LM specialized on evaluating other language models.

✅Supports both direct assessment & pairwise ranking.
✅ Improved evaluation capabilities compared to its predecessor.
✅Can assess based on user-defined evaluation criteria.
Seongyun Lee (@sylee_ai)

🚨 New LLM personalization/alignment paper 🚨

🤔 How can we obtain personalizable LLMs without explicitly re-training reward models/LLMs for each user?

✔ We introduce a new zero-shot alignment method to control LLM responses via the system message 🚀
Seungone Kim @ NAACL2025 (@seungonekim)

🤔How can we systematically assess an LM's proficiency in a specific capability without using summary measures like helpfulness or simple proxy tasks like multiple-choice QA?

Introducing the ✨BiGGen Bench, a benchmark that directly evaluates nine core capabilities of LMs.
Hoyeon Chang (@hoyeon_chang)

🚨 New paper 🚨
How Do Large Language Models Acquire Factual Knowledge During Pretraining?

I’m thrilled to announce the release of my new paper! 🎉

This research explores how LLMs acquire and retain factual knowledge during pretraining. Here are some key insights:
Doyoung Kim (@doyoungkim_ml)

🤔 Humans excel at generalizing plans to extrapolated data and at rapidly adapting with limited training data. How can language models do the same?
Introducing 🧠Cognitive Map for Language Models, a framework achieving Optimal Planning via Verbally Representing the World Model🌍
MiyoungKo (@miyoung_ko)

📢 Excited to share our latest paper on the reasoning capabilities of LLMs! Our research dives into how these models recall and utilize factual knowledge while solving complex questions. [🧵1 / 10]
arxiv.org/abs/2406.19502
Alice Oh (@aliceoh)

We are hosting wonderful NLP colleagues at KAIST on their way to ACL Bangkok! 🤩 On-site registration is closed, but the talks will be broadcast on Zoom. Please join us!

Date/Time: Aug 10, 2024, 10:05-12:30 KST (UTC+9)
Parallel Session 1: Advanced Language Models and AI

Hanna Hajishirzi (@hannahajishirzi)

Molmo, our first open multimodal language model, is here; we've equipped our OLMo with eyes! 👀

✨ Molmo: Raising the bar and outperforming the latest Llama 3.2 models.
🚀 Molmo-72B: Competes head-to-head with leading proprietary models.
🔥 MolmoE-1B: Ultra-efficient,

jiyeon kim (@jiyeonkimd)

❓Do LLMs maintain the capability of knowledge acquisition throughout pretraining? If not, what is the driving force behind it?

❗Our findings reveal that decreasing knowledge entropy hinders knowledge acquisition and retention as pretraining progresses.

📄arxiv.org/abs/2410.01380
Seonghyeon Ye (@seonghyeonye)

🚀 First step to unlocking Generalist Robots!

Introducing 🤖LAPA🤖, a new SOTA open-sourced 7B VLA pretrained without using action labels.

💪SOTA VLA trained with Open X (outperforming OpenVLA on cross and multi embodiment)
😯LAPA enables learning from human videos, unlocking

Joel Jang (@jang_yoel)

Excited to introduce 𝐋𝐀𝐏𝐀: the first unsupervised pretraining method for Vision-Language-Action models.

Outperforms SOTA models trained with ground-truth actions
30x more efficient than conventional VLA pretraining

📝: arxiv.org/abs/2410.11758

🧵 1/9

Haebin Shin @ NAACL2025 (@haebinshin_)

🚨 New paper alert! 🚨
Isn’t it wasteful to repeat lengthy & complex agent prompts every time?

Introducing "Generative Context Distillation"—a new lightweight method to internalize prompt.

🦾 Powerful performance 
💵 Efficient inference

"without the need for a prompt📜"

[1/7]
jiyeon kim (@jiyeonkimd)

Presenting ✨Knowledge Entropy✨ at #ICLR2025 today in Oral 5C (Garnet 216-218) at 10:30AM and in Poster Session 6 (#251) from 3:00PM.

We investigated how changes in a model's tendency to integrate its parametric knowledge during pretraining affect knowledge acquisition and forgetting.
Dongkeun Yoon (@dongkeun_yoon)

🙁 LLMs are overconfident even when they are dead wrong.

🧐 What about reasoning models? Can they actually tell us “My answer is only 60% likely to be correct”?

❗Our paper suggests that they can! Through extensive analysis, we investigate what enables this emergent ability.
Yunjae Won (@yunjae_won_)

[1/6] Ever wondered why Direct Preference Optimization is so effective for aligning LLMs? 🤔
Our new paper dives deep into the theory behind DPO's success, through the lens of information gain.

Paper: "Differential Information: An Information-Theoretic Perspective on Preference
Sahara AI (@saharalabsai)

🚨 Episode 3 of The AI Agent Takeover is happening June 5 at 1PM KST!

We’re sitting down with our very own Sean Ren | Sahara AI 🔆 and Minjoon Seo, CEO & Co-Founder of Config Intelligence to dive into the next wave of physical AI and how robots can learn from human demonstrations.