Dan Roth (@danrothnlp)'s Twitter Profile
Dan Roth

@danrothnlp

VP/Distinguished Scientist, AWS AI Labs and the Eduardo D. Glandt Distinguished Professor, CIS, University of Pennsylvania

ID: 148722158

http://seas.upenn.edu/~danroth/ · Joined: 27-05-2010 12:49:07

41 Tweets

1.1K Followers

55 Following

NAACL HLT 2025 (@naaclmeeting)

📢 The #NAACL2022 Call For Papers is out! 2022.naacl.org/calls/papers/

New this year:
- Reviewing for main conference submissions will be handled by ACLRollingReview, except for Special Theme submissions.
- Optional reproducibility badges!

NAACL HLT 2025 (@naaclmeeting)

The theme of NAACL 2022 is “Human-Centered NLP”. We invite submissions that address research questions that meaningfully incorporate stakeholders in the design, development, and evaluation of NLP resources, models, and systems. More details: 2022.naacl.org/blog/special-t…

CoNLL 2024 (@conll_conf)

BabyBERTa: Learning More Grammar With Small-Scale Child-Directed Language, by Philip A. Huebner, Elior Sulem, Cynthia Fisher and Dan Roth

Adam Seligman (@adamse)

aws.amazon.com/codewhisperer/ is really neat. Helps you code faster, checks for security vulns, discloses licenses of code it drew from, and works great for AWS APIs. Boom! Amazon Web Services putting ML to work for developers

Xingyu Fu (@xingyufu2)

Can GPT-4V and Gemini-Pro perceive the world the way humans do? 🤔

Can they solve the vision tasks that humans can in the blink of an eye? 😉

tldr; NO, they are far worse than us 💁🏻‍♀️

Introducing BLINK👁 zeyofu.github.io/blink/, a novel benchmark that studies visual perception

Zijian Wang @ ICML 🇦🇹 (@zijianwang30)

🚀Introducing "Fewer Truncations Improve Language Modeling" at #ICML2024 

We tackle a fundamental issue in LLM pre-training: docs are often broken into pieces. Such truncation hinders the model from learning to compose logically coherent and factually grounded content. 

👇🧵1/n

Zijian Wang @ ICML 🇦🇹 (@zijianwang30)

The common practice in LLM pre-training is to concat all docs, then split into equal-length chunks. This is efficient but hurts data integrity: doc fragmentation leads to loss of info and causes next-token prediction to be ungrounded, making the model prone to hallucination. 🧵2/n
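
For concreteness, here is a minimal sketch of that concat-then-chunk pipeline; the toy documents and sequence length are illustrative, not the paper's setup:

```python
# A minimal sketch of the "concat all docs, then split into equal-length chunks"
# practice described in the thread. Token IDs and seq_len are toy values.

from typing import List

def concat_then_chunk(docs: List[List[int]], seq_len: int) -> List[List[int]]:
    """Concatenate tokenized documents and cut the stream into fixed-length chunks."""
    stream: List[int] = []
    for doc in docs:
        stream.extend(doc)
    # Chunk boundaries ignore document boundaries, so documents that straddle a
    # boundary get fragmented -- the truncation problem the thread is about.
    return [stream[i:i + seq_len] for i in range(0, len(stream), seq_len)]

# Toy example: three "documents" of 700, 900, and 500 tokens with seq_len=1024.
chunks = concat_then_chunk([[1] * 700, [2] * 900, [3] * 500], seq_len=1024)
# chunks[0] holds doc 1 plus the head of doc 2; doc 2's tail and doc 3's head
# share chunks[1]; doc 3's tail ends up alone in chunks[2].
```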

Zijian Wang @ ICML 🇦🇹 (@zijianwang30)

Best-fit Packing completely eliminates unnecessary truncations while retaining the same training efficiency as concatenation, with <0.01% overhead tested on popular pre-training datasets like Technology Innovation Institute's RefinedWeb and BigCode's Stack. 🧵5/n

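The thread gives no code, but the idea behind Best-fit Packing can be sketched roughly as follows: cut only documents longer than the sequence length, then pack whole documents into training sequences with a best-fit heuristic. This is an illustrative re-implementation, not the authors' released code:

```python
# Rough sketch of packing documents without unnecessary truncation, in the spirit
# of Best-fit Packing. Only documents longer than seq_len are cut (at seq_len);
# the pieces are then packed best-fit-decreasing. Illustrative only; the paper's
# implementation is engineered to scale to billions of documents.

from typing import List

def best_fit_pack(docs: List[List[int]], seq_len: int) -> List[List[List[int]]]:
    """Pack tokenized documents into sequences of capacity seq_len,
    never splitting a document that already fits."""
    # Cut only documents that cannot fit into a single sequence.
    pieces: List[List[int]] = []
    for doc in docs:
        for i in range(0, len(doc), seq_len):
            pieces.append(doc[i:i + seq_len])

    # Best-fit decreasing: put each piece into the open sequence with the least
    # remaining room that can still hold it; otherwise start a new sequence.
    pieces.sort(key=len, reverse=True)
    sequences: List[List[List[int]]] = []
    remaining: List[int] = []
    for piece in pieces:
        best = -1
        for j, room in enumerate(remaining):
            if len(piece) <= room and (best == -1 or room < remaining[best]):
                best = j
        if best == -1:
            sequences.append([piece])
            remaining.append(seq_len - len(piece))
        else:
            sequences[best].append(piece)
            remaining[best] -= len(piece)
    return sequences  # each inner list is one training sequence of whole documents
```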

Xingyu Fu (@xingyufu2)

Can Text-to-Image models understand common sense? 🤔

Can they generate images that fit everyday common sense? 🤔

tldr; NO, they are far less intelligent than us 💁🏻‍♀️

Introducing Commonsense-T2I 💡 zeyofu.github.io/CommonsenseT2I/, a novel evaluation and benchmark designed to measure

Xingyu Fu (@xingyufu2)

🔥Highlights of the Commonsense-T2I benchmark:

📚Pairwise text prompts with minimum token change

⚙️Rigorous automatic evaluation with descriptions for expected outputs

❗️Even DALL-E 3 only achieves below 50% accuracy

(2/n)
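
A hedged sketch of how such a pairwise protocol could be scored; `generate_image` and `judge_matches` below are hypothetical stand-ins for a T2I model and an automatic judge, not the benchmark's actual interface:

```python
# Hypothetical sketch of pairwise scoring for a Commonsense-T2I-style benchmark:
# each example is a pair of prompts differing by only a few tokens, each paired
# with a description of the expected output; the model is credited only when the
# images for BOTH prompts match their expected descriptions.

from typing import Callable, List, Tuple

Example = Tuple[str, str, str, str]  # (prompt_a, expected_a, prompt_b, expected_b)

def pairwise_accuracy(
    examples: List[Example],
    generate_image: Callable[[str], bytes],       # hypothetical T2I model
    judge_matches: Callable[[bytes, str], bool],  # hypothetical automatic judge
) -> float:
    correct = 0
    for prompt_a, expected_a, prompt_b, expected_b in examples:
        img_a = generate_image(prompt_a)
        img_b = generate_image(prompt_b)
        # A pair counts only if both images fit their expected-output descriptions.
        if judge_matches(img_a, expected_a) and judge_matches(img_b, expected_b):
            correct += 1
    return correct / len(examples) if examples else 0.0
```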

Weijia Shi (@weijiashi2)

Augmenting GPT-4o with Visual Sketchpad ✏️

We introduce the Sketchpad agent, a framework that equips multimodal LLMs with a visual canvas and drawing tools 🎨, improving GPT-4o's performance on vision and math tasks 📈

🔗: visualsketchpad.github.io
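
A speculative sketch of the kind of loop such a canvas-plus-tools framework implies; `call_model`, `render_action`, and the reply format are assumptions for illustration, not the Sketchpad API:

```python
# Speculative sketch of a "visual canvas + drawing tools" agent loop: the model
# either returns a final answer or proposes a drawing action; the action is
# rendered and the resulting image is appended to the context for the next step.
# The callables and reply format are assumptions, not the Sketchpad code.

from typing import Callable, Dict, List, Optional

def sketchpad_loop(
    question: str,
    image: bytes,
    call_model: Callable[[List[object]], Dict[str, Optional[str]]],  # multimodal LLM
    render_action: Callable[[str], bytes],                           # canvas / drawing tool
    max_steps: int = 5,
) -> Optional[str]:
    context: List[object] = [question, image]
    for _ in range(max_steps):
        reply = call_model(context)
        if reply.get("answer") is not None:        # the model has seen enough
            return reply["answer"]
        canvas = render_action(reply["draw"])      # e.g. auxiliary lines, crops, plots
        context.extend([reply["draw"], canvas])    # feed the sketch back as an image
    return None
```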