Ben Bogin (@ben_bogin)'s Twitter Profile
Ben Bogin

@ben_bogin

CS PhD student at Tel-Aviv University, studying #NLProc.
benbogin.github.io

ID: 150610839

Joined: 01-06-2010 10:56:41

136 Tweets

640 Followers

428 Following

Aizenberg (@aizenberg55)

THREAD: Gaza as “open prison” or “caged” is a core lie by Israel haters & anti-Zionists. Part of thinking is to never give credit for Israel leaving Gaza permanently as it shatters entire narrative of “perpetual occupation”; all blame MUST remain on Israel. Here's the truth 1/

Hen Mazzig (@henmazzig)

Hamas: “we will repeat the October 7 massacre time and again, 1M times if we need to, until we end the occupation.” Journalist: “occupation of Gaza?” Hamas: “no, all of Israel.”

Maayan Zin (@zinmaayan1007)

Worldwide people chant for "Palestine to be Free" yet my two daughters, captives of Hamas, are ignored. Even as thousands in London push for ceasefire, there's silence on their release. I'm heartbroken by the world's indifference. Dafna and Ella, I miss you so much 💔

Ai2 (@allen_ai)

OLMo is here! And it’s 100% open. It’s a state-of-the-art LLM and we are releasing it with all pre-training data and code. Let’s get to work on understanding the science behind LLMs. Learn more about the framework and how to access it here: blog.allenai.org/olmo-open-lang…
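
For context, a minimal sketch of loading such a checkpoint with Hugging Face transformers; this assumes a recent transformers release with OLMo support and the allenai/OLMo-7B checkpoint named in the release (earlier releases required the ai2-olmo package and trust_remote_code=True):

```python
# A sketch, not official usage: load OLMo via Hugging Face transformers.
# Assumes a recent transformers release with native OLMo support; earlier
# releases required `pip install ai2-olmo` plus trust_remote_code=True.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "allenai/OLMo-7B"  # checkpoint named in the release
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

inputs = tokenizer("Language modeling is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```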

AK (@_akhaliq)

Allen AI presents Dolma

an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

paper page: huggingface.co/papers/2402.00…

dataset: huggingface.co/datasets/allen…

We release Dolma, a three-trillion-token English corpus, built from a diverse mixture of web content,
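
Given the corpus size, a minimal sketch of streaming Dolma with the datasets library rather than downloading it outright; the allenai/dolma dataset id comes from the tweet's link, and the "text" field name is an assumption typical of pretraining corpora:

```python
# A sketch: stream Dolma with the `datasets` library instead of downloading
# all ~3T tokens up front. Dataset id taken from the tweet's link; the
# "text" field name is an assumption (typical for pretraining corpora).
from datasets import load_dataset

dolma = load_dataset("allenai/dolma", split="train", streaming=True)

for i, doc in enumerate(dolma):
    print(doc["text"][:200])  # peek at the first few documents
    if i >= 2:
        break
```
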
Haidar Khan (@haidarkk1)

Skeptical about LLM benchmarks telling the whole story? 🤔 Tiny tweaks to tests like MMLU can shuffle model rankings like a deck of cards. 🃏
  
Our latest work delves into #LLM benchmarks to highlight this 
ArXiv link: arxiv.org/abs/2402.01781
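
As a toy illustration of the kind of "tiny tweak" the paper studies (not the authors' code): permuting the answer-choice order of a single MMLU-style item yields 24 prompts for the same question, and rankings shift when models answer them inconsistently:

```python
# Toy illustration (not the paper's code): permute answer-choice order for
# one MMLU-style item. Each of the 24 orderings is a "tiny tweak" that a
# robust model should answer identically -- in practice, many don't.
from itertools import permutations

question = "What is the capital of France?"
choices = ["Berlin", "Paris", "Madrid", "Rome"]

def format_prompt(question: str, choices: list[str]) -> str:
    lines = [question]
    lines += [f"{letter}. {choice}" for letter, choice in zip("ABCD", choices)]
    lines.append("Answer:")
    return "\n".join(lines)

prompts = [format_prompt(question, list(p)) for p in permutations(choices)]
print(len(prompts))  # 24 distinct prompts for the same underlying question
# A hypothetical evaluation would query the model on each prompt and check
# whether it tracks the position of "Paris" across all orderings.
```
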
Maor Ivgi (@maorivg)

1/5 🧠 Excited to share our latest paper focusing on the heart of LLM training: data curation! We train a 7B LLM achieving 64% on 5-shot MMLU, using only 2.6T tokens. The key to this performance? Exceptional data curation. #LLM #DataCuration
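
The tweet doesn't describe the curation pipeline itself, but a generic heuristic quality filter gives the flavor of what data curation often involves; the thresholds below are illustrative assumptions, not the paper's recipe:

```python
# Illustrative only: a generic heuristic quality filter, one common flavor
# of pretraining data curation. Thresholds are made-up assumptions, not
# the paper's recipe.
def keep_document(text: str) -> bool:
    words = text.split()
    if len(words) < 50:                     # drop very short documents
        return False
    if len(set(words)) / len(words) < 0.3:  # drop highly repetitive text
        return False
    alpha = sum(w.isalpha() for w in words) / len(words)
    return alpha > 0.7                      # drop markup/boilerplate-heavy pages

raw_corpus = ["document one ...", "document two ..."]  # placeholder documents
curated = [doc for doc in raw_corpus if keep_document(doc)]
```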

Maor Ivgi (@maorivg)

1/7 🚨 What do LLMs do when they are uncertain? We found that the stronger the LLM, the more it hallucinates and the less it loops! This pattern extends to sampling methods and instruction tuning. 🧵👇
Mor Geva (@megamor2), Jonathan Berant (@JonathanBerant), Ori Yoran (@OriYoran)
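
As a toy heuristic for the "looping" failure mode mentioned above (not the paper's methodology): degenerate repetition can be flagged by counting repeated n-grams in a generation:

```python
# Toy heuristic (not the paper's methodology): flag a "looping" generation
# by checking whether any n-gram repeats many times.
from collections import Counter

def looks_like_loop(text: str, n: int = 4, threshold: int = 3) -> bool:
    tokens = text.split()
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    if not ngrams:
        return False
    # Flag the text if its most frequent n-gram repeats heavily.
    return Counter(ngrams).most_common(1)[0][1] >= threshold

print(looks_like_loop("the cat sat on the mat " * 5))    # True: degenerate loop
print(looks_like_loop("a short, coherent answer here"))  # False
```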