Stephen Bach (@stevebach) Twitter Tweets • TwiCopy

Stephen Bach

@stevebach

Asst. prof. @BrownCSDept. Working on improving how humans teach computers. Weak supervision, zero-shot learning, few-shot learning, and high-level knowledge.

ID: 8453442

Link: https://cs.brown.edu/people/sbach

Joined: 27-08-2007 03:36:42

1.1K Tweets

1.1K Followers

473 Following

Really interesting findings from Yong and many great collaborators. Test-time scaling generalizes cross-lingually, but maybe not in the way you’d hope. S1 tends to quote in the original language and then think in English.

Likes: 19, Replies: 0, Reposts: 2

Sophia Yang, Ph.D.

@sophiamyang

a month ago

Can an AI trained in English solve math problems in other languages without extra training?

Likes: 628, Replies: 18, Reposts: 82

Daniel Litt

@littmath

a month ago

Right, I think in the near term we should expect progress to be driven more by productivity increases for existing human scientists than, like, super-clever AI. My hope is that this lets us cover more attention-bottlenecks, but I don’t think it buys us much creativity etc.

Likes: 3, Replies: 1, Reposts: 1

Daniel Khashabi 🕊️

@danielkhashabi

a month ago

Long-form inputs (e.g., needle-in-haystack setups) are the crucial aspect of high-impact LLM applications. While previous studies have flagged issues like positional bias and distracting documents, they've missed a crucial element: the size of the gold/relevant context. In our

Likes: 51, Replies: 3, Reposts: 17

Yisong Yue

@yisongyue

25 days ago

Excited for the CLEVER Benchmark for verified code generation in Lean, led by Amitayush Thakur & team! 161 tasks! ✅ Fully verified — all correctness is machine-checked 📷 Leakage-resistant — specs are non-computable propositions, so models can't copy logic 🧠 Truly end-to-end

Likes: 39, Replies: 1, Reposts: 3

Stephen Bach

@stevebach

24 days ago

So excited about the Snorkel AI news! We’ve been saying for a long time that data is the key. This is a big next step.

Likes: 22, Replies: 2, Reposts: 1

Forbes

@forbes

24 days ago

Snorkel AI Raises $100 Million To Build Better Evaluators For AI Models trib.al/ij2cSiD

Likes: 24, Replies: 8, Reposts: 6

Yong Zheng-Xin (Yong)

@yong_zhengxin

21 days ago

🧵 Multilingual safety training/eval is now standard practice, but a critical question remains: Is multilingual safety actually solved? Our new survey with Cohere Labs answers this and dives deep into: - Language gap in safety research - Future priority areas Thread 👇

Likes: 59, Replies: 4, Reposts: 29

Greg Durrett

@gregd_nlp

21 days ago

Great to work on this benchmark with astronomers in our NSF-Simons CosmicAI institute! What I like about it: (1) focus on data processing & visualization, a "bite-sized" AI4Sci task (not automating all of research) (2) eval with VLM-as-a-judge (possible with strong, modern VLMs)

Likes: 25, Replies: 2, Reposts: 3

Amina Abdullahi

@amilah_dul

21 days ago

New KDD 2025 paper: Can large language models (LLMs) reason like biomedical scientists? We introduce K-Paths, a retrieval framework for extracting reasoning paths from knowledge graphs (KGs) to aid drug discovery tasks. 👇 Thread:

Likes: 14, Replies: 3, Reposts: 7

Brown CS

@browncsdept

20 days ago

We're happy to announce that effective as of July 1, 2025, faculty members Stephen Bach and Srinath Sridhar have received named chairs. Steve is now the Eliot Horowitz Assistant Professor in CS and Srinath is the John E. Savage Assistant Professor in CS: cs.brown.edu/news/2025/06/0…

Likes: 82, Replies: 0, Reposts: 6

Alex Ratner

@ajratner

11 days ago

Scale alone is not enough for AI data. Quality and complexity are equally critical. Excited to support all of these for LLM developers with Snorkel AI Data-as-a-Service, and to share our new leaderboard! — Our decade-plus of research and work in AI data has a simple point:

Likes: 142, Replies: 15, Reposts: 33