Lintang Sutawika (@lintangsutawika) Twitter Tweets • TwiCopy

Lintang Sutawika

@lintangsutawika

+ Follow

Incoming Ph.D. student @LTIatCMU. Researcher at @AIEleuther.
Maintainer of LM-Eval Harness.
Here for machine learning papers and discussion.

ID:767583770471833600

calendar_today22-08-2016 04:46:46

238 Tweets

387 Followers

566 Following

Hailey Schoelkopf

3 weeks ago

Torchtune is shipping with LM Evaluation Harness integration for evals of finetunes! Excited to see lm-eval adopted by the ecosystem—evals are crucial.

we (Lintang Sutawika and I) are looking forward to collaborating with the torchtune team to build out deeper integration!

thumb_up_off_alt33

chat_bubble_outline0

account_circle

Lintang Sutawika

@lintangsutawika

3 weeks ago

Dataset remains an important yet poorly understood part of language model development. The fact that s simple change (dataset and tokenizer) results in substantial improvements means there are important needles in a humongous haystack of data that should be understood better.

thumb_up_off_alt34

chat_bubble_outline0

account_circle

Stella Biderman

@BlancheMinerva

4 months ago

I'm presenting this with Edward Raff today! Come chat about memorization in LLMs, how open data and checkpoints enable interpretability research, and how interp. enables better AI & policy work.

NeurIPS Conference morning poster 1513, #NeurIPS2023 Booz Allen Hamilton EleutherAI

thumb_up_off_alt80

chat_bubble_outline0

account_circle

Aviya Skowron

9 months ago

Meredith Whittaker One of the strongest pro arguments is that OS demystifies the tech, while the major companies pretend it’s magic. OS has a better track record of documentation when transparency is desperately needed in AI.
But I agree that an Apache 2.0 model is not going to take down Microsoft

thumb_up_off_alt6

chat_bubble_outline0

account_circle

Hailey Schoelkopf

9 months ago

Presenting Pythia at board # 609 at 10:30 am today!

Come by to talk LLM training, enabling interp + novel data effect studies, open science, and more!

Presenting Pythia at board # 609 at 10:30 am today! Come by to talk LLM training, enabling interp + novel data effect studies, open science, and more!

thumb_up_off_alt108

chat_bubble_outline0

account_circle

Stella Biderman

@BlancheMinerva

9 months ago

@[email protected] on Mastodon I get a lot of these emails too :( I now have a standard boilerplate letter that I offer to send to people explaining that “LLM detectors” are bullshit and don’t work on an individual document level.

thumb_up_off_alt22

chat_bubble_outline0

account_circle

Cohere For AI

9 months ago

Next Tuesday, July 18, you are invited to join our open science community for a talk from Nora Belrose on Concept Erasure and Elicit Latent Knowledge.

Thanks to Jonas and Jen Iofinova for organizing.

Register here: lnkd.in/gXDBWxkh

Next Tuesday, July 18, you are invited to join our open science community for a talk from @norabelrose on Concept Erasure and Elicit Latent Knowledge. Thanks to @jonas_kg and @oohaijen for organizing. Register here: lnkd.in/gXDBWxkh

thumb_up_off_alt26

chat_bubble_outline0

account_circle

Riley Goodside

9 months ago

this is wild — kNN using a gzip-based distance metric outperforms BERT and other neural methods for OOD sentence classification

intuition: 2 texts similar if cat-ing one to the other barely increases gzip size

no training, no tuning, no params — this is the entire algorithm:

this is wild — kNN using a gzip-based distance metric outperforms BERT and other neural methods for OOD sentence classification intuition: 2 texts similar if cat-ing one to the other barely increases gzip size no training, no tuning, no params — this is the entire algorithm:

thumb_up_off_alt7,4K

chat_bubble_outline0

account_circle

EleutherAI

9 months ago

If you’re attending #ACL2023NLP or #icml2023 don't miss our seven exciting papers on crosslingual adaption of LLMs, the Pythia model suite, novel training methodologies for LLMs, data trusts, and more!

🧵

thumb_up_off_alt33

chat_bubble_outline0

account_circle

Stella Biderman

@BlancheMinerva

10 months ago

I wish we need this in AI. Talia Ringer 🟣 🎗️ regularly tells me about how awesome it is. I would love to get badges like this on my papers, and would 100% start deliberately biasing submissions to venues offering them.

thumb_up_off_alt78

chat_bubble_outline0

account_circle

clem 🤗

@ClementDelangue

10 months ago

There’s no money in open-source!

There’s no money in open-source!

thumb_up_off_alt183

chat_bubble_outline0

account_circle

Mark Riedl

10 months ago

One. I just want one.

thumb_up_off_alt23

chat_bubble_outline0

account_circle

Neal Parikh

10 months ago

This is a letter Feynman wrote to a former student who wrote congratulating him for the Nobel. I’ve posted it before but I really find it worth it to read especially as a student or early stage research person.

This is a letter Feynman wrote to a former student who wrote congratulating him for the Nobel. I’ve posted it before but I really find it worth it to read especially as a student or early stage research person.

thumb_up_off_alt6,2K

chat_bubble_outline0

account_circle

Aidan Gomez

11 months ago

it's pretty extraordinary that we have a literal doomsday cult steering the narrative on tech regulation. Not a figurative doomsday cult, a literal 'omnipotent entity is coming, it will annihilate us all' doomsday cult.

thumb_up_off_alt1,0K

chat_bubble_outline0

account_circle

Iz Beltagy

11 months ago

Why I am excited about OLMo - because I want to see the academic and open research on LLMs catching up with proprietary research. I know it is difficult but we should try, and open source is our best bet.

thumb_up_off_alt39

chat_bubble_outline0

account_circle

Naomi Saphra

1 year ago

The open source perf gap has consistently remained on the data side, and academia doesn’t have the right incentives to foster data specialists. We need paid nonprofit data curators and cleaners, because data work is unglamorous and won’t get you tenure.

thumb_up_off_alt121

chat_bubble_outline0

account_circle

Lintang Sutawika

@lintangsutawika

1 year ago

A highlight from ICLR 2023 at Kigali Rwanda: Meeting @[email protected] on Mastodon

A highlight from ICLR 2023 at Kigali Rwanda: Meeting @timnitGebru

thumb_up_off_alt80

chat_bubble_outline0

account_circle

fpc ok :)