Lintang Sutawika
@lintangsutawika
Incoming Ph.D. student @LTIatCMU. Researcher at @AIEleuther.
Maintainer of LM-Eval Harness.
Here for machine learning papers and discussion.
ID:767583770471833600
22-08-2016 04:46:46
238 Tweets
387 Followers
566 Following
Torchtune is shipping with LM Evaluation Harness integration for evals of finetunes! Excited to see lm-eval adopted by the ecosystem—evals are crucial.
We (Lintang Sutawika and I) are looking forward to collaborating with the torchtune team to build out deeper integration!
I'm presenting this with Edward Raff today! Come chat about memorization in LLMs, how open data and checkpoints enable interpretability research, and how interp. enables better AI & policy work.
NeurIPS Conference morning poster 1513, #NeurIPS2023 Booz Allen Hamilton EleutherAI
Meredith Whittaker One of the strongest pro arguments is that OS demystifies the tech, while the major companies pretend it’s magic. OS has a better track record of documentation when transparency is desperately needed in AI.
But I agree that an Apache 2.0 model is not going to take down Microsoft
@[email protected] on Mastodon I get a lot of these emails too :( I now have a standard boilerplate letter that I offer to send to people explaining that “LLM detectors” are bullshit and don’t work on an individual document level.
Next Tuesday, July 18, you are invited to join our open science community for a talk from Nora Belrose on Concept Erasure and Eliciting Latent Knowledge.
Thanks to Jonas and Jen Iofinova for organizing.
Register here: lnkd.in/gXDBWxkh
I wish we had this in AI. Talia Ringer 🟣 🎗️ regularly tells me about how awesome it is. I would love to get badges like this on my papers, and would 100% start deliberately biasing submissions to venues offering them.