Victor Lecomte (@vclecomte)'s Twitter Profile
Victor Lecomte

@vclecomte

CS PhD student at Stanford / Researcher at the Alignment Research Center

ID: 846711415771664384

Website: https://vlecomte.github.io/ · Joined: 28-03-2017 13:11:47

241 Tweets

653 Followers

201 Following

Victor Lecomte (@vclecomte)'s Twitter Profile Photo

My first dabble at studying learning dynamics (and at AI safety-related work)! It was a lot of fun figuring out the exact speed at which encodings get sparser under L1-regularization; I didn't expect the math to end up being so nice. 🙂
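The dynamic described above (encodings getting sparser at an exact, predictable speed under L1 regularization) can be illustrated with a toy NumPy sketch. This is not code from the paper: the vector size, learning rate, and penalty strength are illustrative assumptions. Under a pure L1 penalty, subgradient descent shrinks each weight toward zero at the constant rate `lr * lam`, so a weight starting at `w_0` hits exactly zero after about `|w_0| / (lr * lam)` steps:

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.normal(size=5)           # dense initial "encoding" (toy values)
lr, lam = 0.01, 1.0              # learning rate and L1 strength (assumed)
steps_to_zero = np.full(w.shape, -1)

w_t = w.copy()
for t in range(1, 500):
    # subgradient step on lam * ||w||_1
    w_new = w_t - lr * lam * np.sign(w_t)
    # a weight that would cross zero is clamped exactly to zero
    w_new = np.where(np.sign(w_new) == np.sign(w_t), w_new, 0.0)
    # record the first step at which each weight reaches zero
    steps_to_zero[np.logical_and(w_t != 0, w_new == 0)] = t
    w_t = w_new

# predicted hitting times |w_0| / (lr * lam) vs. observed ones
print(np.abs(w) / (lr * lam))
print(steps_to_zero)
```

Every weight ends at exactly zero, and the observed hitting times match the predicted `|w_0| / (lr * lam)` up to rounding, which is the "exact speed" flavor of result the tweet alludes to.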

kushal thaman (at ICLR 🇸🇬) (@kushal1t)'s Twitter Profile Photo

Excited to share the first paper of my undergrad: "Incidental Polysemanticity" arxiv.org/abs/2312.03096! We present a second, "incidental" origin story of polysemanticity in task-optimized DNNs. Done in collaboration with Victor Lecomte, trevor (taylor's version), Rylan Schaeffer, and Sanmi Koyejo. (1/n)

Eric Neyman (@ericneyman)'s Twitter Profile Photo

Last week, ARC put out a new paper! The paper is a research update on the "heuristic estimation" direction of our research into explaining neural network behavior. The paper starts by explaining what we mean by "heuristic estimation", through an example and three analogies 🧵

Gabriel Wu (@gabrieldwu1)'s Twitter Profile Photo

The Alignment Research Center (ARC) just released our first empirical paper: Estimating the Probabilities of Rare Outputs in Language Models. In this thread, I'll motivate the problem of low probability estimation and describe our setting/methods. 🧵

METR (@metr_evals)'s Twitter Profile Photo

How close are current AI agents to automating AI R&D? Our new ML research engineering benchmark (RE-Bench) addresses this question by directly comparing frontier models such as Claude 3.5 Sonnet and o1-preview with 50+ human experts on 7 challenging research engineering tasks.

Ryan Greenblatt (@ryanpgreenblatt)'s Twitter Profile Photo

New Redwood Research paper in collaboration with Anthropic: We demonstrate cases where Claude fakes alignment when it strongly dislikes what it is being trained to do. (Thread)

Rob Wiblin (@robertwiblin)'s Twitter Profile Photo

A new legal letter aimed at OpenAI lays out in stark terms the money and power grab OpenAI is trying to trick its board members into accepting — what one analyst calls "the theft of the millennium." The simple facts of the case are both devastating and darkly hilarious. I'll

Victor Lecomte (@vclecomte)'s Twitter Profile Photo

A cute question about inner product sketching came up in our research; any leads would be appreciated! 🙂 cstheory.stackexchange.com/questions/5539…