MF FOOM (@mf_foom) 's Twitter Profile
MF FOOM

@mf_foom

masked attention head

ID: 1681867033309507584

linkhttp://github.com/MF-FOOM calendar_today20-07-2023 03:22:29

369 Tweet

1,1K Followers

255 Following

Sasha Rush (@srush_nlp) 's Twitter Profile Photo

Talk: Inverting Language Models (by dr. jack morris) youtube.com/watch?v=lguThu… Techniques for extracting text from vector databases and prompts from LLM APIs.

thebes (@voooooogel) 's Twitter Profile Photo

New blog post: how to make LLMs go fast! Want to understand how people are making LLMs go brrrrr? This post is a survey of lots of different LLM inference optimizations, ranging from "everyone uses this in prod" to "I cooked this up last week (but it seems to work)"

New blog post: how to make LLMs go fast! Want to understand how people are making LLMs go brrrrr? This post is a survey of lots of different LLM inference optimizations, ranging from "everyone uses this in prod" to "I cooked this up last week (but it seems to work)"
thebes (@voooooogel) 's Twitter Profile Photo

new blog post! played around w/ representation engineering, and released a new library for training control vectors in <60s! i train some useful control vectors, some… out there… control vectors, test them for jailbreaking, test them *against* jailbreaking, and more!

new blog post! played around w/ representation engineering, and released a new library for training control vectors in &lt;60s! i train some useful control vectors, some… out there… control vectors, test them for jailbreaking, test them *against* jailbreaking, and more!
Ofir Press (@ofirpress) 's Twitter Profile Photo

When a student sadly tells me that the idea we've been working on for weeks was just arXived, I say: "Great! We've just gotten *strong* confirmation that our thinking was in the right direction. We've had the initial work done for us. Lets figure out how to make this 10x better"

Haize Labs (@haizelabs) 's Twitter Profile Photo

ā€¼ļøāš ļøbad day to be a LLMāš ļøā€¼ļø Haize Labs took one of our favorite adversarial attack algorithms, GCG, and made it *38x* faster

ā€¼ļøāš ļøbad day to be a LLMāš ļøā€¼ļø

<a href="/haizelabs/">Haize Labs</a> took one of our favorite adversarial attack algorithms, GCG, and made it *38x* faster
Charles Foster (@cfgeek) 's Twitter Profile Photo

Contrast pairs are overpowered. Once you have them, you can use them to generate control vectors, and to initialize classifiers, and to do RL/DPO, and probably more

MF FOOM (@mf_foom) 's Twitter Profile Photo

Why not just train your embedding model to embed queries directly into answer space? Feels like an inefficient use of flops.

MF FOOM (@mf_foom) 's Twitter Profile Photo

there was a good thread recently estimating the current upper bound of tokens in the world (including Gmail and other private repositories), but I can't find it can anyone point me to it?