MF FOOM (@mf_foom) Twitter Tweets • TwiCopy

MF FOOM

@mf_foom

+ Follow

masked attention head

ID: 1681867033309507584

linkhttp://github.com/MF-FOOM calendar_today20-07-2023 03:22:29

369 Tweet

1,1K Followers

255 Following

MF FOOM

@mf_foom

2 years ago

👀 github.com/pytorch-labs/f…

thumb_up_off_alt9

chat_bubble_outline1

repeat0

shareShare

Sasha Rush

@srush_nlp

2 years ago

Talk: Inverting Language Models (by dr. jack morris) youtube.com/watch?v=lguThu… Techniques for extracting text from vector databases and prompts from LLM APIs.

thumb_up_off_alt203

chat_bubble_outline3

repeat21

shareShare

New blog post: how to make LLMs go fast! Want to understand how people are making LLMs go brrrrr? This post is a survey of lots of different LLM inference optimizations, ranging from "everyone uses this in prod" to "I cooked this up last week (but it seems to work)"

thumb_up_off_alt1,1K

chat_bubble_outline12

repeat151

shareShare

thebes

@voooooogel

2 years ago

new blog post! played around w/ representation engineering, and released a new library for training control vectors in <60s! i train some useful control vectors, some… out there… control vectors, test them for jailbreaking, test them *against* jailbreaking, and more!

thumb_up_off_alt435

chat_bubble_outline39

repeat33

shareShare

Ofir Press

@ofirpress

2 years ago

When a student sadly tells me that the idea we've been working on for weeks was just arXived, I say: "Great! We've just gotten *strong* confirmation that our thinking was in the right direction. We've had the initial work done for us. Lets figure out how to make this 10x better"

thumb_up_off_alt234

chat_bubble_outline6

repeat14

shareShare

Haize Labs

@haizelabs

2 years ago

‼️⚠️bad day to be a LLM⚠️‼️ Haize Labs took one of our favorite adversarial attack algorithms, GCG, and made it *38x* faster

‼️⚠️bad day to be a LLM⚠️‼️

<a href="/haizelabs/">Haize Labs</a> took one of our favorite adversarial attack algorithms, GCG, and made it *38x* faster

thumb_up_off_alt67

chat_bubble_outline2

repeat15

shareShare

MF FOOM

@mf_foom

2 years ago

Yes, and get SOTA by doing so. x.com/vaibhav_adlakh…

thumb_up_off_alt11

chat_bubble_outline1

repeat1

shareShare

MF FOOM

@mf_foom

2 years ago

This is the most in-depth technical report we’ve gotten about a frontier-class model in a while!

thumb_up_off_alt47

chat_bubble_outline2

repeat2

shareShare

Charles Foster

@cfgeek

2 years ago

Contrast pairs are overpowered. Once you have them, you can use them to generate control vectors, and to initialize classifiers, and to do RL/DPO, and probably more

thumb_up_off_alt81

chat_bubble_outline2

repeat6

shareShare

MF FOOM

@mf_foom

2 years ago

Why not just train your embedding model to embed queries directly into answer space? Feels like an inefficient use of flops.

thumb_up_off_alt16

chat_bubble_outline3

repeat0

shareShare

MF FOOM

@mf_foom

2 years ago

there was a good thread recently estimating the current upper bound of tokens in the world (including Gmail and other private repositories), but I can't find it can anyone point me to it?

thumb_up_off_alt4

chat_bubble_outline2

repeat0

shareShare

MF FOOM

@mf_foom

2 years ago

he can't keep getting away with it

thumb_up_off_alt12

chat_bubble_outline0

repeat0

shareShare