GX Xu (@gx_nlp)'s Twitter Profile
GX Xu

@gx_nlp

Research Engineer @ Redhat AI Innovation

ID: 1542500294340186112

Joined: 30-06-2022 13:28:40

26 Tweets

63 Followers

313 Following

Ruibo Liu (@ruiboliu)'s Twitter Profile Photo

🎲Life is a game. Play by your rules! 🎮 Stable Alignment enables LM to learn social norms from simulated everyday interactions in a social game! 👫 Check this out 👇: arxiv.org/abs/2305.16960

Matt Shumer (@mattshumer_)'s Twitter Profile Photo

Here is an incredible Claude 3 prompt for engineers. Use it to speed up any code by identifying inefficiencies and rectifying them: --- <prompt_explanation> You are a world expert in making code run faster. You use any resource you can to do so. Given some code, first, explain

Jeremy Howard (@jeremyphoward)'s Twitter Profile Photo

I'd given up using ChatGPT for all but the most basic tasks -- I just wasn't getting answers that were good enough to be of practical use to me. But Claude 3 Opus is being genuinely useful, and it's making me use LLM chat again. Thanks Anthropic!

Inflection AI (@inflectionai)'s Twitter Profile Photo

Evaluation is everything! While testing Inflection-2.5, we found that MT-Bench has a bunch of incorrect answers. Here we share the corrections for everyone to use, and we release a new Physics GRE benchmark for people to try out. inflection.ai/inflection-2-5

Jeremy Howard (@jeremyphoward)'s Twitter Profile Photo

Today, with Tim Dettmers, Hugging Face, & @mobius_labs, we're releasing FSDP/QLoRA, a new project that lets you efficiently train very large (70b) models on a home computer with consumer gaming GPUs. 1/🧵 answer.ai/posts/2024-03-…

Brendan Dolan-Gavitt (@moyix)'s Twitter Profile Photo

I gave Claude 3 the entire source of a small C GIF decoding library I found on GitHub, and asked it to write me a Python function to generate random GIFs that exercised the parser. Its GIF generator got 92% line coverage in the decoder and found 4 memory safety bugs and one hang.
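The generator the tweet describes could look something like the sketch below: emit a valid GIF signature so the decoder gets past its magic-number check, then random bytes to exercise deeper parsing paths. All names here (`random_gif`, `GIF_HEADER`) are hypothetical stand-ins, not code from the tweet; a real grammar-aware generator would also construct logical screen descriptors and image blocks.

```python
import random

GIF_HEADER = b"GIF89a"  # valid GIF signature so the parser proceeds

def random_gif(rng, max_body_len=64):
    """Hypothetical fuzz-input generator: a real GIF header followed by
    random bytes, aimed at exercising a GIF decoder's parsing code."""
    body = bytes(rng.randrange(256) for _ in range(rng.randrange(max_body_len)))
    return GIF_HEADER + body

rng = random.Random(42)
sample = random_gif(rng)
assert sample.startswith(GIF_HEADER)
```

In practice such inputs would be fed to the decoder under a coverage tool (and a sanitizer, to surface the memory-safety bugs the tweet mentions).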

swyx (@swyx)'s Twitter Profile Photo

I've now had multiple >20min phone calls with AI therapists and it feels completely natural. Every AI Engineer should be building their own therapist rn, and voice is the right medium. forget typing. go on a long walk and talk thru your day, your childhood, your dreams,

Elron Bandel (@elronbandel)'s Twitter Profile Photo

A personal note: Unitxt originated within Leshem Choshen's fusing team, aiming to streamline the sharing of academic outputs, primarily through model weights but also data. In the process of training various models on numerous datasets, we encountered significant challenges related

GX Xu (@gx_nlp)'s Twitter Profile Photo

Even a powerful LLM like Claude 3 Opus breaks under the simplest attacks and starts hallucinating “non-existent” context about “steps”. The kind of mistake a human five-year-old wouldn’t make. 😉

GX Xu (@gx_nlp)'s Twitter Profile Photo

TLDR: Looking for an RLHF method that combines the best of PPO and DPO, trains stably, and gives amazing results? BRAIn theoretically unites DPO and PPO and is empirically shown to outperform both! An earlier pre-print of the ICML paper is available now 🔥

GX Xu (@gx_nlp)'s Twitter Profile Photo

A new RL alignment method, here’s Gaurav’s excellent blog that explains why BRAIn is more stable and gives better performance than PPO and DPO 🔥

lmarena.ai (formerly lmsys.org) (@lmarena_ai)'s Twitter Profile Photo

Congrats Google DeepMind on the new Gemma-2 27B & 9B release! Gemma-2 was tested in the Arena under the codename "*late-june-chatbots" and now out of stealth. Its early result matches the best open models (Llama-3-70B, Nemotron-340B) with only 27B parameters! Impressively,

Red Hat AI (@redhat_ai)'s Twitter Profile Photo

.Red Hat AI Innovation team just dropped a new research paper on inference-time scaling! 🚨 All built on vLLM. Paper and code here: …abilistic-inference-scaling.github.io Cheers to paper authors Akash Srivastava, Kai Xu, GX Xu, Shivchander Sudalairaj, and Isha Puri!

Isha Puri (@ishapuri101)'s Twitter Profile Photo

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint MIT CSAIL / Red Hat AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out …abilistic-inference-scaling.github.io

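The core loop of a particle-filtering approach to inference scaling can be sketched in a few lines: keep a population of partial generations, weight them by a reward/verifier score, and resample so compute concentrates on promising candidates. This is a minimal toy, not the paper's implementation; `propose` and `reward` are hypothetical stand-ins for an LLM step function and a process reward model.

```python
import math
import random

def particle_filter_search(propose, reward, n_particles=8, n_steps=3, seed=0):
    """Toy particle filter over partial generations."""
    rng = random.Random(seed)
    particles = [""] * n_particles
    for _ in range(n_steps):
        # Extend every particle by one generation step.
        particles = [propose(p, rng) for p in particles]
        # Weight by exponentiated reward and resample proportionally,
        # so high-reward partial generations get duplicated.
        weights = [math.exp(reward(p)) for p in particles]
        total = sum(weights)
        particles = rng.choices(particles, weights=[w / total for w in weights],
                                k=n_particles)
    return max(particles, key=reward)

# Demo with stand-ins: "generation" appends a random digit, and the
# reward prefers strings whose digits sum high.
def propose(prefix, rng):
    return prefix + str(rng.randint(0, 9))

def reward(text):
    return sum(int(c) for c in text) / 10 if text else 0.0

best = particle_filter_search(propose, reward)
print(best)  # a 3-digit string biased toward large digits
```

The appeal of the approach in the tweet is that all the extra compute goes into sampling and scoring at inference time; the underlying model needs no additional training.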
Hao Wang (@hw_haowang)'s Twitter Profile Photo

[1/x] 🚀 We're excited to share our latest work on improving inference-time efficiency for LLMs through KV cache quantization---a key step toward making long-context reasoning more scalable and memory-efficient.

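The basic idea behind KV cache quantization can be illustrated with a minimal sketch: store the attention key/value activations in 8 bits plus a scale factor, and dequantize on the fly. This is a generic per-tensor symmetric int8 scheme for illustration only, not the specific method from the linked work.

```python
import numpy as np

def quantize_int8(x):
    """Per-tensor symmetric int8 quantization: map floats into [-127, 127]."""
    max_abs = float(np.abs(x).max())
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q, scale):
    """Recover an approximation of the original floats."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
kv = rng.standard_normal((4, 16)).astype(np.float32)  # stand-in K/V block
q, scale = quantize_int8(kv)
kv_hat = dequantize_int8(q, scale)

print(q.nbytes, kv.nbytes)  # 64 256: int8 storage is 4x smaller than float32
print(bool(np.abs(kv - kv_hat).max() < 0.05))
```

The memory saving is what makes long-context inference cheaper: the KV cache grows linearly with sequence length, so shrinking each cached activation by 4x directly extends the context a given GPU can hold.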
Red Hat AI (@redhat_ai)'s Twitter Profile Photo

LLM inference is too slow, too expensive, and too hard to scale. 🚨 Introducing llm-d, a Kubernetes-native distributed inference framework, to change that—using vLLM, smart scheduling, and disaggregated compute. Here’s how it works—and how you can use it today:

Red Hat AI (@redhat_ai)'s Twitter Profile Photo

Random Samples, our weekly seminar series that bridges the gap between cutting-edge AI research and real-world application, continues this Friday, July 18!

Title: 
Grounding Feedback is All You Need: Aligning Small Vision-Language Models

Abstract: 
While recent vision-language