Christopher Chou (@chrischou03) Twitter Tweets • TwiCopy

Christopher Chou

@chrischou03

a year ago

What happens when we give colorblind tests to LLMs? Super cool analysis by Lisa!

thumb_up_off_alt4

chat_bubble_outline6

repeat0

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

We are thrilled to announce the milestone release of SGLang Runtime v0.2, featuring significant inference optimizations after months of hard work. It achieves up to 2.1x higher throughput compared to TRT-LLM and up to 3.8x higher throughput compared to vLLM. It consistently

thumb_up_off_alt526

chat_bubble_outline13

repeat125

shareShare

Lianmin Zheng

@lm_zheng

a year ago

Grok-2 is here, a new frontier-level model from @xAI! I still remember the good old days when I was a GPU-poor grad student, playing with the Vicuna model and building the Chatbot Arena leaderboard with just a few GPUs. But now, my job at xAI is developing systems for the 100K

thumb_up_off_alt354

chat_bubble_outline12

repeat18

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Chatbot Arena update❤️‍🔥 Exciting news—@xAI's Grok-2 and Grok-mini are now officially on the leaderboard! With over 6000 community votes, Grok-2 has claimed the #2 spot, surpassing GPT-4o (May) and tying with the latest Gemini! Grok-2-mini also impresses at #5. Grok-2 excels in

thumb_up_off_alt2,2K

chat_bubble_outline193

repeat321

shareShare

Christopher Chou

@chrischou03

a year ago

Check out SGLang!

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Christopher Chou

@chrischou03

10 months ago

Fantastic work by Lisa. I often feel like I can predict a model's identity based on their output and she formalizes these vibes!

thumb_up_off_alt0

chat_bubble_outline1

repeat0

shareShare

Christopher Chou

@chrischou03

8 months ago

Check out Image Arena! We support models like Flux, Recraft, Dall-E and more for free!

thumb_up_off_alt19

chat_bubble_outline3

repeat4

shareShare

Christopher Chou

@chrischou03

7 months ago

We have released our initial leaderboard for text-to-image models! Check it out and let us know what you think. One thing interesting to see is the ranking shifts based on whether the prompts are preset or not.

thumb_up_off_alt26

chat_bubble_outline2

repeat4

shareShare

Christopher Chou

@chrischou03

7 months ago

Love the work on making it easier for people to explore our data and how people are using our platform

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

vLLM

@vllm_project

7 months ago

🚀 With the v0.7.0 release today, we are excited to announce the alpha release of vLLM V1: A major architectural upgrade with 1.7x speedup! Clean code, optimized execution loop, zero-overhead prefix caching, enhanced multimodal support, and more.

thumb_up_off_alt646

chat_bubble_outline15

repeat96

shareShare

Christopher Chou

@chrischou03

6 months ago

Excellent work by Parth! Can’t wait to test this out myself :)

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Ion Stoica

@istoica05

6 months ago

The progress in AI is down to three basic resources; (1) people (experts), (2) data, and (3) infrastructure. Arguably, at this point the US is only ahead in (3). Also, at this point the Chinese open source models are ahead. Not only DeepSeek but also Qwen. This is a fact.

thumb_up_off_alt859

chat_bubble_outline67

repeat154

shareShare

Christopher Chou

@chrischou03

6 months ago

Congrats to the xAI team :)

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Blockchain at Berkeley

@calblockchain

5 months ago

💰🏆B@B members just swiped 30k in Prizes in one weekend, check out all the projects below: 🌟 Projects: Command Flare by Ravi Riley Rohan Vardhan Shorewala Oleg Viatkin - RAG Knowledge - 1st place AND Consensus Learning - 3rd place Voice To Flare by @mason_arditi, Romain ,

thumb_up_off_alt52

chat_bubble_outline10

repeat14

shareShare

Christopher Chou

@chrischou03

3 months ago

Excited to finally share what we've been working on for the past year! Marin is a platform for developing foundation models from data curation to large-scale model training to evaluation. It's been a privilege to work on this with so many amazing people!

thumb_up_off_alt31

chat_bubble_outline6

repeat3

shareShare

Christopher Chou

@chrischou03

3 months ago

LFG Chatbot Arena! AI Evaluation is an incredibly hard thing to get right especially balancing between user preference and model capability. I’m confident in the team to execute on this mission. Excited for the future!

thumb_up_off_alt13

chat_bubble_outline0

repeat0

shareShare

Nick Jiang @ ICLR

@nickhjiang

2 months ago

Vision transformers have high-norm outliers that hurt performance and distort attention. While prior work removed them by retraining with “register” tokens, we find the mechanism behind outliers and make registers at ✨test-time✨—giving clean features and better performance! 🧵

thumb_up_off_alt995

chat_bubble_outline15

repeat134

shareShare

Lisa Dunlap

@lisabdunlap

2 months ago

At #CVPR2025 ? Come see my talk on building evals which embrace the fuzziness of generative models at the EVAL-FoMo workshop today! This talk had everything - from Chatbot Arena to model vibes to designing UI's :P Details: June 11th, 4:30pm, room 210

thumb_up_off_alt22

chat_bubble_outline1

repeat3

shareShare

David Hall

@dlwh

2 months ago

So about a month ago, Percy posted a version of this plot of our Marin 32B pretraining run. We got a lot of feedback, both public and private, that the spikes were bad. (This is a thread about how we fixed the spikes. Bear with me. )

thumb_up_off_alt968

chat_bubble_outline21

repeat94

shareShare

Google AI Developers

@googleaidevs

a month ago

.Center for Research on Foundation Models's Marin project has released the first fully open model in JAX. It’s an 'open lab' sharing the entire research process - including code, data, and logs, to enable reproducibility and further innovation. developers.googleblog.com/en/stanfords-m…

thumb_up_off_alt315

chat_bubble_outline5

repeat53

shareShare