Jian Zhang (@jianzhangcs) Twitter Tweets • TwiCopy

Jian Zhang

2 years ago

Last-mile quality & robustness (the high-hanging fruit) are going to be the test bed for enterprise LLM players in 2024.

Likes: 2 · Replies: 0 · Reposts: 0

Andrej Karpathy

e/ia - Intelligence Amplification
- Does not seek to build superintelligent God entity that replaces humans.
- Builds “bicycle for the mind” tools that empower and extend the information processing capabilities of humans.
- Of all humans, not a top percentile.
- Faithful to

Likes: 5.5K · Replies: 353 · Reposts: 753

Jian Zhang

@jianzhangcs

2 years ago

🤖 It was a great conversation with Ben Lorica 罗瑞卡 to exchange thoughts on GenAI copilots & agents for cybersecurity and enterprise workflows in general! ⛰ At @NexusflowX, we are very excited to build the Gen AI foundations for enterprise workflow agents, owned, controllable and

Likes: 5 · Replies: 1 · Reposts: 3

Jian Zhang

@jianzhangcs

2 years ago

Through many customer conversations, we discovered that the top adoption criterion for GenAI agents is the reliability to accomplish tasks with minimal hallucinations. 📢 Check out how @NexusflowX leverages a new agent building paradigm, with extractive reasoning as the first-class

Likes: 5 · Replies: 0 · Reposts: 1

Jian Zhang

@jianzhangcs

2 years ago

Very exciting long-context, multimodal capability from Google. The full-repo code analysis part is pretty amazing!

Likes: 2 · Replies: 0 · Reposts: 0

Jian Zhang

@jianzhangcs

2 years ago

📢 Exciting release of the Starling-7B-beta chat model and Starling-34B-RM reward model, powered by Nexusflow's latest technology. I am continuously amazed by how fast and powerful the small strike team behind Starling is!

Likes: 5 · Replies: 0 · Reposts: 1

Jian Zhang

@jianzhangcs

2 years ago

Great work from Banghua Zhu and team: a top-ranking reward model on RewardBench from Ai2 and the new Starling beta for chat.

Likes: 2 · Replies: 0 · Reposts: 1

Jian Zhang

@jianzhangcs

2 years ago

🔥 Starling-7B Beta from @NexusflowX is climbing fast on Chatbot Arena, outperforming or rivaling larger models like Gemini Pro and Mixtral 8x7B, and is ranked as the #1 7B chat model. 🤔 While DBRX from Databricks presents a new strong open model, the story of Starling-7B Beta shows how

Likes: 10 · Replies: 0 · Reposts: 4

SambaNova Systems

@sambanovaai

2 years ago

🚀🌟🚀Excited to announce Samba-CoE v0.2, which outperforms DBRX by Databricks Mosaic Research and Databricks, Mixtral-8x7B from Mistral AI, and Grok-1 by Grok at a breakneck speed of 330 tokens/s. These breakthrough speeds were achieved without sacrificing precision and only on 8 sockets,

Likes: 373 · Replies: 24 · Reposts: 94

SambaNova Systems

@sambanovaai

2 years ago

Running at 430 tokens/second using full precision and 8 sockets, #Llama3 from AI at Meta is now available on SambaNova Platform: fast.snova.ai 🚀Get full 16-bit precision 🚀Spend on only 8 chips, not 576 chips for 430 tokens/second! Trim chips, not precision! Test

Likes: 123 · Replies: 5 · Reposts: 37

Jian Zhang

@jianzhangcs

a year ago

📢 New function calling and GenAI agent course launched in collaboration with Andrew Ng and DeepLearning.AI. Come and try out the tutorial built by Venkat and the @NexusflowX team! Hope you enjoy it.

Likes: 23 · Replies: 0 · Reposts: 5

Jian Zhang

@jianzhangcs

a year ago

📢 Excited to release the Athene-Llama3-70B chat LLM, delivering a new record on Arena Hard from @lmsys Chatbot Arena! 🔥 For the first time, open-weight models really breathe down the neck of Claude-3.5 and GPT-4o on Arena Hard. 🛠️ Athene-70B comes from @NexusflowX targeted

Likes: 18 · Replies: 2 · Reposts: 8

Yann LeCun

@ylecun

a year ago

Human specialists excel through education & experience. This applies to LLMs: post-training of free/open models can produce top performers on various benchmarks. Athene-70B, a fine-tuned version of Llama-3-70B, is ascending to the top of Arena-Hard. #OpenAlwaysWins

Likes: 194 · Replies: 10 · Reposts: 37

sankalp

@dejavucoder

a year ago

investors when you say agentic workflows instead of function calls in a while loop

Likes: 1.1K · Replies: 11 · Reposts: 93

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Chatbot Arena update! @NexusflowX's Athene-70B, an open-weight model fine-tuned from Llama-3-70B, is now ranked #8 on the leaderboard with a significant 30+ ELO boost from Llama-3. We see balanced multilingual capability and strong performance in hard prompts/coding. Congrats

Likes: 307 · Replies: 7 · Reposts: 36

AK

@_akhaliq

a year ago

Athene-V2: Advancing Beyond the Limits of Scaling with Targeted Post-training

Likes: 97 · Replies: 2 · Reposts: 22

Colin Kealty

@bartowski1182

a year ago

Did you see those shiny new Athene models from our friends at @NexusflowX? Good news, as usual you can run them right now in @lmstudio! :D Find them using the CLI tool 'lms get athene-v2-chat' or on huggingface as usual! huggingface.co/lmstudio-commu… huggingface.co/lmstudio-commu…

Likes: 27 · Replies: 1 · Reposts: 10

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Congrats Nexusflow on the latest Athene-V2-72B release, matching top models across hard benchmarks! Now comes the real test: Athene is live in Arena for human evaluation. Come ask tough prompts at lmarena.ai!

Likes: 148 · Replies: 4 · Reposts: 16

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Exciting update from Chatbot Arena!

Athene-V2-Chat-72B by Nexusflow debuts as the best open model, matching proprietary models like GPT-4o/Sonnet in technical domains (e.g., math, coding, hard prompts)!

Category ranking:
- Math: #3
- Coding: #7
- Hard Prompt: #6
- Overall: #10

Likes: 262 · Replies: 4 · Reposts: 54

Jiantao Jiao

@jiantaoj

2 months ago

🚀 We’re hiring at NVIDIA! Our team is pushing the frontier of LLM / DLM post-training and system optimization. We are looking for exceptional people with large-scale LLM + systems experience to join us (full time only). 🔹 Focus areas include: •Post-training of large models

Likes: 470 · Replies: 22 · Reposts: 35
