Jian Zhang (@jianzhangcs)'s Twitter Profile
Jian Zhang

@jianzhangcs

Co-founder, CTO & VP Engineering at @NexusflowX | Ex-Director of Machine Learning at @SambaNovaAI | PhD in machine learning at @Stanford

ID: 877293859528622080

Link: https://www.linkedin.com/in/jian-zhang-10383a98/
Joined: 20-06-2017 22:35:30

95 Tweets

384 Followers

230 Following

Jian Zhang (@jianzhangcs) 's Twitter Profile Photo

Last mile quality & robustness (the high-hanging fruits) is going to be the test bed for enterprise LLM players in 2024.

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

e/ia - Intelligence Amplification - Does not seek to build superintelligent God entity that replaces humans. - Builds “bicycle for the mind” tools that empower and extend the information processing capabilities of humans. - Of all humans, not a top percentile. - Faithful to

Jian Zhang (@jianzhangcs) 's Twitter Profile Photo

🤖 It was a great conversation with Ben Lorica 罗瑞卡 to exchange thoughts on GenAI copilot & agents for cybersecurity and enterprise workflow in general! ⛰ At @NexusflowX, we are very excited to build the Gen AI foundations for enterprise workflow agents, owned, controllable and

Jian Zhang (@jianzhangcs) 's Twitter Profile Photo

Through many customer conversations, we discovered the top adoption criterion of GenAI agents to be the reliability to accomplish tasks with minimal hallucinations. 📢 Check out how @NexusflowX leverages a new agent building paradigm, with extractive reasoning as the first-class

Jian Zhang (@jianzhangcs) 's Twitter Profile Photo

Very exciting long context length multi modality capability from Google. The full repo code analysis part is pretty amazing!

Jian Zhang (@jianzhangcs) 's Twitter Profile Photo

📢 Exciting release of Starling-7B-beta chat model and Starling-34B-RM reward model, powered by Nexusflow's latest technology. I am continuously amazed by how fast and powerful the small strike team behind Starling is!

Jian Zhang (@jianzhangcs) 's Twitter Profile Photo

🔥 Starling-7B Beta from @NexusflowX climbing fast on Chatbot Arena, outperforming or rivaling larger models like Gemini Pro and Mixtral 8x7B, and ranked as the #1 7B chat model. 🤔 While DBRX from Databricks presents a new strong open model, the story of Starling-7B Beta shows how

SambaNova Systems (@sambanovaai) 's Twitter Profile Photo

🚀🌟🚀Excited to announce Samba-CoE v0.2, which outperforms DBRX by Databricks Mosaic Research and Databricks, Mixtral-8x7B from Mistral AI, and Grok-1 by Grok at a breakneck speed of 330 tokens/s. These breakthrough speeds were achieved without sacrificing precision and only on 8 sockets,

SambaNova Systems (@sambanovaai) 's Twitter Profile Photo

Running at 430 tokens/second using full precision and 8 sockets, #Llama3 from AI at Meta is now available on SambaNova Platform: fast.snova.ai 🚀Get full 16-bit precision 🚀Spend on only 8 chips, not 576 chips for 430 tokens/second! Trim chips, not precision! Test

Jian Zhang (@jianzhangcs) 's Twitter Profile Photo

📢 New function calling and GenAI agent course launched in collaboration with Andrew Ng and DeepLearning.AI. Come and try out the tutorial built by Venkat and @NexusflowX team! Hope you enjoy it.

Jian Zhang (@jianzhangcs) 's Twitter Profile Photo

📢 Excited to release Athene-Llama3-70B chat LLM, delivering new record on Arena Hard from @lmsys Chatbot Arena! 🔥For the first time, open-weight models really breathe down the neck of Claude-3.5 and GPT-4o on Arena Hard. 🛠️Athene-70B comes from @NexusflowX targeted

Yann LeCun (@ylecun) 's Twitter Profile Photo

Human specialists excel through education & experience. This applies to LLMs: post-training of free/open models can produce top performers on various benchmarks. Athene-70B, a fine-tuned version of Llama-3-70B, is ascending to the top of Arena-Hard. #OpenAlwaysWins

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Chatbot Arena update! @NexusflowX's Athene-70B, an open-weight model fine-tuned from Llama-3-70B, is now ranked #8 on the leaderboard with a significant 30+ ELO boost from Llama-3. We see balanced multilingual capability and strong performance in hard prompts/coding. Congrats

Colin Kealty (@bartowski1182) 's Twitter Profile Photo

Did you see those shiny new Athene models from our friends at @NexusflowX? Good news, as usual you can run them right now in @lmstudio! :D Find them using the CLI tool 'lms get athene-v2-chat' or on huggingface as usual! huggingface.co/lmstudio-commu… huggingface.co/lmstudio-commu…

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Congrats Nexusflow on the latest Athene-V2-72B release, matching top models across hard benchmarks! Now comes the real test: Athene is live in Arena for human evaluation. Come ask tough prompts at lmarena.ai!

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Exciting update from Chatbot Arena! Athene-V2-Chat-72B by Nexusflow debuts as the best open model, matching proprietary models like GPT-4o/Sonnet in technical domains (e.g., math, coding, hard prompts)! Category ranking: - Math: #3 - Coding: #7 - Hard Prompt: #6 - Overall #10

Jiantao Jiao (@jiantaoj) 's Twitter Profile Photo

🚀 We’re hiring at NVIDIA! Our team is pushing the frontier of LLM / DLM post-training and system optimization. We are looking for exceptional people with large-scale LLM + systems experience to join us (full time only). 🔹 Focus areas include: •Post-training of large models