Junyang Lin(@JustinLin610) 's Twitter Profileg
Junyang Lin

@JustinLin610

Chief Evangelist Officer of Qwen Team & OpenDevin, building LLM and LMM. Now @Alibaba_Qwen . Previously @PKU1898 LANCO group. ❤️ 🍵 ☕️ 🍷 🥃

ID:4473952878

linkhttps://www.linkedin.com/in/junyang-lin-0b2b38151/ calendar_today06-12-2015 10:28:42

1,1K Tweets

5,0K Followers

1,4K Following

Eric Hartford(@erhartford) 's Twitter Profile Photo

Dolphin-2.9-8x22b is in the oven.
fft, deepspeed zero3 param offload, 8k sequence, half the layers are targeted.
This is a significantly improved, filtered dataset. Function calling, agentic, math, dolphin and dolphin-coder.

Dolphin-2.9-8x22b is in the oven. fft, deepspeed zero3 param offload, 8k sequence, half the layers are targeted. This is a significantly improved, filtered dataset. Function calling, agentic, math, dolphin and dolphin-coder.
account_circle
Aran Komatsuzaki(@arankomatsuzaki) 's Twitter Profile Photo

Google presents Best Practices and Lessons Learned on Synthetic Data for Language Models

Provides an overview of synthetic data research, discussing its applications, challenges, and future directions

arxiv.org/abs/2404.07503

Google presents Best Practices and Lessons Learned on Synthetic Data for Language Models Provides an overview of synthetic data research, discussing its applications, challenges, and future directions arxiv.org/abs/2404.07503
account_circle
Omar Sanseviero(@osanseviero) 's Twitter Profile Photo

Welcome Zephyr 141B to Hugging Chat🔥

🎉A Mixtral-8x22B fine-tune
⚡️Super fast generation with TGI
🤗Fully open source (from the data to the UI)

huggingface.co/chat/models/Hu…

Welcome Zephyr 141B to Hugging Chat🔥 🎉A Mixtral-8x22B fine-tune ⚡️Super fast generation with TGI 🤗Fully open source (from the data to the UI) huggingface.co/chat/models/Hu…
account_circle
Junyang Lin(@JustinLin610) 's Twitter Profile Photo

Wow this is new to me! But I have been confident in my model's French but didn't expect it to be somewhat top level.

account_circle
Junyang Lin(@JustinLin610) 's Twitter Profile Photo

Wow I really love this sub arena! It provides a more comprehensive eval for sure! Btw what makes me surprised is our model perf in French (my second favorite language) LGTM!🥰

account_circle
Tianbao Xie(@TianbaoX) 's Twitter Profile Photo

🤔Can we assess agents across various apps & OS w.o. crafting new envs?

OSWorld🖥️: A unified, real computer env for multimodal agents to evaluate open-ended computer tasks with arbitrary apps and interfaces on Ubuntu, Windows, & macOS.

+ annotated 369 real-world computer tasks…

account_circle
William Fedus(@LiamFedus) 's Twitter Profile Photo

Our improved model in the arena at lmsys and we’ve rolled out to ChatGPT users today — stay tuned for better versions to come

account_circle
Vasek Mlejnsky(@mlejva) 's Twitter Profile Photo

I've been working on integrating E2B to OpenDevin from Junyang Lin and I'm pretty excited where the open source community is heading

The open-source future looks bright

account_circle
Jim Fan(@DrJimFan) 's Twitter Profile Photo

The moat of software AI agents is not the thin wrapper layer (Devin, SWE-Agent), but the underlying LLM. Instead of benchmarking the wrapper, I think SWE-Bench is excellent for evaluating coding LLMs instead:

Hold the agent layer fixed and vary only the LLM backend. Provide all…

The moat of software AI agents is not the thin wrapper layer (Devin, SWE-Agent), but the underlying LLM. Instead of benchmarking the wrapper, I think SWE-Bench is excellent for evaluating coding LLMs instead: Hold the agent layer fixed and vary only the LLM backend. Provide all…
account_circle
Wenhu Chen(@WenhuChen) 's Twitter Profile Photo

Check out our recent paper on Music Pretraining with Transformers. This is a teamwork by lots of awesome collaborators at different institutions.

account_circle
nisten(@nisten) 's Twitter Profile Photo

God bless Justine Tunney's llamacpp kernels,
Mixtral8x22b running CPU ONLY at ~9 tokens per sec.
Yep that's GPT4 class AI.

I'll push out cpu-optimized 4bit/8bit EdgeQuants after benchmarking.

account_circle
Junyang Lin(@JustinLin610) 's Twitter Profile Photo

This month is crazy. We just opensourced two models in these two to three weeks but we are already behind now. gotta do something man🥹

account_circle
Graham Neubig(@gneubig) 's Twitter Profile Photo

Check out our new method for evaluating the quality of generated images, VQAScore! It's simple, runs locally, and is relatively good at evaluation.

account_circle
Vaibhav (VB) Srivastav(@reach_vb) 's Twitter Profile Photo

IT WORKS! Running Mixtral 8x22B with Transformers! 🔥

Running on a DGX (4x A100 - 80GB) with CPU offloading 🤯

account_circle