Yauhen Pyl (@yauhenpyl) 's Twitter Profile
Yauhen Pyl

@yauhenpyl

ID: 1597221277189767169

calendar_today28-11-2022 13:30:26

146 Tweet

9 Followers

111 Following

GitHub Projects Community (@githubprojects) 's Twitter Profile Photo

| ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄| !Push To Production On Friday| |______________| \ (•◡•) / \ / —— | | |_ |_

Qwen (@alibaba_qwen) 's Twitter Profile Photo

Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507! After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing

Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507!

After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing
Freddie Vargus (@freddie_v4) 's Twitter Profile Photo

today we're releasing a new small model (0.5B) for detecting problems with tool usage in agents, trained on 50M tokens from publicly available MCP server tools it's great at picking up on tool accuracy issues and outperforms larger models

Qwen (@alibaba_qwen) 's Twitter Profile Photo

🚀 Introducing Qwen3-MT – our most powerful translation model yet! Trained on trillions of multilingual tokens, it supports 92+ languages—covering 95%+ of the world’s population. 🌍✨ 🔑 Why Qwen3-MT? ✅ Top-tier translation quality ✅ Customizable: terminology control, domain

🚀 Introducing Qwen3-MT – our most powerful translation model yet!

Trained on trillions of multilingual tokens, it supports 92+ languages—covering 95%+ of the world’s population. 🌍✨

🔑 Why Qwen3-MT?
✅ Top-tier translation quality
✅ Customizable: terminology control, domain
Qwen (@alibaba_qwen) 's Twitter Profile Photo

🚀 We’re excited to introduce Qwen3-235B-A22B-Thinking-2507 — our most advanced reasoning model yet! Over the past 3 months, we’ve significantly scaled and enhanced the thinking capability of Qwen3, achieving: ✅ Improved performance in logical reasoning, math, science & coding

🚀 We’re excited to introduce Qwen3-235B-A22B-Thinking-2507 — our most advanced reasoning model yet!

Over the past 3 months, we’ve significantly scaled and enhanced the thinking capability of Qwen3, achieving:
✅ Improved performance in logical reasoning, math, science & coding
Qwen (@alibaba_qwen) 's Twitter Profile Photo

🚀 Qwen3-30B-A3B-Thinking-2507, a medium-size model that can think! • Nice performance on reasoning tasks, including math, science, code & beyond • Good at tool use, competitive with larger models • Native support of 256K-token context, extendable to 1M Qwen Chat: Go to

🚀 Qwen3-30B-A3B-Thinking-2507, a medium-size model that can think!

• Nice performance on reasoning tasks, including math, science, code & beyond
• Good at tool use, competitive with larger models
• Native support of 256K-token context, extendable to 1M

Qwen Chat: Go to
Hunyuan (@tencenthunyuan) 's Twitter Profile Photo

🚀We're expanding the Tencent Hunyuan open-source LLM ecosystem with four compact models (0.5B, 1.8B, 4B, 7B)! Designed for low-power scenarios like consumer-grade GPUs, smart vehicles, smart home devices, mobile phones, and PCs, these models support cost-effective fine-tuning

🚀We're expanding the Tencent Hunyuan open-source LLM ecosystem with four compact models (0.5B, 1.8B, 4B, 7B)! Designed for low-power scenarios like consumer-grade GPUs, smart vehicles, smart home devices, mobile phones, and PCs, these models support cost-effective fine-tuning
ElevenLabs (@elevenlabsio) 's Twitter Profile Photo

Introducing Eleven Music. The highest quality AI music model. - Complete control over genre, style, and structure - Multi-lingual, including English, Spanish, German, Japanese and more - Edit the sound and lyrics of individual sections or the whole song

Qwen (@alibaba_qwen) 's Twitter Profile Photo

Today we release the APIs of our Flash series, which support Qwen3-Coder and Qwen3-2507 now. Both APIs support the context length of 1M tokens. They are fast and accurate, and they are cost-effectve as well. Feel free to take a try! Qwen3-Coder-Flash Model Card:

Qwen (@alibaba_qwen) 's Twitter Profile Photo

🚀 Introducing Qwen3-4B-Instruct-2507 & Qwen3-4B-Thinking-2507 — smarter, sharper, and 256K-ready! 🔹 Instruct: Boosted general skills, multilingual coverage, and long-context instruction following. 🔹 Thinking: Advanced reasoning in logic, math, science & code — built for

🚀 Introducing Qwen3-4B-Instruct-2507 & Qwen3-4B-Thinking-2507 — smarter, sharper, and 256K-ready!

🔹 Instruct: Boosted general skills, multilingual coverage, and long-context instruction following.

🔹 Thinking: Advanced reasoning in logic, math, science & code — built for
Etienne Bernard (@etiennebcp) 's Twitter Profile Photo

We are releasing NuMarkdown-8B-Thinking, an open-source (MIT License) reasoning OCR model 🧠✨📄 NuMarkdown-8B-Thinking is apparently the first (!) reasoning VLM specialized in converting PDFs/Scans/Spreadsheets into Markdown files (typically used for RAG applications). It

We are releasing NuMarkdown-8B-Thinking, an open-source (MIT License) reasoning OCR model 🧠✨📄

NuMarkdown-8B-Thinking is apparently the first (!) reasoning VLM specialized in converting PDFs/Scans/Spreadsheets into Markdown files (typically used for RAG applications).

It
Qwen (@alibaba_qwen) 's Twitter Profile Photo

🚀 Qwen3-30B-A3B-2507 and Qwen3-235B-A22B-2507 now support ultra-long context—up to 1 million tokens! 🔧 Powered by: • Dual Chunk Attention (DCA) – A length extrapolation method that splits long sequences into manageable chunks while preserving global coherence. •

🚀 Qwen3-30B-A3B-2507 and Qwen3-235B-A22B-2507 now support ultra-long context—up to 1 million tokens!

🔧 Powered by:

• Dual Chunk Attention (DCA) –  A length extrapolation method that splits long sequences into manageable chunks while preserving global coherence.  

•
Qwen (@alibaba_qwen) 's Twitter Profile Photo

💡 You get 2,000 free Qwen Code runs every day! Run this one simple command: npx @​qwen-code/qwen-code@latest Hit Enter, and that’s it! 🚀 Now with Qwen OAuth support — super easy to use. Try it now and supercharge your vibe code! 💻⚡ Github:github.com/QwenLM/qwen-co…

💡 You get 2,000 free Qwen Code runs every day!

Run this one simple command:
npx @​qwen-code/qwen-code@latest
Hit Enter, and that’s it!
🚀 Now with Qwen OAuth support — super easy to use.
Try it now and supercharge your vibe code! 💻⚡
Github:github.com/QwenLM/qwen-co…
Deedy (@deedydas) 's Twitter Profile Photo

Huge computer science result: A Tsinghua professor JUST discovered the fastest shortest path algorithm for graphs in 40yrs. This improves on Turing award winner Tarjan’s O(m + nlogn) with Dijkstra’s, something every Computer Science student learns in college.

Huge computer science result:

A Tsinghua professor JUST discovered the fastest shortest path algorithm for graphs in 40yrs.

This improves on Turing award winner Tarjan’s O(m + nlogn) with Dijkstra’s, something every Computer Science student learns in college.
Mistral AI (@mistralai) 's Twitter Profile Photo

Mistral Document AI is now in Microsoft Azure AI Foundry! Ever struggled to extract useful info from PDFs, scans, or complex files? Our AI understands tables, charts, and even multilingual text — turning messy documents into clean, actionable data. Thank you, Microsoft. We’re

Mistral Document AI is now in <a href="/MicrosoftAzure/">Microsoft Azure</a> AI Foundry!

Ever struggled to extract useful info from PDFs, scans, or complex files? Our AI understands tables, charts, and even multilingual text — turning messy documents into clean, actionable data.

Thank you, <a href="/Microsoft/">Microsoft</a>. We’re