Kai Zhang (@kaizhang9546)'s Twitter Profile

Research Scientist at Adobe pondering 3D's role in the AGI story. Opinions are my own.

ID: 1346885093361471489

Joined: 06-01-2021 18:23:38

115 Tweets

770 Followers

259 Following

AI at Meta (@aiatmeta):

πŸ“ New from FAIR: An Introduction to Vision-Language Modeling. Vision-language models (VLMs) are an area of research that holds a lot of potential to change our interactions with technology, however there are many challenges in building these types of models. Together with a set

πŸ“ New from FAIR: An Introduction to Vision-Language Modeling.

Vision-language models (VLMs) are an area of research that holds a lot of potential to change our interactions with technology; however, there are many challenges in building these types of models. Together with a set
Ruiqi Gao (@ruiqigao):

1-step distillation for diffusion remains challenging, sometimes vulnerable to mode collapse.

🤯 Check out our new work: EM Distillation (EMD) to tackle this! Competitive results on ImageNet 64x64, 128x128 and Stable Diffusion.

arxiv.org/abs/2405.16852

Led by the brilliant Sirui Xie
Rohan Paul (@rohanpaul_ai):

Nice paper surveying Multimodal AI Architectures -- with a comprehensive taxonomy and analysis of their pros/cons & applications in any-to-any modality model development

📌 Comprehensive Taxonomy: First work to explicitly identify and categorize four broad
MrNeRF (@janusch_patas):

Relighting Any Object via Diffusion with Neural Gaffer
Paper: arxiv.org/abs/2406.07520
Project: neural-gaffer.github.io
- End-to-end 2D relighting diffusion model that accurately relights any object in a single image under various unseen lighting conditions.
- Supports other

Rohan Paul (@rohanpaul_ai):

Today, along with 4 other models, AI at Meta released Chameleon: 7B & 34B language models.

This is based on AI at Meta's brilliant paper released in May 2024.

"Chameleon: Mixed-Modal Early-Fusion Foundation Models" 🔥

👨‍🔧 The Problem this paper solves:

Chameleon tackles the key
Alex Dimakis (@alexgdimakis):

This paper seems very interesting: say you train an LLM to play chess using only transcripts of games from players up to 1000 Elo. Is it possible that the model plays better than 1000 Elo (i.e., "transcends" the training data's performance)? It seems you get something from nothing,
lmsys.org (@lmsysorg):

Exciting news: Chatbot Arena now supports image uploads 📸 Challenge GPT-4o, Gemini, Claude, and LLaVA with your toughest questions. Plot-to-code, VQA, storytelling, you name it. Let's get creative and have fun! Leaderboard coming soon. Credits to builders Christopher Chou

Gene Chou (@gene_ch0u):

Introducing MegaScenes, a scene-level dataset containing 100K SfM reconstructions and 2M images with open content licenses. We validate its effectiveness by training large-scale, generalizable models on the task of novel view synthesis. (1/N)
Project page: megascenes.github.io

Pedro Cuenca (@pcuenq):

Optimized Depth Anything V2 for Apple Neural Engine is out! It's a huge step up from V1. Here's the small Core ML version running on my iPhone (right), compared with the previous version (left). Amazed by the fine details!

Rohan Paul (@rohanpaul_ai):

Give 'vision' capability to all of your local LLMs using the power of the Open Interpreter tool 🤯 Check out this video by killian, a really 'WOW' example. Open Interpreter is a fully open-source tool that lets LLMs run code locally (Python, JavaScript, Shell, and more) is

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr):

Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

abs: arxiv.org/abs/2406.13121
code: github.com/google-deepmin…

New paper from Google DeepMind introducing the LOFT benchmark. LOFT consists of 6 long-context task categories spanning retrieval, multi-hop
James Yeung (@jamesyeung18):

πŸ“½οΈ8 new videos I made on Runway GEN-3 with prompts for each video in this post. Which one is your favourite? (No. 7 is my favourite) πŸ‘‡ 1. A drone shot of a police car travelling on a swirling road covered with snow at midnight, cinematic, very dark environment, roads only

Andrej Karpathy (@karpathy):

I feel like I have to once again pull out this figure. These 32x32 texture patches were state-of-the-art image generation in 2017 (7 years ago). What does it look like for Gen-3 and friends to look similarly silly 7 years from now?
Jon Barron (@jon_barron):

The legendary Ross Girshick just posted his CVPR workshop slides about the 1.5 decades he spent ~solving object detection as it relates to the ongoing LLM singularity. Excellent read, highly recommended. drive.google.com/file/d/1VodGlj…

Sara Rojas Martinez (@sarisro):

Exciting news! 🎉 My paper got accepted at #ECCV2024! Huge thanks to my Adobe and KAUST collaborators!
💌 DATENeRF: Depth-Aware Text-based Editing of NeRFs 💌
Sara Rojas Martinez, Julien Philip, Kai Zhang, Sai Bi, Fujun Luan, Bernard Ghanem, Kalyan Sunkavalli
datenerf.github.io/DATENeRF/

Haider. (@slow_developer):

OpenAI Co-founder Andrej Karpathy explains the new computing paradigm: "We're entering a new computing paradigm with large language models acting like CPUs, using tokens instead of bytes, and having a context window instead of RAM. This is the Large Language Model OS (LMOS)"

Jonathan Granskog (@jongranskog):

I suspect the traditional 3D pipeline for offline rendering will over time be replaced by generative models guided largely by primitive 3D scenes and generated parts. Most of the control can be achieved in 3D but fidelity comes from consistent 2D generation.

NAVER LABS Europe (@naverlabseurope):

The wait is over 📢 MAST3R is out! DUSt3R + dense local feature maps & metric depth. 1st on the #MapFreeReloc leaderboard, and it can handle 1000s of images 😀!!
Blog: shorturl.at/9JTH2
Code: github.com/naver/mast3r
Paper: arxiv.org/abs/2406.09756