Vuk Rosić (@vukrosic99) Twitter Tweets • TwiCopy

Vuk Rosić

@vukrosic99


🤖 AI Research Scientist
📊 Agents, LLMs, inference-time (test-time) scaling, RLHF...
🤝 Solve math and understand the universe
🧑‍🎓 I'm learning Chinese; feel free to chat with me in Chinese anytime

ID: 2234985075

Link: https://www.youtube.com/channel/UC7XJj9pv_11a11FUxCMz15g

Joined: 20-12-2013 16:28:24

150 Tweets

21 Followers

340 Following

Vuk Rosić

@vukrosic99

6 months ago

DeepSeek INFINITE Context Window - Encode Text As Images - DeepSeek OCR 📝➡️🖼️ Screenshot massive amounts of text and process them like images: a theoretically infinite context window. yt - youtu.be/bxTkOCv7SGM bilibili - bilibili.com/video/BV1p4WSz…

Likes: 1, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

Build 10,000 GPU Cluster - TikTok's LLM Training - New Paper by ByteDance explaining their GPU cluster yt - youtu.be/MAgURs2iFCQ bilibili - bilibili.com/video/BV1kvW9z…

Likes: 3, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

Master RMSNorm From Scratch - Step by Step Tutorial. Used in LLMs, Transformers... an extremely popular normalization, but how does it work? youtube - youtu.be/HgSdYtPgJnU bilibili - bilibili.com/video/BV1zxWoz…

Likes: 3, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

13x FASTER Video Generation - Sparse Linear Attention Transformer yt - youtu.be/SMNswPiU8go bilibili - bilibili.com/video/BV15Esaz… arxiv - arxiv.org/pdf/2509.24006

Likes: 5, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

Explaining the new paper 'Definition of AGI' YouTube - youtu.be/7AWl-EqsD8w Bilibili - bilibili.com/video/BV1zesvz… paper - agidefinition.ai/paper.pdf

Likes: 3, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

My new video is going viral - Build 10,000 GPU Cluster - TikTok's LLM Training - New Paper - youtu.be/MAgURs2iFCQ

Likes: 3, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

just crossed 10k subscribers on youtube, with a lot of new videos about AI papers and code (LLMs, Diffusion, video generation) - youtube.com/channel/UC7XJj…

Likes: 3, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

new best open-source model, MiniMax M2, coming Oct 27

Likes: 11, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

thoughts?

Likes: 3, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

I've started spending the whole day making videos on AI research / math and doing nothing else. Check out the new videos here - youtube.com/@vukrosic/vide…

Likes: 10, Replies: 0, Reposts: 1

Vuk Rosić

@vukrosic99

6 months ago

A decaying learning rate prevents you from training an LLM indefinitely. This is how you can replace learning rate decay with checkpoint merging - youtube.com/watch?v=Z5kEG7…

Likes: 4, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

New video - ADAM Optimizer Explained Step by Step - First & Second Moment, Zero Bias Correction youtu.be/5tygLBoN_io

Likes: 2, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

LATENT Thinking LLM by TikTok parent ByteDance - Looped Transformer Thinking - New Paper Explained YouTube - youtu.be/SCvo_pO35eg bilibili, with Chinese subtitles (Latent Thinking LLM - Looped Transformer Thinking - New Paper Explained) - bilibili.com/video/BV1h21JB…

Likes: 3, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

so like how do I know what the best thing to research is 😂😂😂😂😂😂😂😂😂

Likes: 1, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

I'm hearing Kimi K2 Thinking closed the gap even more with GPT and Claude, and surpassed them in some areas, which I didn't expect. I thought open source would be stuck a bit behind. I wonder if it was Su Jianlin's cooking.

Likes: 1, Replies: 0, Reposts: 0

Vuk Rosić

@vukrosic99

6 months ago

Moonshot AI (月之暗面)

Likes: 0, Replies: 0, Reposts: 0