Murat 💾 (@muratyillmaz_) Twitter Tweets • TwiCopy

Murat 💾

5 months ago

A great Chinese film. It tells the story of a Java developer who loses their job because of their age and struggles to stay afloat. Having confirmed it with three different Chinese friends, I can say it sheds very good light on working conditions in China.

Likes: 1 | Replies: 0 | Reposts: 0

Murat 💾

@muratyillmaz_

5 months ago

I wrote a post comparing the Multi-Head, Multi-Query, and Grouped-Query Attention mechanisms in terms of memory consumption, and examining the difference DeepSeek's Multi-Head Latent Attention makes here from a KV-cache perspective. A Colab notebook is available. medium.com/p/attention-me…

Likes: 2 | Replies: 0 | Reposts: 0

Murat 💾

@muratyillmaz_

5 months ago

I would appreciate any feedback.

Likes: 1 | Replies: 0 | Reposts: 0

Murat 💾

@muratyillmaz_

5 months ago

A very interesting development.

Likes: 0 | Replies: 0 | Reposts: 0

Xin Eric Wang @ ICLR 2025

@xwang_lk

5 months ago

This precisely explains why llama 4 failed. Politicians won the politics game while real scientists struggled with computing resources.

Likes: 8 | Replies: 0 | Reposts: 2

Murat 💾

@muratyillmaz_

5 months ago

I'm so curious to see what they will come up with.

Likes: 0 | Replies: 0 | Reposts: 0

Murat 💾

@muratyillmaz_

4 months ago

youtube.com/live/eRALNkl8J…

Likes: 0 | Replies: 0 | Reposts: 0

Murat 💾

@muratyillmaz_

3 months ago

A nice approach, tried out by Microsoft Research China, that aims to standardize prompt writing for LLMs. Prompt Orchestration Markup Language: POML arxiv.org/pdf/2508.13948 github.com/microsoft/poml

Likes: 0 | Replies: 0 | Reposts: 0

机器之心 JIQIZHIXIN

@synced_global

2 months ago

Wow, a new post-training method. SFT = efficient but capped 🚦 RL = powerful but slow 🐢 Now enter: Guess-Think-Answer (GTA) GTA fuses guess (SFT), think (reflection), and answer (RL-shaped). Result: ⚡ Faster convergence than RL 📈 Higher ceiling than SFT 🛠️ Gradient

Likes: 337 | Replies: 7 | Reposts: 67

enise👩🏻‍💻

@enisebytes

a month ago

Sadly, the Turkish software community is always full of people who know a lot and do little.

Likes: 431 | Replies: 11 | Reposts: 19

alphaXiv

@askalphaxiv

a month ago

Introducing NotebookLM for arXiv papers 🚀 Transform dense AI research into an engaging conversation. With context across thousands of related papers, it captures motivations, draws connections to SOTA, and explains key insights like a professor who's read the entire field

Likes: 2.2K | Replies: 40 | Reposts: 390

vLLM

@vllm_project

a month ago

Announcing the completely reimagined vLLM TPU! In collaboration with Google, we've launched a new high-performance TPU backend unifying PyTorch and JAX under a single lowering path for amazing performance and flexibility. 🚀 What's New? - JAX + PyTorch: Run PyTorch models on

Likes: 969 | Replies: 17 | Reposts: 122