Krishna Mohan (@kmohan2006) 's Twitter Profile
Krishna Mohan

@kmohan2006

Denoising present to hopefully get brighter future | loves diffusion models

ID: 1794236443789078528

calendar_today25-05-2024 05:18:21

2,2K Tweet

2,2K Takipçi

273 Takip Edilen

fal (@fal) 's Twitter Profile Photo

🚨🚨🚨 Veo 3 by Google, world's most advanced video generation model, is now available FIRST on fal! Try it TODAY THIS IS NOT A DRILL! fal.ai/models/fal-ai/…

Sarvam AI (@sarvamai) 's Twitter Profile Photo

Today we’re announcing Sarvam-Translate, an open-weights model that translates text across 22 Indian languages, with support for long-form text and the ability to handle diverse formats, contexts, and styles. Sarvam-Translate stands out for its ability to handle the complexities

Today we’re announcing Sarvam-Translate, an open-weights model that translates text across 22 Indian languages, with support for long-form text and the ability to handle diverse formats, contexts, and styles.

Sarvam-Translate stands out for its ability to handle the complexities
The AI Timeline (@theaitimeline) 's Twitter Profile Photo

🚨This week's top AI/ML research papers: - Log-Linear Attention - Beyond the 80/20 Rule - Why Gradients Rapidly Increase Near the End of Training - How much do language models memorize? - General agents need world models - The Illusion of Thinking - MiMo-VL Technical Report -

🚨This week's top AI/ML research papers:

- Log-Linear Attention
- Beyond the 80/20 Rule
- Why Gradients Rapidly Increase Near the End of Training
- How much do language models memorize?
- General agents need world models
- The Illusion of Thinking
- MiMo-VL Technical Report
-
The AI Timeline (@theaitimeline) 's Twitter Profile Photo

🚨This week's top AI/ML research papers: - Self-Adapting Language Models - V-JEPA 2 - The Illusion of the Illusion of Thinking - Magistral - Reinforcement Pre-Training - VideoDeepResearch - Unsupervised Elicitation of LMs - CoRT - The Diffusion Duality - Ming-Omni - One

🚨This week's top AI/ML research papers:

- Self-Adapting Language Models
- V-JEPA 2
- The Illusion of the Illusion of Thinking
- Magistral
- Reinforcement Pre-Training
- VideoDeepResearch
- Unsupervised Elicitation of LMs
- CoRT
- The Diffusion Duality
- Ming-Omni
- One
puch.ai (@puch_ai) 's Twitter Profile Photo

*** Internship Opportunity *** Puch AI is hiring! Join us in building AI for Billions. 💰 Stipend: ₹1L/month 🗓️ Start Date: July 2025 📍 Location: Virtual 🚀 Career Path: PPOs for top performers Open Roles: 1. AI Engineer 2. Software Engineer 3. DevOps Engineer If you're

*** Internship Opportunity ***

Puch AI is hiring! Join us in building AI for Billions.

💰 Stipend: ₹1L/month
🗓️ Start Date: July 2025
📍 Location: Virtual
🚀 Career Path: PPOs for top performers

Open Roles:
1. AI Engineer
2. Software Engineer
3. DevOps Engineer

If you're
Unsloth AI (@unslothai) 's Twitter Profile Photo

We made a complete Guide on Reinforcement Learning for LLMs! Learn about: • RL's goal & why it's key to building intelligent AI agents • Why o3, Claude 4 & R1 use RL • GRPO, RLHF, DPO, reward functions • Training your own local R1 model via Unsloth 🔗docs.unsloth.ai/basics/reinfor…

We made a complete Guide on Reinforcement Learning for LLMs!

Learn about:
• RL's goal & why it's key to building intelligent AI agents
• Why o3, Claude 4 & R1 use RL
• GRPO, RLHF, DPO, reward functions
• Training your own local R1 model via Unsloth

🔗docs.unsloth.ai/basics/reinfor…