Vivek Gupta (@keviv9) 's Twitter Profile
Vivek Gupta

@keviv9

Assistant Professor @SCAI_ASU; PostDoc @cogcomp @Penn, ed-@UUtah,@iitkanpur. @Bloomberg @MSFTResearch Fellow; ex-@MetaAI @IBM @Verisk @samsungresearch @Synopsys

ID: 446867959

linkhttps://vgupta123.github.io/ calendar_today26-12-2011 07:23:16

2,2K Tweet

2,2K Followers

5,5K Following

Kyunghyun Cho (@kchonyc) 's Twitter Profile Photo

what do you teach at the first-year graduate course on machine learning, in this era of LLM and large-scale compute? here's my experiment on answering this question: let's teach everything that admits SGD and that is not LLM, and ask students to read old papers.

what do you teach at the first-year graduate course on machine learning, in this era of LLM and large-scale compute? here's my experiment on answering this question: let's teach everything that admits SGD and that is not LLM, and ask students to read old papers.
Vivek Gupta (@keviv9) 's Twitter Profile Photo

🎉 #ACL2025NLP 🎉 CoRAL Lab ASU School of Computing and Augmented Intelligence, ASU Ira A. Fulton Schools of Engineering has 4 papers accepted at ACL 2025! 🙌 All thanks 😊 🙏 and congratulations 🎉 🎊 to amazing students authors !! ✅ ACL-Main: 📌 GETReason – Enhancing Image Context Extraction through Hierarchical Multi-Agent

Vivek Gupta (@keviv9) 's Twitter Profile Photo

🎉 #ACL2025NLP – Cherry on Top! 🎉 In addition to our conference papers, we’re thrilled to share that CoRAL also got a demo paper accepted: PRAISE: Enhancing Product Descriptions with LLM-Driven Structured Insights - Adnan Qidwai, Srija Mukhopadhyay, Prerana Khatiwada,

AshutoshShrivastava (@ai_for_success) 's Twitter Profile Photo

This week in AI - Google Jules - Google Veo-3 - Google Flow AI - Gemini native audio - Gemma 3n on-device AI - Claude Sonnet 4 & Opus 4 - Anthropic Clode Code Agent - Mistral open-source Devstral - Microsoft GitHub Copilot agent Kepping up with AI news is 😂

This week in AI 
- Google Jules
- Google Veo-3
- Google Flow AI
- Gemini native audio
- Gemma 3n on-device AI 
- Claude Sonnet 4 & Opus 4 
- Anthropic Clode Code Agent 
- Mistral open-source Devstral
- Microsoft GitHub Copilot agent

Kepping up with AI news is 😂
Google DeepMind (@googledeepmind) 's Twitter Profile Photo

We're thrilled to announce SignGemma, our most capable model for translating sign language into spoken text. 🧏 This open model is coming to the Gemma model family later this year, opening up new possibilities for inclusive tech. Share your feedback and interest in early

AAAI (@realaaai) 's Twitter Profile Photo

➡️ AAAI-26 Workshops: Call for Proposals ⬅️ The AAAI-26 Workshop co-chairs encourage the submission of proposals for interdisciplinary workshops. Proposals that focus on important or novel application domains for AI are also encouraged. bit.ly/4dAhJsp

ASU School of Computing and Augmented Intelligence (@scai_asu) 's Twitter Profile Photo

🎓✨From "Namaste" to "Hello the American way," #SCAI Associate Professor Srividya Bansal brought the house(and hearts!) down at this year’s Arizona State University International Student Graduation Celebration with a speech as insightful as it was inspiring. See the speech: youtube.com/watch?v=6jh678…

Alham Fikri Aji (@alhamfikri) 's Twitter Profile Photo

Research isn’t always about chasing SOTA. It’s about understanding why things work (or don’t) and pushing the boundaries of our knowledge Negative results like this are still valuable and essential for the community Looking forward to the next banger!

Vaishnavh Nagarajan (@_vaishnavh) 's Twitter Profile Photo

📢 New paper on creativity & multi-token prediction! We design minimal open-ended tasks to argue: → LLMs are limited in creativity since they learn to predict the next token → creativity can be improved via multi-token learning & injecting noise ("seed-conditioning" 🌱) 1/ 🧵

📢 New paper on creativity & multi-token prediction! We design minimal open-ended tasks to argue:

→ LLMs are limited in creativity since they learn to predict the next token

→ creativity can be improved via multi-token learning & injecting noise ("seed-conditioning" 🌱) 1/ 🧵
Ximing Lu (@gximing) 's Twitter Profile Photo

What happens when you ✨scale up RL✨? In our new work, Prolonged RL, we significantly scale RL training to >2k steps and >130k problems—and observe exciting, non-saturating gains as we spend more compute 🚀.

What happens when you ✨scale up RL✨? In our new work, Prolonged RL, we significantly scale RL training to >2k steps and >130k problems—and observe exciting, non-saturating gains as we spend more compute 🚀.
Atsuyuki Miyai @UTokyo (@atsumiyaiam) 's Twitter Profile Photo

🧙‍♂️ Imagine web agents that don’t just browse but handle your tedious digital chores! 📣 Our team developed WebChoreArena - 532 human-curated tasks, crafted over 300+ hours - Tests agents on massive information memorization, mathematical reasoning, and long-term memory -

🧙‍♂️ Imagine web agents that don’t just browse but handle your tedious digital chores! 

📣 Our team developed WebChoreArena 
- 532 human-curated tasks, crafted over 300+ hours 
- Tests agents on massive information memorization, mathematical reasoning, and long-term memory 
-
Chau Minh Pham (@chautmpham) 's Twitter Profile Photo

🤔 What if you gave an LLM thousands of random human-written paragraphs and told it to write something new -- while copying 90% of its output from those texts? 🧟 You get what we call a Frankentext! 💡 Frankentexts are surprisingly coherent and tough for AI detectors to flag.

🤔 What if you gave an LLM thousands of random human-written paragraphs and told it to write something new -- while copying 90% of its output from those texts?

🧟 You get what we call a Frankentext!

💡 Frankentexts are surprisingly coherent and tough for AI detectors to flag.
Botao Yu (@botaoyu24) 's Twitter Profile Photo

🔬 Introducing ChemMCP, the first MCP-compatible toolkit for empowering AI models with advanced chemistry capabilities! In recent years, we’ve seen rising interest in tool-using AI agents across domains. Particularly in scientific domains like chemistry, LLMs alone still fall

Shiv Malik (@shivmalik) 's Twitter Profile Photo

Jen Zhu Fascinating! Made you a 20 min deep dive podcast about this! Cracking the 3D Kakeya Conjecture: Prof. Wang's Groundbreaking Breakthrough, pathaka.ai/podcast/previe…

Yu Su @#ICLR2025 (@ysu_nlp) 's Twitter Profile Photo

📈 Scaling may be hitting a wall in the digital world, but it's only beginning in the biological world! We trained a foundation model on 214M images of ~1M species (50% of named species on Earth 🐨🐠🌻🦠) and found emergent properties capturing hidden regularities in nature. 🧵

📈 Scaling may be hitting a wall in the digital world, but it's only beginning in the biological world!

We trained a foundation model on 214M images of ~1M species (50% of named species on Earth 🐨🐠🌻🦠) and found emergent properties capturing hidden regularities in nature.

🧵
Prateek Jain (@jainprateek_) 's Twitter Profile Photo

We are hiring Research Scientists for our Machine Learning and Optimization team at Google DeepMind Bangalore. If you're passionate about cutting-edge AI research and building efficient, elastic, customized, and safe LLMs, we'd love to hear from you. We are looking for

Pan Lu (@lupantech) 's Twitter Profile Photo

Do LLMs truly understand math proofs, or just guess? 🤔Our new study on #IneqMath dives deep into Olympiad-level inequality proofs & reveals a critical gap: LLMs are often good at finding answers, but struggle with rigorous, sound proofs. ➡️ ineqmath.github.io To tackle

Do LLMs truly understand math proofs, or just guess? 🤔Our new study on #IneqMath dives deep into Olympiad-level inequality proofs & reveals a critical gap: LLMs are often good at finding answers, but struggle with rigorous, sound proofs.

➡️ ineqmath.github.io

To tackle
机器之心 JIQIZHIXIN (@synced_global) 's Twitter Profile Photo

Finally, A survey of Deep Research! This 95-page survey, from Zhejiang University, examines the rapidly evolving field of Deep Research systems—AI-powered applications that automate complex research workflows through the integration of large language models, advanced information

Finally, A survey of Deep Research!

This 95-page survey, from Zhejiang University, examines the rapidly evolving field of Deep Research systems—AI-powered applications that automate complex research workflows through the integration of large language models, advanced information