Karthik Valmeekam (@karthikv792) 's Twitter Profile
Karthik Valmeekam

@karthikv792

Currently telling stories about AI @ASU

ID: 1356538098

linkhttp://karthikv792.github.io calendar_today16-04-2013 10:26:08

350 Tweet

737 Followers

265 Following

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

👉 Our #NeurIPS2024 paper on Chain of Thoughtlessness @ 11AM poster session today (East Hall #3010, 11AM-2pm...). All three of us are here and looking forward to chat/answer qns.. 🙏

👉 Our #NeurIPS2024 paper on Chain of Thoughtlessness @ 11AM poster session today  (East Hall #3010, 11AM-2pm...). All three of us are here and looking forward to chat/answer qns.. 🙏
ASU School of Computing and Augmented Intelligence (@scai_asu) 's Twitter Profile Photo

Congrats to Karthik Valmeekam, a #PhD student working under the supervision of Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) in #SCAI, who has received an IBM PhD Fellowship Award from IBM Research. Given since 1951, the award recognizes research excellence in PhD students addressing areas of great technological interest.

Karthik Valmeekam (@karthikv792) 's Twitter Profile Photo

📢 DeepSeek-R1 on PlanBench 📢 DeepSeek-R1 gets similar performance as OpenAI’s o1 (preview)—achieving 96.6% on Blocksworld and 39.8% on its obfuscated version, Mystery BW. The best part? ⚡It’s 21x cheaper than o1-preview, offering similar results at a fraction of the cost!

📢 DeepSeek-R1 on PlanBench 📢

DeepSeek-R1 gets similar performance as OpenAI’s o1 (preview)—achieving 96.6% on Blocksworld and 39.8% on its obfuscated version, Mystery BW.

The best part? 

⚡It’s 21x cheaper than o1-preview, offering similar results at a fraction of the cost!
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

📢"On the Self-Verification Limitations of LLMs in Reasoning and Planning Tasks" arxiv.org/abs/2402.08115 with kstechly and Karthik Valmeekam apparently made it to #ICLR2025.. Swimming🏊 to Singapore.. 😎 [The 🧵s below give the details.. ]

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

On the use of Verifiers with LLMs--External vs. Internal LLM-Modulo (with kstechly, Karthik Valmeekam and @21st_Warlock ) We are a bit tickled that verifiers seem to be all the rage on the AI twitter, as we have been advocating use of external verifiers of various

On the use of Verifiers with LLMs--External vs. Internal LLM-Modulo (with <a href="/kayastechly/">kstechly</a>,  <a href="/karthikv792/">Karthik Valmeekam</a> and @21st_Warlock )

We are a bit tickled that verifiers seem to be all the rage on the AI twitter, as we have been advocating use of external verifiers of various
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

On the MDP formulation of LLMs used in R1 [Not quite #SundayHarangue] Everyone knows that R1 is using RL on LLMs. Most also know that RL is done on an underlying MDP formulation. Not everyone might have grokked the rather strange nature of the MDP formulation used. We had a fun

On the MDP formulation of LLMs used in R1 [Not quite #SundayHarangue]

Everyone knows that R1 is using RL on LLMs. Most also know that RL is done on an underlying MDP formulation. Not everyone might have grokked the rather strange nature of the MDP formulation used. We had a fun
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

Back in September, when we evaluated o1, and used LRM (Large Reasoning Model) to refer to that type of models, there was some pushback. Now looking at many arXiv ICML submissions analyzing these beasts, I see that LRM is becoming pretty standard usage. That's so fetch.. 😋

Sarath Sreedharan (@sarath_ssreedh) 's Twitter Profile Photo

[1/3] I am incredibly humbled and honored to be selected as one of IEEE AI's 10 to watch. However, this wouldn't have been possible without my wonderful collaborators. To start with, I will always be thankful that Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) agreed to work with me.

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

Our invited commentary for the Annals of NYAS titled "(How) Do reasoning models reason?" is now online 👉 nyaspubs.onlinelibrary.wiley.com/doi/epdf/10.11… It is a written version of my recent talks (and #SundayHarangues) on the recent developments in LRMs.. (w/ kstechly & Karthik Valmeekam )

Our invited commentary for the Annals of <a href="/NYASciences/">NYAS</a> titled "(How) Do reasoning models reason?" is now online 

👉 nyaspubs.onlinelibrary.wiley.com/doi/epdf/10.11…

It is a written version of my recent talks (and #SundayHarangues) on the recent developments in LRMs.. 

(w/ <a href="/kayastechly/">kstechly</a> &amp; <a href="/karthikv792/">Karthik Valmeekam</a> )
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

PSA for #ICLR2025 authors frantically making posters: Stop worrying! Just prompt o3 and GPT-4o and you will have an AGI-ready poster in seconds! Here is one kstechly and Karthik Valmeekam cooked up--and it looks fully legit from poster distance!

PSA for #ICLR2025 authors frantically making posters: Stop worrying! Just prompt o3 and GPT-4o and you will have an AGI-ready poster in seconds! Here is one <a href="/kayastechly/">kstechly</a> and <a href="/karthikv792/">Karthik Valmeekam</a> cooked up--and it looks fully legit from poster distance!
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

So kstechly 🎓 from Arizona State University yesterday. Kaya was a rather unusual Yochanite-- didn't take any courses or do theses with me; and wasn't even in a CS degree! She was ever present in lab, and lunches though--and a force in all group meetings and many papers. She will be missed..

So <a href="/kayastechly/">kstechly</a> 🎓 from <a href="/ASU/">Arizona State University</a> yesterday. Kaya was a rather unusual Yochanite-- didn't take any courses or do theses with me; and wasn't even in a CS degree!

She was ever present in lab, and lunches though--and a force in all group meetings and many papers. 

She will be missed..
Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

Some of what that recent Apple LRM limitations paper shows is known (pardon my friendly Schmidhubering; I do welcome more LLM studies with scientific skepticism). Our study 👇 from Sep 2024 shows o1 accuracy degrading as complexity increases.. 1/ x.com/rao2z/status/1…