kstechly (@kayastechly) Twitter Tweets • TwiCopy

kstechly

@kayastechly

Linguistics M.A. at ASU working in the Yochan lab.

ID: 1707282194576920576

Link: http://kstechly.github.io

Date: 28-09-2023 06:32:58

6 Tweets

145 Followers

59 Following

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

a year ago

One paper, led by kstechly (w/ Matthew Marquez), evaluated the claims over a suite of graph coloring problems. The setup allows for GPT4 guessing a valid coloring in standalone and self-critiquing modes. There is an external sound verifier outside the self-critiquing loop. 2/

Likes: 35 · Replies: 1 · Retweets: 3

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

a year ago

📢 Check out these posters on LLM Self-Critiquing (in)abilities in reasoning and planning tasks, being presented at the #NeurIPS2023 "Foundation Models for Decision Making" workshop today (12/15) by Yochanites Karthik Valmeekam and kstechly in Hall E2.

Likes: 18 · Replies: 0 · Retweets: 6

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

8 months ago

📢 "On the self-verification limitations of LLMs in Reasoning and Planning Tasks" arxiv.org/abs/2402.08115 (led by Karthik Valmeekam and kstechly) 👇 Investigates LLM self-verification in three formal benchmarks: Game of 24, Graph Coloring, and Planning, and shows that accuracy

Likes: 113 · Replies: 4 · Retweets: 38

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

7 months ago

Two Yochanites drove 15 hours to Austin city limits and saw this.. 🤗

Likes: 10 · Replies: 0 · Retweets: 1

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

2 months ago

A research note describing our evaluation of the planning capabilities of o1 🍓 is now on arXiv.org arxiv.org/abs/2409.13373 (thanks to Karthik Valmeekam & kstechly). As promised, here is a summary (..although you should read the whole thing..) 🧵 1/

Likes: 609 · Replies: 17 · Retweets: 107

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)

@rao2z

2 months ago

Woo hoo.. Chain of Thoughtlessness paper will be showing up at #NeurIPS2024 🤗 Congrats to Karthik Valmeekam and kstechly [details below 👇]

Likes: 91 · Replies: 3 · Retweets: 13