Harsh Trivedi (@harsh3vedi) 's Twitter Profile
Harsh Trivedi

@harsh3vedi

πŸ€– Building AI agents & interactive environments: 🌍 AppWorld (appworld.dev) #NLProc PhD @stonybrooku. Prev: @allen_ai @CILVRatNYU. On πŸ¦‹ same handle.

ID: 2275933639

linkhttp://harshtrivedi.me/ calendar_today04-01-2014 10:37:29

491 Tweet

625 Followers

909 Following

Cohere Labs (@cohere_labs) 's Twitter Profile Photo

Monday, December 2nd, check out Harsh Trivedi and our Geo Regional Asia group for a talk on "AppWorld: Reliable Evaluation of Interactive Agents in a World of Apps and People." πŸ‘₯

Monday, December 2nd, check out <a href="/harsh3vedi/">Harsh Trivedi</a> and our Geo Regional Asia group for a talk on "AppWorld: Reliable Evaluation of Interactive Agents in a World of Apps and People." πŸ‘₯
Harsh Trivedi (@harsh3vedi) 's Twitter Profile Photo

🚨 Happening today in 5 hours at Cohere For AI! πŸ‘‰ Consider joining, especially, if you are interested in agentic code generation, tool use, digital automation, and careful environment & benchmark creation for language agents!

Justin Chih-Yao Chen (@cyjustinchen) 's Twitter Profile Photo

🚨 Reverse Thinking Makes LLMs Stronger Reasoners We can often reason from a problem to a solution and also in reverse to enhance our overall reasoning. RevThink shows that LLMs can also benefit from reverse thinking πŸ‘‰ 13.53% gains + sample efficiency + strong generalization!

🚨 Reverse Thinking Makes LLMs Stronger Reasoners

We can often reason from a problem to a solution and also in reverse to enhance our overall reasoning. RevThink shows that LLMs can also benefit from reverse thinking πŸ‘‰ 13.53% gains + sample efficiency + strong generalization!
Niranjan (@b_niranjan) 's Twitter Profile Photo

Excited to host the wonderful Mohit Bansal this Friday as part of our Distinguished Lecture Series. The broader @AI_SBU community, the #NLProc, and CV groups at Stony Brook University Dept. of Computer Science are looking forward to this. p.s. There will be no remote options for the talk unfortunately.

Harsh Trivedi (@harsh3vedi) 's Twitter Profile Photo

❓Is there any Python lib that can trace LLM calls' input/output + cost locally? It needs to work w/ many LLM provider & orchestrator libs w/o any code change πŸ‘‰I know wandb/weave supports it by patching LLM libs, but I want a 100% local solution that doesn't require an API key

Francesco Orabona (@bremen79) 's Twitter Profile Photo

🚨 4th edition of the KAUST Rising Stars in AI Symposium Apply here: kaust.edu.sa/en/news/rising… We'll select the best PhD students, postdocs, early career faculty and industry researchers in AI to present their work at KAUST Deadline: December 18 Please share it widely!

🚨 4th edition of the KAUST Rising Stars in AI Symposium

Apply here: kaust.edu.sa/en/news/rising…

We'll select the best PhD students, postdocs, early career faculty and industry researchers in AI to present their work at KAUST

Deadline: December 18

Please share it widely!
Alexandre Lacoste (@alex_lacoste_) 's Twitter Profile Photo

🧡-1 We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.

🧡-1
We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.
Elias Stengel-Eskin (on the faculty job market) (@eliaseskin) 's Twitter Profile Photo

🚨 I am on the faculty job market this year 🚨 I will be presenting at #NeurIPS2024 and am happy to chat in-person or digitally! I work on developing AI agents that can collaborate and communicate robustly with us and each other. My work covers 3 key problemsπŸ‘‡ 1⃣ Multi-agent +

🚨 I am on the faculty job market this year 🚨
I will be presenting at #NeurIPS2024 and am happy to chat in-person or digitally!

I work on developing AI agents that can collaborate and communicate robustly with us and each other.
My work covers 3 key problemsπŸ‘‡

1⃣ Multi-agent +
Niloofar (on faculty job market!) (@niloofar_mire) 's Twitter Profile Photo

I'm on the faculty market and at #NeurIPS!πŸ‘©β€πŸ« homes.cs.washington.edu/~niloofar/ I work on privacy, memorization, and emerging challenges in data use for AI. Privacy isn't about PII removal but about controlling the flow of information contextually, & LLMs are still really bad at this!

I'm on the faculty market and at #NeurIPS!πŸ‘©β€πŸ«
homes.cs.washington.edu/~niloofar/

I work on privacy, memorization, and emerging challenges in data use for AI.

Privacy isn't about PII removal but about controlling the flow of information contextually, &amp; LLMs are still really bad at this!
Hao Zhu 朱昊 (@_hao_zhu) 's Twitter Profile Photo

I always believe speech will be the default communication channel between humans and AI agents b/c talking is more efficient, and can convey way more information than text along. Can current audio LMs unlock this potential? To study this, we are launching a new platform Talk

I always believe speech will be the default communication channel between humans and AI agents b/c talking is more efficient, and can convey way more information than text along. Can current audio LMs unlock this potential? 

To study this, we are launching a new platform Talk
Ruiqi Zhong (@zhongruiqi) 's Twitter Profile Photo

I'm on the academic job market! I build AI systems that assist humans in complicated tasks (e.g. pattern discovery/automate software development), and focus on cases when their outputs are hard-to-explain or evaluate. I'll be at NeurIPS'24 from 12/10-12/15. Happy to catch up!

Kareem Ahmed (@kareemyousrii) 's Twitter Profile Photo

Excited to give an oral presentation of our work "Controllable Generation via Locally Constrained Resampling" @ #NeurIPS2024 SafeGenAI TL;DR We fix greedy constrained decoding using an ad hoc LLM approximation that we tractably condition on the constraint and reweighing samples

Peter Jansen ( @peterjansen-ai.bsky.social ) (@peterjansen_ai) 's Twitter Profile Photo

There are still a few days left to submit to the AI & Scientific Discovery Workshop at NAACL HLT 2025 ! Both archival and non-archival (i.e. submitted or published) works that you'd like to present to a highly interested audience welcome. ai-and-scientific-discovery.github.io