Ching-An Cheng (Hiring 2025 intern) (@chinganc_rl)'s Twitter Profile
Ching-An Cheng (Hiring 2025 intern)

@chinganc_rl

Principal Researcher at @MSFTResearch, working on usable theory and algorithms for Reinforcement Learning, Generative Optimization, and Robotics.

ID: 1234013958056509445

Website: http://www.chinganc.com · Joined: 01-03-2020 07:14:01

106 Tweets

1.1K Followers

99 Following

Jiao Sun (@sunjiao123sun_)'s Twitter Profile Photo


Mitigating racial bias from LLMs is a lot easier than removing it from humans! 

Can’t believe this happened at the best AI conference, NeurIPS Conference.

We have ethical reviews for authors, but missed it for invited speakers? 😡
Ching-An Cheng (Hiring 2025 intern) (@chinganc_rl)'s Twitter Profile Photo


#NeurIPS2024 Super fun talking to tons of people yesterday. Loved seeing people get genuinely surprised and laugh. Non-stop 3 hrs of talking. Finally the poster session is over and I can take a break :). Looking forward to seeing new research inspired by #Trace.

Great job
Andrey Kolobov (@andrey__kolobov)'s Twitter Profile Photo

I'm hiring researchers for my physically embodied AI & robotics team at MSR! 🤖👇 jobs.careers.microsoft.com/us/en/job/1778… Physically embodied agents, both in the humanoid robot form and beyond, are the new computational platform of tomorrow. As with personal computers many decades ago, these

Microsoft Research (@msftresearch)'s Twitter Profile Photo

Announcing AutoGen 0.4, a fully reimagined library for building advanced agentic AI systems, developed to improve code quality and robustness. Its asynchronous, event-driven architecture is designed to support dynamic, scalable workflows. Learn more: msft.it/6012ohgli

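As a rough illustration of what that asynchronous design looks like from user code, here is a minimal sketch, assuming the `autogen-agentchat` and `autogen-ext` packages and an OpenAI-backed model client; the exact class and method names should be checked against the AutoGen 0.4 docs.

```python
# Minimal, hypothetical sketch of an async AutoGen 0.4-style agent.
# Assumes `autogen-agentchat` and `autogen-ext[openai]` are installed and
# OPENAI_API_KEY is set; names/signatures may differ across releases.
import asyncio

from autogen_agentchat.agents import AssistantAgent
from autogen_ext.models.openai import OpenAIChatCompletionClient


async def main() -> None:
    # The model client is passed in explicitly, reflecting the modular 0.4 design.
    model_client = OpenAIChatCompletionClient(model="gpt-4o")
    agent = AssistantAgent(name="assistant", model_client=model_client)

    # `run` is a coroutine: the stack is asynchronous and event-driven,
    # so agent steps can be awaited and composed into larger workflows.
    result = await agent.run(task="Summarize why async agents scale better.")
    print(result.messages[-1].content)


asyncio.run(main())
```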
RL_Conference (@rl_conference)'s Twitter Profile Photo

The RLC accepted workshops list is out (link in next tweet)! Programmatic RL, Causal RL, RL and videogames, Inductive biases and RL, and returning from last year: RL beyond rewards, Finding the frame, and RL in practice!

Shao-Hua Sun (@shaohua0116)'s Twitter Profile Photo

Our ICML & RLC workshops welcome contributions using programmatic representations as policies, reward functions, skill libraries, task generators, environment models, etc., to improve interpretability, generalization, efficiency, & safety in agent learning & RL! Please retweet 🙏

Ching-An Cheng (Hiring 2025 intern) (@chinganc_rl)'s Twitter Profile Photo

Check out this new optimization framework (github.com/datarobot/syftr) by #DataRobot that can automatically search for "Pareto-optimal" solutions for agentic workflows. It's built on our LLM generative optimization framework #Trace. Excited to see more applications of #Trace! 😎
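For readers unfamiliar with #Trace, the sketch below illustrates the style of generative optimization it enables: trainable Python values, traced computation, and LLM-driven updates from text feedback. It assumes the `opto` package from github.com/microsoft/Trace; the specific names (`node`, `OptoPrime`, `zero_feedback`, `backward`, `step`) are best-effort assumptions, not a verified API, so consult the repo for the actual interface.

```python
# Hypothetical sketch of Trace-style generative optimization.
# Assumes the `opto` package (github.com/microsoft/Trace); names and
# signatures below are assumptions for illustration only.
from opto.trace import node
from opto.optimizers import OptoPrime

# Wrap a Python value as a trainable node; operations on nodes are traced
# into a computation graph, analogous to autograd but over rich objects.
x = node(-1.0, trainable=True)
y = x ** 2

# Instead of numeric gradients, the optimizer consumes text feedback and
# asks an LLM to propose an improved value for the trainable parameters.
optimizer = OptoPrime([x])
optimizer.zero_feedback()
optimizer.backward(y, "The output should be as small as possible.")
optimizer.step()

print(x.data)  # the LLM-proposed update to x
```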

Allen Nie (🇺🇦☮️) (@allen_a_nie)'s Twitter Profile Photo

Decision-making with LLM can be studied with RL! Can an agent solve a task with text feedback (OS terminal, compiler, a person) efficiently? How can we understand the difficulty? We propose a new notion of learning complexity to study learning with language feedback only. 🧵👇

Ching-An Cheng (Hiring 2025 intern) (@chinganc_rl)'s Twitter Profile Photo

Super excited about this work by our former intern Wanqiao Xu. We show that Learning from Language Feedback (LLF) with LLMs can be formally studied with provable no-regret learning algorithms. This result builds a foundation toward new theories for LLM learning and optimization.