Li Ding (@li_ding_) Twitter Tweets • TwiCopy

Li Ding

2 years ago

Good news: QDHF will be presented as a Spotlight at the NeurIPS 2023 ALOE workshop: sites.google.com/view/aloe2023. See you in New Orleans and let's chat!

Great work led by Ryan Bahlous-Boldi! QD maintains diversity with pre-defined diversity metrics (or learned, e.g., QDHF!), but is it necessary? This paper proposes an alternative that uses MMO to solve deceptive RL tasks, and outperforms ME on QD-score w/o even optimizing towards it!

Scott Niekum

@scottniekum

2 years ago

Thrilled to announce the first annual Reinforcement Learning Conference (@RL_Conference), which will be held at UMass Amherst August 9-12! RLC is the first strongly peer-reviewed RL venue with proceedings, and our call for papers is now available: rl-conference.cc. 🧵

RL_Conference

@rl_conference

2 years ago

Thrilled to announce the first annual Reinforcement Learning Conference (@RL_Conference), which will be held at UMass Amherst August 9-12! RLC is the first strongly peer-reviewed RL venue with proceedings, and our call for papers is now available: rl-conference.cc.

Li Ding

@li_ding_

2 years ago

Had a great time yesterday presenting our QDHF work at #NeurIPS2023 ALOE Workshop! Fantastic workshop with great vibes. Glad to meet so many people in open-endedness, QD, RL, especially my amazing collaborators Jenny Zhang and Jeff Clune! (Thanks to Ryan Bahlous-Boldi for this photo!)

Li Ding

@li_ding_

2 years ago

🚀Thrilled to release the QDHF tutorial in pyribs! Big shoutout to Bryon Tjanaka for his meticulous editing and insightful feedback👏. Dive into the tutorial to explore how QDHF enhances GenAI models with diversified, high-quality responses and apply these insights to your projects!

Scott Niekum

@scottniekum

a year ago

Preferences in RLHF often come from many people with differing values. Ryan's work explores how to infer a set of representative reward functions that captures that diversity, so that we can better reason about risk and fairness in these settings.

Li Ding

@li_ding_

a year ago

POPL enhances the safety and fairness of RLHF by aligning agents and LLMs with diverse human values. It effectively addresses hidden contexts in preferences, ensuring risk-sensitive alignment without additional labeling. Led by Ryan Bahlous-Boldi, w/ Lee Spector and Scott Niekum.

Li Ding

@li_ding_

a year ago

Thrilled to announce I’ve successfully defended my PhD! 🎓 Deeply grateful to my advisor Lee Spector, my committee Scott Niekum, Subhransu Maji, Jeff Clune, and all collaborators, friends, and family. Milestone achieved, excited for the next chapter!
