Harshit Sikchi (@harshit_sikchi) Twitter Tweets • TwiCopy

Harshit Sikchi

@harshit_sikchi

I study Reinforcement Learning; Currently PhD candidate @UTCompSci. Previously FAIR Paris @AIatMeta, @CMU_Robotics @NVIDIAAI @UberATG.

ID: 1016909057964118021

Link: https://hari-sikchi.github.io/

Joined: 11-07-2018 04:56:27

353 Tweets

1.1K Followers

1.1K Following

Harshit Sikchi

@harshit_sikchi

a month ago

Exploration is crucial for the next breakthrough in reasoning; Behavioral Foundation Models may change the way we do low level control; Come to discuss about all these topics and more @RL_Conference!

Likes: 34 | Replies: 0 | Retweets: 6

Harshit Sikchi

@harshit_sikchi

a month ago

I will be @RL_Conference presenting the below work on Fast Adaptation on Wednesday August 6 at 10:20 am and some works on unsupervised RL and imitation at RLBrew workshop on August 5.

Likes: 107 | Replies: 3 | Retweets: 10

Harshit Sikchi

@harshit_sikchi

a month ago

RLC 2025 begins soon!

Likes: 14 | Replies: 0 | Retweets: 0

Giorgia Ramponi

Giving two talks tomorrow at @RL_Conference on Imitation Learning and IRL in multi-agent systems. See you at 11am at RL Beyond Rewards Workshop and at 4pm at the Causal Reinforcement Learning workshop! I'll also present our recent work on combining Behavioral Cloning and PbRL at RL Beyond Rewards Workshop.

Likes: 48 | Replies: 2 | Retweets: 4

Greg Brockman

@gdb

a month ago

Just released gpt-oss: state-of-the-art open-weight language models that deliver strong real-world performance. Runs locally on a laptop!

Likes: 6.6K | Replies: 159 | Retweets: 489

Pablo Samuel Castro

@pcastr

a month ago

Super thought-provoking talk by Dale Schuurmans @RL_Conference on LLMs and computation, and why value-based RL doesn't (or can't?) work for post-training.

Likes: 337 | Replies: 10 | Retweets: 30

Harshit Sikchi

@harshit_sikchi

a month ago

when you know a conference is lit! @RL_Conference

Likes: 45 | Replies: 0 | Retweets: 4

Harshit Sikchi

@harshit_sikchi

24 days ago

It has been a good conference @RL_Conference; Below: RL Beyond Rewards Workshop social, Edmonton flame, a great talk. Conference detox needed now

Likes: 32 | Replies: 1 | Retweets: 1

Sebastien Bubeck

@sebastienbubeck

14 days ago

Claim: gpt-5-pro can prove new interesting mathematics. Proof: I took a convex optimization paper with a clean open problem in it and asked gpt-5-pro to work on it. It proved a better bound than what is in the paper, and I checked the proof it's correct. Details below.

Likes: 4.4K | Replies: 150 | Retweets: 659

Michele Wang

@michelelwang

4 days ago

our team at openai is hiring technical staff to build frontier evals for finance. If you're passionate about measuring real-world capabilities, have a love/hate relationship with Excel, or are an ex-banker/ex-investor with technical skills, please reach out!

Likes: 1.1K | Replies: 38 | Retweets: 47
