Harshit Sikchi (@harshit_sikchi) 's Twitter Profile
Harshit Sikchi

@harshit_sikchi

I study Reinforcement Learning; Currently PhD candidate @UTCompSci. Previously FAIR Paris @AIatMeta, @CMU_Robotics @NVIDIAAI @UberATG.

ID: 1016909057964118021

linkhttps://hari-sikchi.github.io/ calendar_today11-07-2018 04:56:27

353 Tweet

1,1K Followers

1,1K Following

Harshit Sikchi (@harshit_sikchi) 's Twitter Profile Photo

Exploration is crucial for the next breakthrough in reasoning; Behavioral Foundation Models may change the way we do low level control; Come to discuss about all these topics and more RL_Conference !

Harshit Sikchi (@harshit_sikchi) 's Twitter Profile Photo

I will be RL_Conference presenting the below work on Fast Adaptation on Wednesday August 6 at 10:20 am and some works on unsupervised RL and imitation at RLBrew workshop on August 5.

Giorgia Ramponi (@gio_ramponi) 's Twitter Profile Photo

Giving two talks tomorrow at RL_Conference on Imitation Learning and IRL in multi-agent systems See you at 11am at RL Beyond Rewards Workshop and at 4pm at the Causal Reinforcement Learning workshop! I'll also present our recent work on combining Behavioral Cloning and PbRL at RL Beyond Rewards Workshop

Greg Brockman (@gdb) 's Twitter Profile Photo

Just released gpt-oss: state-of-the-art open-weight language models that deliver strong real-world performance. Runs locally on a laptop!

Just released gpt-oss: state-of-the-art open-weight language models that deliver strong real-world performance. Runs locally on a laptop!
Pablo Samuel Castro (@pcastr) 's Twitter Profile Photo

Super thought-provoking talk by Dale Schuurmans RL_Conference on LLMs and computation, and why value-based RL doesn't (or can't?) work for post-training.

Super thought-provoking talk by Dale Schuurmans <a href="/RL_Conference/">RL_Conference</a> on LLMs and computation, and why value-based RL doesn't (or can't?) work for post-training.
Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

Claim: gpt-5-pro can prove new interesting mathematics. Proof: I took a convex optimization paper with a clean open problem in it and asked gpt-5-pro to work on it. It proved a better bound than what is in the paper, and I checked the proof it's correct. Details below.

Claim: gpt-5-pro can prove new interesting mathematics.

Proof: I took a convex optimization paper with a clean open problem in it and asked gpt-5-pro to work on it. It proved a better bound than what is in the paper, and I checked the proof it's correct.

Details below.
Michele Wang (@michelelwang) 's Twitter Profile Photo

our team at openai is hiring technical staff to build frontier evals for finance. If you're passionate about measuring real-world capabilities, have a love/hate relationship with Excel, or are an ex-banker/ex-investor with technical skills, please reach out!