Kyunghyun Cho(@kchonyc) 's Twitter Profileg
Kyunghyun Cho

@kchonyc

a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).

ID:48642190

linkhttps://www.kyunghyuncho.me calendar_today19-06-2009 07:35:16

12,4K Tweets

60,9K Followers

2,2K Following

Jason Weston(@jaseweston) 's Twitter Profile Photo

🚨 Iterative Reasoning Preference Optimization 🚨
- Iterative algorithm for reasoning tasks: generate pairs & apply DPO+NLL
- Improves accuracy over iterations on GSM8K, MATH, ARC & beats baselines
E.g. Llama2-70B GSM8K: 55.6%->81.6% (88.7% maj32)
arxiv.org/abs/2404.19733
🧵(1/5)

🚨 Iterative Reasoning Preference Optimization 🚨 - Iterative algorithm for reasoning tasks: generate pairs & apply DPO+NLL - Improves accuracy over iterations on GSM8K, MATH, ARC & beats baselines E.g. Llama2-70B GSM8K: 55.6%->81.6% (88.7% maj32) arxiv.org/abs/2404.19733 🧵(1/5)
account_circle
Kyunghyun Cho(@kchonyc) 's Twitter Profile Photo

a new blog post, because it is Saturday.

<Fixing DPO but I have a dinner reservation …> kyunghyuncho.me/a-proper-prefe…

account_circle
CAI2R(@cai2r) 's Twitter Profile Photo

Congrats to Artie Shen on being honored by New York University w/ an outstanding dissertation award in the public health & allied health category for 'Toward Explainable for Medical Image Analysis.'

🧵
1/10

Congrats to @ArtieShen on being honored by @nyuniversity w/ an outstanding dissertation award in the public health & allied health category for 'Toward Explainable #DeepLearning for Medical Image Analysis.' 🧵 1/10
account_circle
Kyunghyun Cho(@kchonyc) 's Twitter Profile Photo

once Yann LeCun told me (heavily paraphrased), it's not F=ma but \min (F-ma)^2. i didn't realize its importance, but it is perhaps the most enlightning perspective i've ever heard.

account_circle
Brenden Lake(@LakeBrenden) 's Twitter Profile Photo

I got tenure! It was fitting that I got to celebrate with the lab right after the news. Working together for the last 6.5 years has been a blast.

I got tenure! It was fitting that I got to celebrate with the lab right after the news. Working together for the last 6.5 years has been a blast.
account_circle
Kyunghyun Cho(@kchonyc) 's Twitter Profile Photo

your leadership doesn't want you to do it: have you tried Pubmed-QA zero-shot eval without questions but only with abstracts?

account_circle
Kyunghyun Cho(@kchonyc) 's Twitter Profile Photo

arxiv.org/abs/2404.08819 a nice study by Merrill, Petty & Sabharwal.

it looks like i won't have to wait too much longer for the reinvention of LSTM/GRU by LLM bros.

account_circle
Yi Tay(@YiTayML) 's Twitter Profile Photo

It's been a wild ride. Just 20 of us, burning through thousands of H100s over the past months, we're glad to finally share this with the world! 💪

One of the goals we’ve had when starting Reka was to build cool innovative models at the frontier. Reaching GPT-4/Opus level was a

account_circle
Kyunghyun Cho(@kchonyc) 's Twitter Profile Photo

i was suddenly reminded of this article from 2018.

'.. statistical physicists were primed to see power laws everywhere .. there’s a “power law religion.”'

'“We would even squint at the computer screen from an angle to get a better idea if a curve was straight or not,” recalled

i was suddenly reminded of this article from 2018. '.. statistical physicists were primed to see power laws everywhere .. there’s a “power law religion.”' '“We would even squint at the computer screen from an angle to get a better idea if a curve was straight or not,” recalled
account_circle
Kyunghyun Cho(@kchonyc) 's Twitter Profile Photo

oh i think i figured out:

1. create a new repository on github
2. clone the repository in Lightning AI ⚡️ studio
3. press 'Open' to work directly on the cloned repo

oh i think i figured out: 1. create a new repository on github 2. clone the repository in @LightningAI studio 3. press 'Open' to work directly on the cloned repo
account_circle