HapticAI (@haptic_ai)'s Twitter Profile
HapticAI

@haptic_ai

Your LLM’s personal trainer. Get with the program, nerds.

ID: 1770479406781460480

Link: https://www.hapticai.dev/

Date: 20-03-2024 15:56:22

232 Tweets

1.1K Followers

11 Following

Nathan Lambert (@natolambert):

The Tulu 2.5 work is still an underrated RLHF paper. Lots of industry interest, not that much immediate academic uptake. Really great empirical study on how DPO and PPO work across datasets and implementations. Core to a lot of our efforts on Tulu 3.
